CS 771/871 Operating Systems
Distributed File Systems
File Systems and Operating Systems
QUESTION: Which of these layers belongs in the OS?
Different File System Architectures
- Embed all layers in the kernel (mainframe approach)
- Embed the bottom two layers in the kernel (UNIX approach)
- Only the bottom layer in the kernel; other layers provided by a library
  Problem: no one owns the logical disk, so user programs must be trusted not to overwrite other users' data
- Only the bottom layer in the kernel; a server provides the other layers
  - servers own the logical disk
  - all requests must go through the server
  - different servers can provide different file system architectures and semantics
Design Issues
- Centralized vs Distributed Data
  - The common tradeoffs exist
  - Consistency of global state is the major difficulty in distributed approaches
  - If distributed: duplication or division of the data
- Naming (Directory Services)
  - Tree
  - Directed Acyclic Graph (DAG)
  - Graph
  - Forest
  - Symbolic Links (file system pointers)
- File Sharing
  - UNIX semantics (a read after a write returns the data just written): strict time ordering. Process management (e.g. pipes) assumes passing of the environment, including open file pointers
  - Session semantics (updates only visible at the end of a session): can have race conditions
  - Immutable files (write once, like CD-ROMs)
  - Transaction oriented (semantics guarantee serializability)
  - Server stateless
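The difference between UNIX and session semantics can be made concrete with a small sketch. The classes below are illustrative (not from any real file system): one shared copy gives UNIX semantics, while per-open snapshots published on close give session semantics, including the race condition noted above.

```python
# Sketch (illustrative names): UNIX vs session semantics as two toy
# in-memory file stores.

class UnixSemanticsFile:
    """Single shared copy: a read after a write returns the data just written."""
    def __init__(self):
        self.data = b""

    def write(self, data):
        self.data = data          # immediately visible to every client

    def read(self):
        return self.data


class SessionSemanticsFile:
    """Each open() gets a private copy; writes are published only on close()."""
    def __init__(self):
        self.data = b""

    def open(self):
        return _Session(self)

class _Session:
    def __init__(self, f):
        self.f = f
        self.copy = f.data        # snapshot taken at open time

    def write(self, data):
        self.copy = data          # private until close

    def read(self):
        return self.copy

    def close(self):
        self.f.data = self.copy   # last close wins -> race condition


u = UnixSemanticsFile()
u.write(b"new")
assert u.read() == b"new"         # strict time ordering

s = SessionSemanticsFile()
a, b = s.open(), s.open()
a.write(b"from A")
assert b.read() == b""            # A's update not yet visible to B
a.close()
b.close()                         # B's unmodified copy overwrites A's write
assert s.data == b""              # the race condition the notes warn about
```

The final assertion shows why session semantics is ambiguous under sharing: whichever session closes last silently wins.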
Observed File Usage Patterns
- Small files (less than 10K)
- Reading more common than writing
- Access mostly sequential; random access rare
- Most files short lived
- Sharing rare
- A process uses only a few files
- Distinct file classes
If these patterns prove universal in distributed systems, they can be exploited (see the later discussion on new hardware implications).
Comparison of Stateless and Stateful Servers
Advantages of Stateless Servers    | Advantages of Stateful Servers
Fault tolerance                    | Shorter request messages
No Open/Close needed (less set up) | Better performance with buffering
No tables needed for state         | Readahead possible
No limits on "open" files          | Idempotency easier
Client crashes no problem          | File locking possible
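A minimal sketch can show where these tradeoffs come from. All names here are illustrative (loosely NFS-style, not any real API): a stateless request names the file, offset, and count on every call, so it is longer but safely repeatable; a stateful server keeps an open-file table, so requests are shorter but the table is lost on a crash.

```python
# Sketch (illustrative names): stateless vs stateful read requests.

FILES = {"/etc/motd": b"welcome to CS 771/871\n"}

# --- Stateless server: every request carries the full state (path,
# offset, count). Repeating a lost request is harmless (idempotent),
# and a server reboot loses nothing.
def stateless_read(path, offset, count):
    return FILES[path][offset:offset + count]

# --- Stateful server: open() builds a table entry; read() then sends
# only a short handle, and the server remembers the current offset.
open_table = {}      # handle -> [path, offset]; lost if the server crashes
next_handle = 0

def stateful_open(path):
    global next_handle
    handle = next_handle
    next_handle += 1
    open_table[handle] = [path, 0]
    return handle

def stateful_read(handle, count):
    path, offset = open_table[handle]
    data = FILES[path][offset:offset + count]
    open_table[handle][1] += len(data)   # server-side state advances
    return data

# Repeating a stateless read returns the same bytes...
assert stateless_read("/etc/motd", 0, 7) == stateless_read("/etc/motd", 0, 7)

# ...but naively repeating a stateful read does not: the offset moved.
h = stateful_open("/etc/motd")
first, second = stateful_read(h, 7), stateful_read(h, 7)
assert first != second
```

The last assertion is the "fault tolerance" and "idempotency" rows of the table in miniature: the stateless request can be retried blindly, while the stateful one needs extra machinery (e.g. sequence numbers in the table) to detect duplicates.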
Caching In Uniprocessor
- Cache in user process
  - Manage a pool of buffers; try there first
  - Tailor to usage patterns
  - Avoid some system calls
- Cache in kernel (UNIX)
  - Can manage the pool of buffers over all processes (better utilization of memory)
  - Can share buffers between processes (UNIX semantics)
- Cache server: a user process which caches for all
  - Simpler programming than the first option
  - Can tailor
  - More overhead
- No cache
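"Manage a pool of buffers, try there first" can be sketched as a tiny fixed-size block cache with LRU eviction. This is a toy model (the disk is a dictionary, and the names are invented for illustration), not any real buffer cache implementation.

```python
# Sketch (illustrative): a buffer pool consulted before the "disk".

from collections import OrderedDict

DISK = {n: f"block-{n}".encode() for n in range(100)}  # fake disk
disk_reads = 0                                         # count of cache misses

class BufferCache:
    def __init__(self, nbuffers):
        self.nbuffers = nbuffers
        self.pool = OrderedDict()          # block number -> buffer contents

    def read_block(self, n):
        global disk_reads
        if n in self.pool:                 # try the pool first: cache hit
            self.pool.move_to_end(n)       # mark as most recently used
            return self.pool[n]
        disk_reads += 1                    # miss: go to the "disk"
        if len(self.pool) >= self.nbuffers:
            self.pool.popitem(last=False)  # evict the least recently used
        self.pool[n] = DISK[n]
        return self.pool[n]

cache = BufferCache(nbuffers=2)
cache.read_block(0)     # miss
cache.read_block(1)     # miss
cache.read_block(0)     # hit: no disk access
cache.read_block(2)     # miss; evicts block 1 (least recently used)
assert disk_reads == 3
```

Whether this pool lives in the user process, the kernel, or a cache-server process is exactly the design choice listed above; the lookup logic is the same in each case.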
Caching In Distributed Server
The above caching approaches can be done on either the client or the server.
- Server caches avoid disk access but incur network overhead.
- Client caches lead to cache consistency problems.
Client Cache Write Policy | Consequences
Write Through             | Works, but no help for reads
Delayed Write             | Ambiguous semantics
Write on Close            | Session semantics
Centralized Control       | UNIX semantics, but centralized problems
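The first two rows of the table can be contrasted in a short sketch. The classes and the shared `server` dictionary are illustrative stand-ins, not a real protocol: write-through pushes every write to the server immediately, while delayed write batches updates and leaves the server copy stale in between.

```python
# Sketch (illustrative names): write-through vs delayed-write client caches.

server = {}                        # the authoritative server copy

class WriteThroughCache:
    def __init__(self):
        self.local = {}
    def write(self, name, data):
        self.local[name] = data
        server[name] = data        # every write also goes to the server

class DelayedWriteCache:
    def __init__(self):
        self.local = {}
        self.dirty = set()
    def write(self, name, data):
        self.local[name] = data    # server copy is now stale
        self.dirty.add(name)
    def flush(self):               # e.g. periodically, or at file close
        for name in self.dirty:
            server[name] = self.local[name]
        self.dirty.clear()

wt = WriteThroughCache()
wt.write("a", b"1")
assert server["a"] == b"1"         # immediately visible to other clients

dw = DelayedWriteCache()
dw.write("b", b"2")
assert "b" not in server           # ambiguous: other clients see old data
dw.flush()
assert server["b"] == b"2"
```

Calling `flush()` only when the file is closed turns delayed write into the "write on close" row, i.e. session semantics.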
Replication
- Increase reliability (data and processor redundancy)
- Split the workload
Replication can be:
- Explicit (controlled by the programmer): the directory may permit multiple file handles to be associated with one name
- Lazy replication: copies made by the system in slack time (like system backups)
- Group communication: requests broadcast to all processors holding a copy (usually done just for writes)
Primary Copy Replication
- Writes are sent to the primary server
- The primary writes its intention to stable storage
- The primary orders the secondary servers to update
- If the primary crashes, it rereads stable storage and continues the update
Question: What to do if a secondary crashes?
Notice the similarity to commit protocols.
No updates are possible while the primary is down (but reads from the secondaries still work).
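The steps above can be sketched as follows. This is a toy model with invented names: a list stands in for stable storage, and `recover()` shows why logging the intention first lets a restarted primary finish an interrupted update.

```python
# Sketch (illustrative names): primary-copy replication with an
# intention log on stable storage.

class Secondary:
    def __init__(self):
        self.store = {}
    def update(self, key, value):
        self.store[key] = value

class Primary:
    def __init__(self, secondaries):
        self.stable_log = []       # stands in for stable storage
        self.store = {}
        self.secondaries = secondaries

    def write(self, key, value):
        self.stable_log.append((key, value))   # 1. intention to stable storage
        self.store[key] = value
        for s in self.secondaries:             # 2. order secondaries to update
            s.update(key, value)

    def recover(self):
        """After a crash: reread stable storage and continue the updates."""
        for key, value in self.stable_log:
            self.store[key] = value
            for s in self.secondaries:
                s.update(key, value)

secs = [Secondary(), Secondary()]
p = Primary(secs)
p.write("x", 1)
assert all(s.store["x"] == 1 for s in secs)

# Simulate a crash after logging but before propagating to secondaries:
p.stable_log.append(("y", 2))      # the intention was logged...
p.recover()                        # ...so recovery completes the update
assert all(s.store["y"] == 2 for s in secs)
```

Replaying the whole log on recovery is safe here because applying the same update twice is idempotent, the same property discussed for stateless servers above.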
Voting Algorithms
- A majority must agree to an update
- Updates are assigned unique version numbers
- Reads first request the version number
- A read is OK if a majority returns the same version number
Gifford's Algorithm
- Read quorum Nr: at least Nr servers must agree on the version
- Write quorum Nw: at least Nw servers must agree to update to the new version
- Nr + Nw > N
Consider the following cases:
- Nr = 1, Nw = N: all servers must agree to a write; can read from any one
- Nw = 1, Nr = N: no reads until all servers are (eventually) updated
- Nr = N/2, Nw > N/2: just the majority algorithm above
- Nw = N/2, Nr > N/2: again must wait for the eventual update
- Nr small, Nw nearly N: best when reads are more common
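A small sketch shows why Nr + Nw > N works: any read quorum must overlap any write quorum in at least one server, so the highest version number seen by a read is always current. The code below is illustrative (invented names, in-memory "servers"), not a full protocol with locking or partial-write handling.

```python
# Sketch (illustrative): Gifford-style weighted voting with version numbers.

N, NR, NW = 5, 3, 3                # 3 + 3 > 5, so quorums must intersect

servers = [{"version": 0, "data": None} for _ in range(N)]

def write(data, quorum):
    assert len(quorum) >= NW
    # New version exceeds every version in the write quorum.
    new_version = max(servers[i]["version"] for i in quorum) + 1
    for i in quorum:
        servers[i] = {"version": new_version, "data": data}

def read(quorum):
    assert len(quorum) >= NR
    # Ask each server for its version; the highest version is current.
    best = max((servers[i] for i in quorum), key=lambda s: s["version"])
    return best["data"]

write(b"v1", quorum=[0, 1, 2])     # only servers 0, 1, 2 are updated

# Every read quorum of size 3 includes at least one of {0, 1, 2},
# so every read sees the latest data:
assert read(quorum=[2, 3, 4]) == b"v1"
assert read(quorum=[0, 3, 4]) == b"v1"
```

With NR = 1 and NW = N this degenerates to "write to all, read from any", matching the first case above.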
Exploiting New Hardware: Matching Usage Patterns
Copyright Chris Wild 1996.
For problems or questions regarding this web site, contact [Dr. Wild].
Last updated: October 09, 1996.