Lecture 4
Correctness Concerns
- Asynchronous interaction is difficult to reason about.
- Single-process software is difficult to get correct anyway.
- Because of the variability of interaction, distributed systems seem non-deterministic and failures are harder to reproduce.
- Some advocate a more formal, proof-theoretic approach to correctness.
- But formal proofs are not easy, and they can have bugs too.
Correctness Concerns 2
- State the system invariants (IMPORTANT: inability to do so is often a sign of system incomprehensibility).
- Assume the invariants hold going into an operation.
- Show that the operation preserves the invariants (seems like an induction proof, doesn't it?).
- Guarantee mutually exclusive use of state variables during critical regions, where the invariants may be temporarily inconsistent.
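The steps above can be sketched in code. This is a minimal illustration (the bank-balance invariant and all names are mine, not from the notes): the invariant holds on entry, is temporarily violated inside the critical region, and is restored on exit, with a lock guaranteeing mutual exclusion while it is inconsistent.

```python
import threading

TOTAL = 100
balances = {"a": 60, "b": 40}   # invariant: balances always sum to TOTAL
lock = threading.Lock()

def invariant_holds():
    return balances["a"] + balances["b"] == TOTAL

def transfer(amount):
    with lock:                      # critical region begins
        assert invariant_holds()    # invariant assumed on entry
        balances["a"] -= amount     # invariant temporarily false here
        balances["b"] += amount
        assert invariant_holds()    # operation must restore it on exit

threads = [threading.Thread(target=transfer, args=(1,)) for _ in range(10)]
for t in threads: t.start()
for t in threads: t.join()
assert invariant_holds()
```

Without the lock, a second thread could observe (or modify) the state between the two assignments, exactly when the invariant is false.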
Example 1: Logical Vector Clocks
Consider the assertion about logical vector clocks: for all i, j: Ci[i] >= Cj[i].
It states that a process is always more up to date on its own time than any other process is.
WHY? Because time is monotonically increasing and only a process can increment its own clock. The clocks of other processes are never changed by a process (only remembered and passed on).
Expanding on previous notes about the limitations of logical clocks. Recall that if a -> b then C(a) < C(b). However, if C(a) < C(b) we cannot infer anything about the causal relationship between events a and b: either a -> b or a || b. We only know that not (b -> a).
Vector clocks (see also ISIS), however, can provide a partial order on event times. Given two vector timestamps Ta and Tb, the relation Ta < Tb is defined componentwise, and we can now state:
a -> b iff Ta < Tb
Question: for two different events a and b, can Ta = Tb?
Homework (see also part b): prove that if Ta < Tb then a -> b.
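A minimal vector-clock sketch makes both points concrete: the update rules, the componentwise ordering, and the assertion that each process leads on its own component. Function names here are illustrative, not from the notes.

```python
def tick(clock, i):
    """Process i increments only its own component."""
    clock[i] += 1

def receive(clock, i, msg_clock):
    """Merge a received timestamp componentwise, then tick locally."""
    for k in range(len(clock)):
        clock[k] = max(clock[k], msg_clock[k])
    tick(clock, i)

def less_than(ta, tb):
    """Ta < Tb iff Ta <= Tb componentwise and Ta != Tb; then a -> b."""
    return all(x <= y for x, y in zip(ta, tb)) and ta != tb

c0 = [0, 0]            # vector clock of process P0
c1 = [0, 0]            # vector clock of process P1
tick(c0, 0)            # event a at P0: c0 = [1, 0]
receive(c1, 1, c0[:])  # event b at P1 receives a's timestamp: c1 = [1, 1]

assert less_than([1, 0], c1)              # Ta < Tb, so a -> b
assert not less_than([1, 0], [0, 1])      # incomparable: concurrent events
assert c0[0] >= c1[0] and c1[1] >= c0[1]  # Ci[i] >= Cj[i]: own entry leads
```

The final assertion is exactly the invariant from Example 1: a process is never behind anyone else on its own clock entry.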
Huang's Termination Detection Algorithm
Problem: how to know when all processes have finished a computation (we need a consistent global view of the computation, be it an election, deadlock detection or resolution, token generation, etc.).
A process is either IDLE or ACTIVE in the computation. A computation message is sent to initiate a computation.
DEFINITION: a computation is terminated iff all processes are idle and there are no messages in transit.
There is a controlling agent which initially has weight = 1. Weight is used to coordinate work sent and results received. Let B(DW) be a computation request message sent with weight DW, and C(DW) be an acknowledgement message with weight DW.
Huang's Termination Detection Algorithm 2
- Rule 1: a process having weight W may send a computation message to P as follows:
  - Derive W1 and W2 such that W1 + W2 = W, with W1, W2 > 0
  - Set W := W1
  - Send B(W2) to P
- Rule 2: on receipt of B(DW), a process P having weight W does:
  - W := W + DW
  - If P is idle, P becomes active
- Rule 3: an active process having weight W may become idle by:
  - Sending C(W) to the controlling agent
  - Setting W := 0
- Rule 4: on receiving C(DW), the controlling agent having weight W does:
  - W := W + DW
  - If W = 1, conclude the computation has terminated
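The four rules can be sketched as a single-threaded simulation. This is an illustrative sketch, not a full distributed implementation: message delivery is modeled by a list, weights use exact fractions so that conservation can be checked, and the halving split is just one valid choice of W1 + W2 = W.

```python
from fractions import Fraction

controller = Fraction(1)          # controlling agent starts with weight 1
weights = {}                      # current weight of each active process
in_transit = []                   # B(DW) computation messages in flight

def start(p):                     # controller initiates the computation
    global controller
    dw = controller / 2           # split: W1 + W2 = W, both > 0
    controller -= dw
    in_transit.append((p, dw))

def send(src, dst):               # Rule 1: active process sends work
    dw = weights[src] / 2
    weights[src] -= dw
    in_transit.append((dst, dw))

def deliver():                    # Rule 2: receive B(DW), become active
    p, dw = in_transit.pop(0)
    weights[p] = weights.get(p, Fraction(0)) + dw

def finish(p):                    # Rules 3 and 4: return C(W) to the agent
    global controller
    controller += weights.pop(p)

start("P1"); deliver()            # P1 becomes active with weight 1/2
send("P1", "P2"); deliver()       # P1 splits off 1/4 for P2
finish("P1"); finish("P2")        # both go idle, weights return
assert controller == 1            # W = 1: computation has terminated
```

At every step the controller's weight plus all process and in-transit weights sums to exactly 1, which is the invariant the correctness argument below relies on.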
Correctness of Huang's Termination Detection Algorithm
Let
A : set of weights of all active processes
B : set of weights of all computation messages in transit
C : set of weights of all control messages in transit
Wc : weight of the controlling agent
Then the following invariants hold:
I1: Wc + SUM{over union of A, B and C} = 1 (conservation of weight)
I2: for all W in union of A, B and C: W > 0 (weights are never negative)
------
By I1, Wc = 1 implies SUM{over union of A, B and C} = 0.
By I2, SUM{over union of A, B and C} = 0 implies the union of A, B and C is empty.
A UNION B empty implies termination (no active processes, no computation messages in transit).
If we assume message sending is finite and reliable, then eventually C becomes empty and Wc = 1, thus detecting the termination.
Proof by contradiction: assume that two sites Si and Sj are executing the critical section (CS) concurrently and that Si's request has a smaller timestamp than Sj's (timestamps are totally ordered). Si must have received Sj's request after it made its own request. But Sj can only be in the CS if Si returned a reply to it before Si finished the CS. This is not possible, since Sj's request has lower priority than Si's request.
Homework (part b - see also part a): state the invariants for the mutual exclusion algorithm above.
Processes
On uniprocessors, processes mainly create the illusion of a virtual processor; they are meant to keep computations logically apart.
In distributed systems they are additionally used to create cooperating computations, fault-tolerant computations, and real-time and parallel systems.
Threads
Single address space.
Multiple threads of control, each with:
- program counter
- set of registers
- execution stack
- child threads
- other state info
AKA mini-processes or lightweight processes.
Threads Share Memory
- Can easily share memory objects:
  - open files
  - global variables
  - buffers
  - signals
  - timers
  - child processes
  - semaphores
  - accounting
- Can destroy each other's state information.
Threads can execute in parallel on appropriate shared-memory multiprocessors (such as high-end workstations).
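A small sketch of the sharing (the shared buffer and names are mine): two threads append to the same global object directly, with no copying between them; the lock is what prevents them from destroying each other's state.

```python
import threading

buffer = []                       # shared by every thread in the process
lock = threading.Lock()

def producer(items):
    for x in items:
        with lock:                # unsynchronized writes could corrupt state
            buffer.append(x)

t1 = threading.Thread(target=producer, args=(range(100),))
t2 = threading.Thread(target=producer, args=(range(100, 200),))
t1.start(); t2.start()
t1.join(); t2.join()
assert len(buffer) == 200         # all writes from both threads are visible
```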
Server Applications
In the client-server model:
- the server receives requests from many processes
- requests are usually independent and atomic
- the computation to satisfy a request may take considerable time
- the requested resource is probably shared (file, printer, database, web site)
- some applications may require synchronization between requests (e.g. distributed simulation)
Server Implementations
- Single process, single thread:
  - no parallelism
  - blocking system calls
  - serializes requests
  - inefficient
- Single process, multiple threads:
  - possible parallelism
  - threads block on system calls
  - interleaved requests
- Finite-state machine: simulate multiple computations using state tables
  - non-blocking
  - not truly parallel
  - complex
Consider the analogy to a dentist's office:
- one dentist, one patient
- many patients, many dentists (one per patient)
- many patients, one dentist
Using Threads: Organizational Models
- Dispatcher with interchangeable workers
- Peer/team
- Pipeline (assembly line): specialized workers, the process broken down into worker tasks, producer/consumer
- Mixtures of the above
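The dispatcher/worker organization can be sketched as follows (a minimal illustration with names of my choosing): a dispatcher places requests on a shared queue and interchangeable worker threads pull them off, so any idle worker can serve the next request.

```python
import queue
import threading

requests = queue.Queue()          # dispatcher -> workers
results = queue.Queue()           # workers -> collected replies

def worker():
    while True:
        req = requests.get()
        if req is None:           # sentinel: no more work
            break
        results.put(req * 2)      # stand-in for real request processing

workers = [threading.Thread(target=worker) for _ in range(3)]
for w in workers: w.start()

for r in range(10):               # the dispatcher role
    requests.put(r)
for _ in workers:                 # one sentinel per worker
    requests.put(None)
for w in workers: w.join()

assert sorted(results.queue) == [2 * r for r in range(10)]
```

The pipeline model differs only in topology: each worker's output queue is the next worker's input queue.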
QUESTION: Do threads make software easier to write?
Design Issues/Threads
- Static vs. dynamic creation
- Mutual exclusion:
  - binary semaphore
  - trylock (non-blocking)
  - condition variable
- Global variables
- Scheduling
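The three mutual-exclusion primitives listed above can be demonstrated with Python's threading module standing in for a generic thread package (a sketch, not any particular system's API):

```python
import threading

lock = threading.Lock()

# Binary semaphore / mutex: acquire blocks until the lock is free.
with lock:
    pass                          # critical section

# Trylock: a non-blocking acquire returns immediately with success/failure.
got = lock.acquire(blocking=False)
if got:
    lock.release()

# Condition variable: sleep until a predicate on shared state becomes true.
cond = threading.Condition()
ready = False

def setter():
    global ready
    with cond:
        ready = True
        cond.notify()             # wake a waiter

threading.Thread(target=setter).start()
with cond:
    cond.wait_for(lambda: ready)  # releases the lock while waiting
assert ready
```

Note that `wait_for` rechecks the predicate after each wakeup, which guards against spurious wakeups.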
Threads in User Space
Advantages:
- No change to the underlying OS
- Flexible scheduling
- Eliminates the overhead of a system call
Disadvantages:
- Blocking system calls
- Swaps due to page faults
- Clock interrupts?
- Other interrupts (signals)
Threads in Kernel Space
- Cost of a system call
- Reentrant library procedures (static variables)
In a non-threaded system, procedure calls are mutually exclusive: a program cannot be in two places at the same time.
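The reentrancy hazard can be sketched briefly (all names here are mine): a library routine that keeps its working state in one static/global buffer breaks once two threads can be inside it at the same time; per-thread storage is one standard fix.

```python
import threading

# threading.local() gives each thread its own copy of .buf, playing the
# role that a per-thread buffer plays in a reentrant C library.
local = threading.local()

def format_id(n):
    local.buf = f"id-{n}"         # per-thread, not one shared static buffer
    return local.buf

results = {}

def run(n):
    results[n] = format_id(n)

ts = [threading.Thread(target=run, args=(i,)) for i in range(4)]
for t in ts: t.start()
for t in ts: t.join()
assert results == {i: f"id-{i}" for i in range(4)}
```

With a single shared buffer instead of `threading.local()`, one thread's call could overwrite another's result between the write and the return.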
Scheduler Activations
Hybrid solution:
- Keep thread management at user level
- System calls/page faults block the thread, not the process
- The kernel signals the user-level thread manager (an UPCALL) on blocking or unblocking events
- Problem: what if the upcall interrupts a thread that is in a critical section when an unblocking event arrives?
- Problem: upcalls violate the layered approach
Not surprising, since thread management is split between peers.
RPC and Threads
Many RPCs are to processes on the same machine: they can share memory (map page registers to the calling stack). Not just for threads.
For a server RPC: no need to save/restore state while waiting.
Implicit receive: create a new thread to handle each incoming message.
Pop-up thread: a thread created to handle an RPC.
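A pop-up thread can be sketched as follows (a toy illustration; the message arrival, `handle`, and reply queue are stand-ins for a real RPC transport): each arriving message spawns a fresh handler thread rather than being queued for a long-lived server loop.

```python
import queue
import threading

replies = queue.Queue()           # stand-in for replies sent back to clients

def handle(msg):                  # body of the pop-up thread
    replies.put(f"ack:{msg}")

def implicit_receive(msg):        # on arrival, pop up a thread for the message
    t = threading.Thread(target=handle, args=(msg,))
    t.start()
    return t

threads = [implicit_receive(m) for m in ("a", "b", "c")]
for t in threads: t.join()
assert sorted(replies.queue) == ["ack:a", "ack:b", "ack:c"]
```

The appeal is that the pop-up thread starts with no saved state to restore; the cost is one thread creation per message.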
Copyright chris wild 1996.
Last updated: October 03, 1996.