<< Exercise 5 | Index | Exercise 7? >>
To add a question because something is unclear or was not understood, just insert the question and add the prefix %q% for each addition (q like question). This is the "question-style". Like this:
* %q% What kind of problems could have decentralized nature? |
|
If you want to answer a question or add a comment please put a %a% in front. This is thea "answer-style" (a lilke answer). An example:
* %a% This is an addition to something that I consider important. |
|
For citations or references to the slides of Prof. Suri pleas add the lecture and slide number in braces: (<lecture>.<slide>).
Please make sure that you enter an author name, else your changes will not be saved!
Exercise 6
Dependability
- Try to formulate a definition of dependability of a general computer-based system. What is the dependability of a car? Of an Operating System? Of a space shuttle?
- Revisited: Explain the difference between Faults, Errors and Failures.
- What is the difference between a fault model and a failure model? Name, and explain a few failure models that can be useful in a distributed system.
- What does a designer mean if he/she says that faults (of a certain type) are assumed to be independent? What does this mean for the design at hand?
- What is a SPF?
- Dependability is achieved using one or more of these general techniques. Explain each of them using examples:
- Fault removal
- Fault forecasting
- Fault prevention
- Fault avoidance
- What is a definition of a fault tolerant system? In what way does it differ from a dependable system?
- There are many attributes of dependability. Explain the difference (and relation) between the following attributes:
- Reliability
- Maintainability
- Safety
- Performability
- Security
- Faults can be classified as being either omissive or assertive. Give examples of faults in a modern PC belonging to the different groups.
- What are arbitrary faults? Byzantine
- For dependable systems one often hears the word "coverage". What does it mean that a system has 100% coverage? Can 100% coverage even be achieved?
- Revisited: Fault-Tolerance is achieved through redundancy. What are the types of redundancy available? Give examples of designs where one or the other is suitable.
- Most of the time you want to achieve exact agreement on a result. But sometimes only inexact agreement is possible. Give some examples of applications where this is the case.
- Explain the principle behind the following convergence functions: Fault-Tolerant Midpoint and Fault-Tolerant Average. When are they appropriate, when not?
- What is the difference between forward and backwards recovery?
- Distributed systems are usually considered suitable for implementing highly dependable systems. Why do you think this is so? Which properties are of importance when designing a highly dependable distributed system?
Diagnosis
- Why do we only talk about error detection and failure detection and not about fault detection?
- Give some examples of local failure/error detectors.
- Local failure detectors don't work as well on the system level. Why is this the case? Tip: What are the assumptions usually made for system level failure detection in distributed systems?
- Someone tells you that his/her system needs 2f + 1 nodes to correctly detect f failures. Which is the failure model for the system do you think? What is the model if 3f + 1 nodes are required?
- What is a perfect failure detector? Why don't we always use them?
- Explain the concepts of weak/strong accuracy/completeness.
- What is an eventually weak failure detector?
Consensus and Group Membership
- What is network partitioning and how does one detect it? What do you do to "mend" a partitioned network such progress can be made
- What is the consensus problem? How would you formulate the fault tolerant consensus problem?
- Give an algorithm that solves the fault tolerant consensus problem when the coordinator in a system can fail (silent).
- What is the difference between uniform consensus and non-uniform consensus?
- What is a group membership service?
- Define a group membership algorithm using distributed agreement.
- What happens with the group view when one of the members fail?
- What does it mean that a membership service implements linear membership?
Fault-Tolerant Communication
- Which failures do you have to consider in a distributed setting as opposed to a centralized?
- Explain how reliable delivery can be implemented if links can fail by not delivering messages.
- What makes a asynchronous system more difficult to design than a synchronous with respect to reliable delivery? (tip: both can fail by not sending messages)
- How do you achieve reliable multicast as opposed to point-to-point sending of messages?
- How do you tolerate value faults in messages? Find out how this is done in TCP/IP over Ethernet.
- Revisit: define the Byzantine agreement problem.
- Give an example of an algorithm that solves the Byzantine agreement problem.
- Why do you need 3f +1 nodes to achieve agreement with Byzantine errors? Why is not a ma jority enough?
Implementing Ordering
- What is a causal hole and how do you avoid it when implementing a causal ordering protocol?
- Give an example of how you could implement a total ordering protocol.
Replication Management
- Explain how a state machine can help in implementing replication schemes that avoid partitioning by design.
- What do you do if parts of the applications are non-deterministic, say have user input?
- Explain the difference, using examples, of the following replication approaches. What are the advantages/disadvantages of the approaches?
- Active replication
- Semi-Active replication
- Passive replication
Recent Changes
- Exercise1 . . . 16. Februar 2006, 11:39 Uhr von chrschn:
- Exercise5 . . . 16. März 2005, 09:56 Uhr von chrschn
- GroupFooter . . . 16. März 2005, 09:26 Uhr von chrschn
- Exercise4 . . . 10. März 2005, 16:48 Uhr von Alexandre Greff Buaes?
- Exercise3 . . . 10. März 2005, 11:56 Uhr von Alexandre Greff Buaes?
- Exercise2 . . . 09. März 2005, 14:25 Uhr von Alexandre Greff Buaes?
- Exercise6 . . . 04. März 2005, 12:34 Uhr von chrschn
- Dep1Uebungen? . . . February 27, 2005, at 05:13 PM by chrschn
- GroupHeader . . . February 25, 2005, at 01:21 PM by chrschn
- Index . . . February 25, 2005, at 12:21 PM by chrschn
- SideBar . . . February 24, 2005, at 01:34 PM by chrschn
- Exercise11 . . . February 23, 2005, at 03:41 PM by chrschn
- Excercise11? . . . February 23, 2005, at 03:41 PM by chrschn
Zuletzt geändert am 04 März 2005 12:34 Uhr von chrschn
