...
/Protocols for Maintaining Fault Tolerance: Part II
Protocols for Maintaining Fault Tolerance: Part II
Learn to add new elements to a running state machine system.
So far we’ve just discussed removing faulty elements and haven’t yet explored adding repaired or new elements to the system. Let's see how we can successfully integrate a new or repaired component into a system of state machine replicas.
Integrating repaired elements
It is not enough for the element being added to be non-faulty. It must also be in the right state to behave consistently with other components. Let's start by introducing some notation:
We define
An element is self-stabilizing if its current state is completely defined by a fixed number of previously processed inputs, say
Logical clocks and fail-stop failures
When using logical clocks and assuming only fail-stop failures, we only require the state of a state machine replica
For an output device
, we require little information to integrate it. This information may include setup and startup information and other trivial information that changes infrequently and can be stored in state machine replicas. For a client
...