Protocols for Maintaining Fault Tolerance: Part I

Learn how to keep enough working replicas in an ensemble to maintain fault tolerance.

We'll cover the following...

Modeling replica replacement
- Replacing replicas on failures
- Replacing output devices and clients
Managing a system of state machine replicas

Our protocols for $t$ fault tolerance guarantee that the system will not fail as long as no more than $t$ replicas fail. For this guarantee to hold, we must ensure that the number of faulty nodes in an ensemble of replicas never exceeds $t$. We can do this by replacing faulty replicas with non-faulty ones. Let's discuss this formally.

Modeling replica replacement

We define $P(\tau)$ as the total number of nodes running state machine replicas in an ensemble and $F(\tau)$ as the number of faulty nodes in that ensemble at time $\tau$. The number of non-faulty nodes, $P(\tau) - F(\tau)$, must exceed a threshold to guarantee that our system produces the correct output. We can state this formally as the combining condition:

$$P(\tau) - F(\tau) > Enuf \quad \text{for all } \tau \ge 0$$

Here, $Enuf = P(\tau)/2$ when Byzantine failures are possible, and $Enuf = 0$ when only fail-stop failures are possible.

If the condition above holds, our system will provide the correct output, because it has at least the minimum number of non-faulty nodes required for the failure type in question. For Byzantine failures, we need a majority: the number of non-faulty nodes must be an integer greater than $P(\tau)/2$. For fail-stop failures, we only need one non-faulty node, which ...
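To make the combining condition concrete, here is a minimal Python sketch that evaluates it at a single instant. The function names `enuf` and `combining_condition_holds`, and the `byzantine` flag, are hypothetical names chosen for illustration and do not come from the lesson. The sketch assumes we already know the total and faulty node counts at that instant.

```python
from fractions import Fraction


def enuf(total_nodes: int, byzantine: bool) -> Fraction:
    """Return the threshold that the non-faulty node count must strictly exceed.

    Byzantine failures: P/2, so a strict majority is needed.
    Fail-stop failures: 0, so a single non-faulty node is enough.
    """
    # Fraction keeps P/2 exact for odd P (e.g. 5/2), avoiding float rounding.
    return Fraction(total_nodes, 2) if byzantine else Fraction(0)


def combining_condition_holds(total_nodes: int, faulty_nodes: int, byzantine: bool) -> bool:
    """Check P - F > Enuf for one snapshot of the ensemble (hypothetical helper)."""
    if not 0 <= faulty_nodes <= total_nodes:
        raise ValueError("faulty_nodes must be between 0 and total_nodes")
    return total_nodes - faulty_nodes > enuf(total_nodes, byzantine)


if __name__ == "__main__":
    # Byzantine case, 5 nodes, 2 faulty: 3 non-faulty > 5/2, so the condition holds.
    print(combining_condition_holds(5, 2, byzantine=True))   # True
    # Byzantine case, 4 nodes, 2 faulty: 2 non-faulty is not > 2, so it fails.
    print(combining_condition_holds(4, 2, byzantine=True))   # False
    # Fail-stop case, 3 nodes, 2 faulty: 1 non-faulty > 0, so the condition holds.
    print(combining_condition_holds(3, 2, byzantine=False))  # True
```

The 4-node Byzantine case shows why the inequality is strict: exactly half of the nodes being non-faulty is not a majority, so the condition fails.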
