Design of the Google File System (GFS)

Learn how Google designed its first file system, GFS.

The problem that GFS solved

Google needed a distributed file system that could horizontally scale in terms of storage and read/write IOPS (input/output operations per second) while using commodity hardware. Large distributed programs such as web crawlers and processing frameworks such as MapReduce need substantial storage with good read and write performance. These storage requirements surpass the capabilities of traditional single-node file systems, NAS (network-attached storage) systems, and SAN (storage area network) systems. The conventional systems face throughput and vertical scaling constraints due to hardware limitations, and the cost of special-purpose storage networks is high. In response to these challenges, Google introduced the Google File System (GFS).

Google File System (GFS) is a distributed file system designed to store and process substantial amounts of data by utilizing a storage cluster of commodity servers. GFS aims to fulfill the following objectives:

In terms of scale, GFS should be able to store large files whose collective size can reach tens of petabytes. Hundreds of concurrent clients might be using the storage system at any instant.

GFS API

GFS supports two specialized operations, record append and snapshot, along with other basic file operations shown in the following illustration. The record append operation allows multiple clients to append small records to a file concurrently, while ensuring that records from different clients are not interleaved, maintaining the consistency of individual records. The snapshot operation allows the clients to create a copy of a file or a directory tree at a low cost.
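The non-interleaving guarantee of record append can be modeled with a toy in-memory file that serializes concurrent appends and returns the offset it chose for each record. The names here (`AppendOnlyFile`, `record_append`) are illustrative, not the real GFS client API, and a single lock stands in for GFS's primary-replica serialization:

```python
import threading

class AppendOnlyFile:
    """Toy model of GFS record append: concurrent appends are
    serialized so records from different clients never interleave,
    and each client gets back the offset chosen for its record."""

    def __init__(self):
        self._data = bytearray()
        self._lock = threading.Lock()

    def record_append(self, record: bytes) -> int:
        # Real GFS additionally pads to a chunk boundary when the
        # record would not fit in the current chunk; this sketch
        # only shows the atomic choose-offset-and-write step.
        with self._lock:
            offset = len(self._data)
            self._data += record
            return offset

f = AppendOnlyFile()
offsets = []
threads = [
    threading.Thread(
        target=lambda r: offsets.append((f.record_append(r), r)),
        args=(bytes([i]) * 10,),
    )
    for i in range(4)
]
for t in threads:
    t.start()
for t in threads:
    t.join()

# Each record occupies one contiguous, non-interleaved byte range.
for off, rec in offsets:
    assert bytes(f._data[off:off + len(rec)]) == rec
```

Because the file chooses the offset (rather than the client specifying one), many producers can log into the same file without any client-side coordination.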

GFS design is optimized for large, batch-oriented workloads. Most files are mutated by appending new data to them rather than overwriting existing data. So, it focuses on providing the atomicity guarantee for append operations. The snapshot operation helps capture the consistent state of the file system, which helps in backup and recovery. The copy-on-write approach used in snapshots ensures that only changes from the original data need to be stored, optimizing space utilization.
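The copy-on-write behavior behind snapshots can be sketched with a small model in which a file is just a list of chunk ids into a shared, reference-counted chunk store. The class and field names are invented for illustration; in real GFS the manager duplicates chunk metadata and the chunkservers copy chunk data lazily:

```python
class CowFile:
    """Copy-on-write sketch: snapshot() copies only the chunk-id
    list; chunk data is duplicated lazily, on the first write
    after the snapshot."""

    store = {}      # chunk_id -> bytes (shared chunk store)
    refcount = {}   # chunk_id -> number of files referencing it
    _next_id = 0

    def __init__(self, chunk_ids=None):
        self.chunk_ids = list(chunk_ids or [])
        for cid in self.chunk_ids:
            CowFile.refcount[cid] += 1

    @classmethod
    def _new_chunk(cls, data: bytes) -> int:
        cls._next_id += 1
        cls.store[cls._next_id] = data
        cls.refcount[cls._next_id] = 1
        return cls._next_id

    def append_chunk(self, data: bytes):
        self.chunk_ids.append(CowFile._new_chunk(data))

    def snapshot(self) -> "CowFile":
        # O(metadata): shares every chunk, copies no data.
        return CowFile(self.chunk_ids)

    def write_chunk(self, index: int, data: bytes):
        cid = self.chunk_ids[index]
        if CowFile.refcount[cid] > 1:
            # Chunk is shared with a snapshot: copy before writing.
            CowFile.refcount[cid] -= 1
            self.chunk_ids[index] = CowFile._new_chunk(data)
        else:
            CowFile.store[cid] = data

f = CowFile()
f.append_chunk(b"v1")
snap = f.snapshot()          # cheap: no chunk data copied yet
f.write_chunk(0, b"v2")      # first write triggers the copy
assert CowFile.store[snap.chunk_ids[0]] == b"v1"
assert CowFile.store[f.chunk_ids[0]] == b"v2"
```

The snapshot stays consistent because the shared chunk is never modified in place once a second reference to it exists; only the writer pays the copying cost, and only for the chunks it actually touches.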

Let’s delve into the design of GFS that empowers it to accomplish the objectives mentioned above.

Design of GFS

GFS is built on a cluster of commodity servers and consists of two programs: a manager program and a chunkserver program. The server that runs the manager program is called the GFS manager, while a server that runs the chunkserver program is called a chunkserver. In GFS, there is a single manager and a large number of chunkservers, as shown in the following illustration.

  • The client is a GFS application programming interface through which end users perform directory or file operations.
  • A chunk is a data storage unit in GFS. Each file in GFS is split into fixed-size (64 MB) chunks. The manager assigns each chunk a 64-bit globally unique ID called the chunk handle. It also allocates three chunkservers to store three replicas of each chunk. The default replication factor in GFS is three, though it is configurable.
  • The manager is like an administrator that manages the file system metadata, including namespaces, file-to-chunk mapping, and chunk location. The metadata is stored in the manager’s memory for good performance. For a persistent record of the metadata, the manager logs ...
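The chunking arithmetic described above is straightforward to sketch. The helper names below are hypothetical, and random handle generation and random replica placement stand in for the manager's real bookkeeping (which guarantees handle uniqueness and weighs disk usage and placement when choosing chunkservers):

```python
import random
import secrets

CHUNK_SIZE = 64 * 1024 * 1024  # fixed 64 MB chunks

def chunk_index(file_offset: int) -> int:
    """Which chunk of a file a given byte offset falls in."""
    return file_offset // CHUNK_SIZE

def new_chunk_handle() -> int:
    """A 64-bit chunk id (random here; the real manager
    guarantees global uniqueness at chunk creation time)."""
    return secrets.randbits(64)

def place_replicas(chunkservers, replication_factor=3):
    """Pick distinct chunkservers to hold the chunk's replicas."""
    return random.sample(chunkservers, replication_factor)

# A 200 MB file spans 4 chunks (indices 0..3).
assert chunk_index(200 * 1024 * 1024 - 1) == 3
servers = [f"cs{i}" for i in range(10)]
assert len(set(place_replicas(servers))) == 3
```

With fixed-size chunks, a client can translate any (file, byte offset) pair into a (chunk index, offset within chunk) pair locally, and only needs to ask the manager for the chunk handle and replica locations.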