Problem statement

A typical system consists of the following components:

A client requesting the service.
Service host(s) handling client requests.
A database used by the service for data storage.

Under normal circumstances, this abstraction performs fine. However, as the number of users, and therefore the number of database queries, increases, the service providers become overburdened, resulting in slow performance.

In such cases, a cache is added to the system to deal with performance deterioration. A cache is a temporary data store that can serve data faster by keeping data entries in memory. Caches store only the most frequently accessed data. When a request reaches the serving host, the host first looks for the data in the cache. If the requested data is found there (a cache hit), the server responds with it immediately. If the requested data is not found in the cache (a cache miss), it is queried from the database, and the cache is populated with the retrieved value to avoid a cache miss the next time.
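The read path described above can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: the class name CachedService, its hits and misses counters, and the use of plain dictionaries for both the cache and the database are hypothetical stand-ins, and the sketch ignores eviction, expiry, and concurrency.

```python
class CachedService:
    """Hypothetical service host that checks an in-memory cache before the database."""

    def __init__(self, database):
        self.database = database  # stand-in for the slower backing store (assumed dict)
        self.cache = {}           # in-memory cache, unbounded in this sketch
        self.hits = 0
        self.misses = 0

    def get(self, key):
        # Cache hit: the data is in the cache, so respond immediately.
        if key in self.cache:
            self.hits += 1
            return self.cache[key]

        # Cache miss: query the database, then populate the cache
        # so the next request for this key is a hit.
        self.misses += 1
        value = self.database[key]  # raises KeyError if the key does not exist
        self.cache[key] = value
        return value


service = CachedService({"user:1": "Alice"})
service.get("user:1")  # first read misses and fills the cache
service.get("user:1")  # second read is served from the cache
print(service.hits, service.misses)
```

In this sketch, the first request for a key always goes to the database, and every later request for the same key is answered from memory.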

As we progress in this lesson, we will understand what a distributed cache is and why we need one. Because data is written to both the cache and the database, the order in which these writes happen has performance implications. We therefore discuss various writing policies next. We also explain the policies used to evict less frequently accessed data from the distributed cache. Since cached data may become outdated, we then cover cache invalidation methods. We further discuss the cache client, a library that sends requests to the cache servers. Before concluding the lesson, we explain different mechanisms for storing data in the cache.

What is a distributed cache

A distributed cache is a caching system where multiple cache servers coordinate to store frequently accessed data. Distributed caches are needed in environments where a single ...
