System Design: Distributed Monitoring

Learn why monitoring matters in a distributed system, and explore our high-level plan for designing a monitoring system.

The modern economy depends on the continuous operation of IT infrastructure, which comprises hardware, distributed services, and network resources. Because these components are interlinked, it is challenging to keep everything running smoothly and without application downtime.

It’s challenging to know what’s happening at the hardware or application level when our infrastructure is distributed across multiple locations and includes many servers. Things can go wrong in many ways: components fail, response latency overshoots its limits, hardware becomes overloaded or unreachable, and containers run out of resources, among other problems. With many services running in such an infrastructure, anything can go awry.
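
To make these failure conditions concrete, here is a minimal Python sketch of how a single server snapshot might be checked against them. It is an illustration under stated assumptions, not the design this course develops: the `HostSample` fields, the `detect_problems` helper, and all three thresholds are hypothetical names and values chosen only for this example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class HostSample:
    """One hypothetical snapshot of a server; every field name is illustrative."""
    host: str
    reachable: bool
    latency_ms: Optional[float]      # None when no response was received
    cpu_percent: float
    container_mem_used_mb: float
    container_mem_limit_mb: float


# Assumed thresholds, picked only for illustration.
LATENCY_LIMIT_MS = 500.0
CPU_OVERLOAD_PERCENT = 90.0
MEMORY_PRESSURE_RATIO = 0.95


def detect_problems(sample: HostSample) -> List[str]:
    """Return the names of the failure conditions this sample exhibits."""
    if not sample.reachable:
        # An unreachable host cannot report anything else reliably.
        return ["unreachable"]

    problems: List[str] = []
    if sample.latency_ms is not None and sample.latency_ms > LATENCY_LIMIT_MS:
        problems.append("latency_overshoot")
    if sample.cpu_percent >= CPU_OVERLOAD_PERCENT:
        problems.append("overloaded")
    mem_threshold_mb = MEMORY_PRESSURE_RATIO * sample.container_mem_limit_mb
    if sample.container_mem_used_mb >= mem_threshold_mb:
        problems.append("container_low_on_memory")
    return problems
```

Calling `detect_problems` on a sample whose `reachable` flag is `False` yields only `["unreachable"]`, while a reachable host can show several problems at once. Even this toy check has to run against every server in every location, which hints at why a dedicated monitoring system is needed.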

When one of the services ...