Building Resilient Event-Driven Microservices Apps in .NET 7/

...

Fault Tolerance and Fault Injection

Explore fault tolerance and fault injection in cloud application development.

We'll cover the following...

Anticipating and tolerating faults
Using our faults against us

The concept of fault tolerance—that is, the ability of an application, platform, or runtime to tolerate a systemic fault—by itself seems a simple enough concept to grasp. After all, we’d expect an application to be able to gracefully recover if certain services were not available. In many cases, though, applications have been written with an understanding that the underlying infrastructure that hosts it is always available unless a catastrophic event occurs. While this reliability might be built into on-premises data centers and rarely challenged, the same assumption doesn’t hold for cloud platforms, services, and components.

Though cloud platforms will offer certain Service-Level Agreements (SLAs) for uptime on ...

Introduction

The Sample Application

The Producer-Consumer Pattern

Message Brokers

Domain Model and Asynchronous Events

Containerization and Local Environment Setup

Localized Testing and Debugging of Microservices

Microservice Observability

CI/CD Pipelines and Integrated Testing

Fault Injection and Chaos Testing

Modern Design Patterns for Scalability

Minimizing Data Loss

Service and Application Resiliency

Telemetry Capture and Integration

Observability Revisited

Conclusion

Fault Tolerance and Fault Injection