Azure VM Fault and Update Domains
A brief introduction to Azure VM fault and update domains.
Introduction
Redundancy, high availability (HA), uptime, and service-level agreement (SLA) are all terms that revolve around one important concept: application (app) availability. You can build the best application in the world, with the fanciest whiz-bang features that can cure cancer, but if that app isn’t available to users, it’s worthless. Apps and the infrastructure they are built upon must stay online and available as much as humanly possible.
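To put availability targets in perspective, it can help to translate an uptime percentage into the amount of downtime it actually permits. The following is a minimal sketch of that arithmetic, not anything Azure-specific: the function name `downtime_budget_minutes` and the 365-day year are assumptions made purely for illustration.

```python
def downtime_budget_minutes(availability_percent: float, period_days: float = 365.0) -> float:
    """Return the maximum downtime, in minutes, that an availability target
    allows over a period (assumed here to be a 365-day year by default)."""
    if not 0.0 <= availability_percent <= 100.0:
        raise ValueError("availability_percent must be between 0 and 100")
    total_minutes = period_days * 24 * 60
    # The downtime budget is the fraction of the period that is NOT covered
    # by the availability target.
    return total_minutes * (100.0 - availability_percent) / 100.0


if __name__ == "__main__":
    # Hypothetical targets chosen for comparison.
    for target in (99.0, 99.9, 99.99, 99.999):
        minutes = downtime_budget_minutes(target)
        print(f"{target}% availability allows about {minutes:.2f} minutes of downtime per year")
```

Under these assumptions, a "five nines" target leaves only a little over five minutes of downtime per year, which is why even a brief infrastructure outage can consume the whole budget.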
One of the most common ways an app can lose a few points off its 99.999% SLA is infrastructure downtime. Stuff happens: power goes out, rack cabling gets “accidentally” disconnected, a rogue VM goes nuts on a shared hypervisor, and more. If an app is running on a set of VMs, one way ...