Reading Cloud Infrastructure Architecture Documents
5 exercises — read a realistic AWS Multi-AZ production setup document. Understand ALB, Auto Scaling, RDS failover, ElastiCache, and CloudFront, and extract key infrastructure decisions.
Reading cloud architecture documents
- Availability Zones → separate data centres; "Multi-AZ" = survive a data centre failure
- Scaling direction → horizontal (more instances) vs. vertical (bigger instance)
- Caching layers → where does data live? RAM (ElastiCache), CDN (CloudFront), disk (S3/RDS)?
- Stateless vs. stateful → stateless tiers scale freely; stateful tiers need sticky sessions or shared state
- Numbers matter → min/max instances, TTL, failover time — read and remember these
0 / 5 completed
1 / 5
AWS Multi-AZ Production Architecture
{ex.passage} According to the passage, what is "Multi-AZ" and why is it used for the RDS database?
Multi-AZ = synchronous standby in a second AZ for automatic failover:
The passage states: "AWS maintains a synchronous standby replica in a second AZ. If the primary instance or its AZ becomes unavailable, RDS automatically fails over to the standby — typically completing within 60–120 seconds — with no manual intervention required."
Key concepts unpacked:
These are often confused. Multi-AZ is for availability (survive failures). Read Replicas are for read scalability (distribute SELECT queries). Multi-AZ standby does NOT serve read traffic — it only exists to take over in a failure.
Infrastructure vocabulary:
The passage states: "AWS maintains a synchronous standby replica in a second AZ. If the primary instance or its AZ becomes unavailable, RDS automatically fails over to the standby — typically completing within 60–120 seconds — with no manual intervention required."
Key concepts unpacked:
- Availability Zone (AZ) → a physically separate data centre within an AWS Region. An AZ can fail (power, networking) without affecting other AZs.
- Synchronous replication → every write to the primary is simultaneously written to the standby before the write is acknowledged. This means zero data loss on failover.
- Automatic failover → AWS detects the failure and promotes the standby to primary without a human pressing a button.
These are often confused. Multi-AZ is for availability (survive failures). Read Replicas are for read scalability (distribute SELECT queries). Multi-AZ standby does NOT serve read traffic — it only exists to take over in a failure.
Infrastructure vocabulary:
- failover → the process of switching from a failed component to a standby
- high availability (HA) → system design that minimises downtime
- no manual intervention → automated operations, no on-call engineer needed for recovery