Intermediate Estimation Language #sla #reliability

Reading SLAs & SLOs

6 exercises — interpret uptime percentages, error budgets, percentile latency targets (P99), and SLA breach language for engineers and stakeholders.

0 / 6 completed

1 / 6

An SLA states "99.9% uptime per month". Approximately how much downtime does this permit per month?

43 minutes — 99.9% uptime = 0.1% downtime per month.

Calculation:
• 1 month ≈ 30 days × 24 hours × 60 minutes = 43,200 minutes
• 0.1% of 43,200 = 43.2 minutes

Uptime → downtime reference table (per month):

SLA	Nines	Monthly downtime	Annual downtime
99%	Two 9s	~7.2 hours	~87.6 hours
99.9%	Three 9s	~43 min	~8.7 hours
99.95%	Three-and-a-half 9s	~21 min	~4.4 hours
99.99%	Four 9s	~4.3 min	~52 min
99.999%	Five 9s	~26 sec	~5 min

Saying it in English:
• "99.9% uptime — that's three nines — permits about 43 minutes of downtime per month"
• "We're on a four-nine SLA, so we get fewer than 5 minutes downtime per month"
• "Each additional nine reduces your downtime budget by a factor of ten"

2 / 6

What is the difference between an SLA, an SLO, and an SLI?

3 / 6

A service has consumed 80% of its monthly error budget by day 20. What does this mean, and what would you say?

4 / 6

An SLA clause reads: "The latency SLO for the search API is P99 < 500ms." What does P99 mean, and how would you describe it?

5 / 6

Your monitoring shows the service is at 99.94% availability over the past 30 days. Your SLO is 99.9%. How do you describe this status?

6 / 6

How do you clearly explain an SLA penalty to a non-technical stakeholder?