Intermediate Vocabulary #sla #sre #uptime #devops

SLA Percentages & Uptime

5 exercises — reading uptime percentages, calculating allowed downtime, understanding error budgets, and mastering SLI/SLO/SLA terminology used in DevOps and SRE roles.

Uptime quick reference

99% (two nines) → ~87.6 hours/year downtime
99.9% (three nines) → ~8.76 hours/year
99.99% (four nines) → ~52.6 min/year
99.999% (five nines) → ~5.26 min/year
Error budget = 100% − availability target, usually the SLO (the allowed failure time)
SLI → metric; SLO → internal target; SLA → contract
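
To make the error budget line concrete, the sketch below converts a target into a time budget and reports how much of it has been spent. It assumes a 30-day window and that availability is measured purely in minutes of downtime. The names `error_budget_minutes` and `budget_consumed` are hypothetical, chosen for illustration, and do not come from any particular SRE tool.

```python
def error_budget_minutes(target_percent: float, window_days: int = 30) -> float:
    """Downtime budget in minutes for a target over a window (assumed 30 days)."""
    window_minutes = window_days * 24 * 60
    return (100 - target_percent) / 100 * window_minutes


def budget_consumed(downtime_minutes: float, target_percent: float,
                    window_days: int = 30) -> float:
    """Fraction of the error budget already spent (1.0 means fully spent)."""
    return downtime_minutes / error_budget_minutes(target_percent, window_days)


budget = error_budget_minutes(99.9)   # about 43.2 minutes in a 30-day window
spent = budget_consumed(34.56, 99.9)  # about 0.80, i.e. 80% of the budget
print(f"budget: {budget:.1f} min, consumed: {spent:.0%}")
```

A real error budget is often computed from request success ratios rather than wall-clock downtime, so treat the time-based version as a simplification.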

1 / 5

Your SLA guarantees 99.9% uptime per year. A colleague asks how much downtime that allows. What is the correct answer?

2 / 5

What is meant by "five nines", and how much downtime does it permit per year?

"Five nines" — 99.999% uptime
"Five nines" is a shorthand used in SRE and SLA negotiations for the highest practical tier of availability.

The math:

100% − 99.999% = 0.001% maximum downtime (a fraction of 0.00001)
0.00001 × 525,600 min/year ≈ 5.26 minutes/year
Per month: ~26 seconds
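
The arithmetic above can be checked with a short script. This is a minimal sketch, assuming a 365-day year (525,600 minutes) as in the calculation above; the function name `allowed_downtime_minutes` is hypothetical.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600, assuming a non-leap year


def allowed_downtime_minutes(uptime_percent: float,
                             period_minutes: float = MINUTES_PER_YEAR) -> float:
    """Return the downtime in minutes permitted by an uptime target over a period."""
    if not 0 <= uptime_percent <= 100:
        raise ValueError("uptime_percent must be between 0 and 100")
    downtime_fraction = (100 - uptime_percent) / 100
    return downtime_fraction * period_minutes


yearly = allowed_downtime_minutes(99.999)
monthly_seconds = yearly / 12 * 60  # split the yearly allowance evenly over 12 months
print(f"{yearly:.2f} min/year, {monthly_seconds:.0f} s/month")
# prints: 5.26 min/year, 26 s/month
```

Passing 99.9 or 99.99 instead reproduces the other rows of the quick reference.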

The "nines" naming convention: count every 9 in the percentage, before and after the decimal point:

Name	Uptime	Downtime/year
Two nines	99%	~87.6 hours
Three nines	99.9%	~8.76 hours
Four nines	99.99%	~52.6 min
Five nines	99.999%	~5.26 min

Realistic note: Five nines requires extreme engineering effort and cost. Many major AWS and Azure services publish SLAs in the 99.9% to 99.99% range. True five nines is rare and expensive.

3 / 5

An SRE says: "We've used 80% of our error budget this month and it's only the 20th."
What is an error budget, and why is this a concern?

4 / 5

A team is debating their SLA tier for a new payment processing service. The options are 99%, 99.9%, or 99.99% uptime. Which should they target, and why?

5 / 5

An incident report states: "The service missed its SLO, triggering an SLA breach. The team must now review SLIs."
What do SLI, SLO, and SLA stand for, and how do they relate?