Listening: Enterprise SLA Review Meeting
An enterprise customer reviewing SLA performance. 3 questions on SLA/SLO/SLI vocabulary, incident impact language, and technical terms used in account review meetings.
SLA meeting vocabulary — essentials
- SLA — Service Level Agreement: the external contract uptime/response guarantee
- SLO — Service Level Objective: the internal engineering target (stricter than SLA)
- SLI — Service Level Indicator: the actual measured metric
- blast radius — scope of impact: how many users/services/regions were affected
- cascading failure — one component's failure triggering failures in others
0 / 3 completed
1 / 3
Enterprise customer: "We need to revisit the SLA. The current agreement guarantees 99.9% uptime, but last quarter we saw two incidents that together resulted in four hours of downtime. That's well outside the agreed terms."
What does SLA stand for, and what is the customer asserting?
What does SLA stand for, and what is the customer asserting?
SLA (Service Level Agreement) is a contract between a service provider and a customer that defines the level of service expected — typically including uptime guarantees, response times, and remedies for failures.
The mathematics of 99.9% uptime:
The mathematics of 99.9% uptime:
- 99.9% uptime = ~8.7 hours of permitted downtime per year
- ~43.8 minutes per month
- 4 hours of downtime in a quarter = approximately 4× the monthly allowance — a clear breach
- uptime — percentage of time the service is available
- downtime — period when the service is unavailable
- SLA breach — the vendor failed to meet the contracted level of service
- remediation / credits — compensation provided when an SLA is breached (often service credits)