Error Budget Reviews in English: SLO Vocabulary for SRE Teams

Learn the English vocabulary for error budget reviews — burn rate, SLI/SLO/SLA definitions, exhausting the budget, and freeze and resume decisions.

Error budget reviews are a core ritual in Site Reliability Engineering. They are the meetings where teams examine how much of their allowed unreliability they have consumed, decide whether to freeze new feature releases, and plan corrective action. These reviews require precise English — the difference between “the budget is exhausted” and “the budget is at risk” has real operational consequences. This article covers the vocabulary you need to participate fluently.

Key Vocabulary

SLI (Service Level Indicator) An SLI is a specific, measurable metric that reflects the health of a service from the user’s perspective — for example, request latency or availability percentage. “Our primary SLI for the checkout API is the proportion of requests completing successfully in under 300 milliseconds.”

SLO (Service Level Objective) An SLO is the internal target your team commits to for an SLI. It is typically expressed as a percentage over a rolling time window. “Our SLO for the checkout API is 99.9% availability over a rolling 28-day window.”

SLA (Service Level Agreement) An SLA is a contractual commitment made to external customers or partners. Breaching an SLA typically has financial or legal consequences, making it stricter than an SLO. “Our SLA guarantees 99.5% uptime — our SLO is intentionally set higher at 99.9% to give us a buffer before we breach the contract.”

Error budget The error budget is the amount of allowed downtime or failed requests derived from the SLO. If your SLO is 99.9% over 28 days, you have approximately 40 minutes of allowed downtime in that window. “We have consumed 73% of our error budget this month, mostly from the deployment incident on the 12th.”

Burn rate Burn rate measures how quickly you are consuming your error budget relative to the rate at which it would be naturally consumed if you were meeting your SLO exactly. A burn rate greater than 1 means you are consuming budget faster than it replenishes. “The burn rate alert fired because we were consuming our monthly error budget at six times the normal rate during the database failover.”

Budget exhaustion / exhausting the budget Budget exhaustion means the entire error budget for a period has been used up. When the budget is exhausted, the team typically freezes new feature releases until it is replenished. “If we exhaust the budget before the end of the month, we will need to freeze deployments and focus exclusively on reliability work.”

Feature freeze A feature freeze is a period during which no new features are deployed to production, typically triggered by error budget exhaustion or a critical reliability incident. “We are entering a feature freeze effective immediately — all engineering effort is redirected to reliability until the budget recovers.”

Toil In SRE, toil is manual, repetitive operational work that is directly tied to running a service. High toil consumes engineering time that could otherwise be spent improving reliability or building automation. “One of the main findings from this error budget review is that 40% of our on-call time is toil that could be automated.”

Useful Phrases

  • “Let’s walk through the error budget report for the last 28 days.”
  • “We have consumed approximately 60% of our budget — we are on track to exhaust it if the current burn rate continues.”
  • “Based on this data, I am recommending a deployment freeze until the budget replenishes.”
  • “The root cause of the budget consumption was the cache eviction incident — I’ll pull up the post-mortem.”
  • “Are we aligned that reliability work should take priority over roadmap features until we are back in budget?”
  • “Our SLO is 99.9% but our SLA threshold is 99.5% — we have room before we affect customers contractually.”

Common Mistakes

Confusing SLO and SLA Non-native speakers frequently use SLO and SLA interchangeably, but the distinction matters. An SLO is an internal aspiration; an SLA is an external legal commitment. Breaching your SLO is a signal; breaching your SLA can cost money or customers.

Saying “the budget is full” instead of “the budget is recovered” or “the budget is replenished” The error budget is consumed over time and replenishes as the time window rolls forward. It is not “filled” manually. Use “the budget has recovered,” “the budget has replenished,” or “the budget has reset” when the window rolls over.

Using “exceeded” to mean “consumed” Saying “we exceeded our error budget” is correct — it means you went beyond the allowed limit. But “we exceeded our SLO” means something different — it means you performed better than the target, not worse. Be precise: “we breached our SLO” or “we consumed the error budget.”

Error budget reviews are one of the most technically specific conversations in SRE. Building fluency in this vocabulary allows you to participate in freeze/resume decisions, communicate risk to stakeholders, and advocate effectively for reliability investment.