Advanced Vocabulary #sre#reliability#errorbudget#toil

Advanced SRE Vocabulary

5 exercises — Practice advanced SRE vocabulary in English: error budget policies, toil elimination, postmortem culture, capacity planning, and production readiness reviews.

Core Advanced SRE vocabulary clusters
  • Error budget: error budget policy, burn rate, fast/slow burn, budget exhaustion, budget recovery
  • Toil: toil definition (manual, repetitive, automatable), toil budget (50%), toil elimination vs. reduction
  • Postmortem: blameless culture, contributing factors, proximate/systemic cause, action items, 5 whys
  • Reliability: production readiness review (PRR), launch checklist, change management, rollback criteria
  • Organisation: embedded SRE, platform SRE, CRE, SRE engagement model, hand-back criteria
0 / 5 completed
1 / 5
An SRE lead explains the error budget policy to a development team:
"Our SLO is 99.9% availability, giving us 43 minutes of error budget per month. The policy has three tiers. At 50%+ budget remaining: normal release velocity, no restrictions. At 10-50% remaining: caution mode — all deployments require SRE sign-off and a rollback plan. At under 10% remaining: budget freeze — no non-critical releases until budget recovers. At 0%: reliability sprint — engineering focuses entirely on SRE work until budget is restored."
What is an error budget policy and why is it important for the relationship between development and SRE teams?