Advanced Vocabulary #incident-response #sre #devops #on-call

Incident Response Vocabulary

5 exercises — the vocabulary every SRE, DevOps, and backend engineer needs to respond to and communicate about production incidents: blast radius, postmortems, escalation, war rooms, and runbooks.

Core incident response vocabulary clusters
  • Impact terms: blast radius, scope of impact, SEV-1/2/3, affected users, degraded service
  • Process terms: triage, contain, investigate, mitigate, resolve, post-incident review
  • Roles: incident commander (IC), on-call engineer, communication lead, scribe
  • Metrics: MTTA (mean time to acknowledge), MTTR (mean time to resolve), error rate, SLA breach
  • Documentation: runbook, playbook, postmortem, action items, timeline
  • Tools: PagerDuty, Opsgenie, Statuspage, alerting, escalation policy
0 / 5 completed
1 / 5
The incident commander says on a call:
"We've identified the blast radius — it's only the payments service, the rest of the platform is operating normally. Let's contain it before we investigate root cause."
What does blast radius mean in incident response?