Communicating During an Outage
4 exercises — write clear, structured incident announcements, updates, and resolution messages when production is down.
0 / 4 completed
Incident message anatomy
- Declaration: [P1 INCIDENT] + what is broken + scope + IC + bridge link
- Updates: timestamp → root cause → action in progress → ETA
- Escalation: @person + context + what was tried + bridge link
- Resolution: [RESOLVED] + timestamp + what was fixed + post-mortem date
1 / 4
Production payment service is returning 500 errors for 30% of users since 14:32 UTC. You are the Incident Commander. Which Slack message best declares the incident?
Option B follows the industry-standard incident announcement format. It contains all required elements:
• [P1 INCIDENT] — severity + incident label visible at a glance
• 🔴 — visual severity indicator
• What is broken — payment service, 500 errors
• Scope — 30% of users since 14:32 UTC
• Business impact — ~6,000 transactions/hr
• Incident Commander — clear owner (@alex)
• Bridge link — where to join the response call
• Status — "Investigating now"
Compare Option A — too casual, no structure. Options C and D lack scope, IC, and bridge. During an incident, the initial announcement sets the tone. A well-structured one means engineers can get up to speed instantly — seconds matter.
• [P1 INCIDENT] — severity + incident label visible at a glance
• 🔴 — visual severity indicator
• What is broken — payment service, 500 errors
• Scope — 30% of users since 14:32 UTC
• Business impact — ~6,000 transactions/hr
• Incident Commander — clear owner (@alex)
• Bridge link — where to join the response call
• Status — "Investigating now"
Compare Option A — too casual, no structure. Options C and D lack scope, IC, and bridge. During an incident, the initial announcement sets the tone. A well-structured one means engineers can get up to speed instantly — seconds matter.