Writing Post-Mortems
4 exercises — write complete blameless post-mortems: root cause analysis, impact, action items, and what went well.
0 / 4 completed
Post-mortem section checklist
- Executive Summary — 2 sentences, no jargon, for leadership
- Impact — duration, users affected, revenue, SLO breach (quantify everything)
- Timeline — blameless, chronological, UTC timestamps
- Root Cause Analysis — the fundamental reason, not just the trigger
- What Went Well / What Went Poorly — honest, specific, evidence-based
- Action Items — What + Owner + Due Date + Priority
1 / 4
Which section of a post-mortem describes what failed and why without assigning blame to individuals?
The Root Cause Analysis (RCA) section explains the technical and process failures that caused the incident. It answers: What specifically failed? Why did it fail? What conditions allowed it to fail?
Standard post-mortem sections:
• Executive Summary — 1–2 sentences for leadership
• Impact — duration, users affected, business effect
• Timeline — chronological events
• Root Cause Analysis — what failed and why
• Contributing Factors — conditions that made failure worse
• What Went Well — response actions that helped
• What Went Poorly — gaps in response or process
• Action Items — concrete follow-up with owner + due date
The RCA should identify root causes (the fundamental reason) not just proximate causes (the immediate trigger). Example: the proximate cause was "a config file had an error". The root cause might be "there was no automated config validation in the deployment pipeline".
Standard post-mortem sections:
• Executive Summary — 1–2 sentences for leadership
• Impact — duration, users affected, business effect
• Timeline — chronological events
• Root Cause Analysis — what failed and why
• Contributing Factors — conditions that made failure worse
• What Went Well — response actions that helped
• What Went Poorly — gaps in response or process
• Action Items — concrete follow-up with owner + due date
The RCA should identify root causes (the fundamental reason) not just proximate causes (the immediate trigger). Example: the proximate cause was "a config file had an error". The root cause might be "there was no automated config validation in the deployment pipeline".