Build fluency in the everyday language of on-call rotations and incident escalation.
0 / 5 completed
1 / 5
At standup, a dev explains they were paged overnight for a failing service. What is this notification event called?
A page is an urgent alert sent to the on-call engineer, typically via a tool like PagerDuty, demanding immediate attention. It differs from a low-priority notification that can wait for business hours. Pages are reserved for issues needing prompt human response.
2 / 5
During an incident, the on-call engineer can't resolve it alone and brings in a specialist. What is this action called?
Escalation is the process of pulling in additional responders, often a specialist team or a more senior engineer, when the current on-call can't resolve or diagnose the issue alone. Escalation policies define who gets paged next and after how long. This keeps incidents from stalling with one overwhelmed responder.
3 / 5
In a design review of the on-call process, the team writes step-by-step instructions for handling a known alert. What document is this?
A runbook provides concrete, step-by-step instructions for diagnosing and resolving a specific known alert or failure mode. It reduces cognitive load during a stressful page, especially for less experienced responders. Good runbooks are kept up to date as systems change.
4 / 5
An incident report shows the on-call engineer got paged for a condition that self-resolved and needed no action. What is this called?
A false positive alert fires without a real actionable problem, and repeated false positives contribute to alert fatigue, where responders start ignoring or delaying pages. Tuning alert thresholds reduces this risk. Excess noisy alerts are a common on-call health complaint.
5 / 5
During a PR review of the on-call schedule, a teammate asks who owns handling an incident after it's paged. What role is typically defined?
The incident commander coordinates the response during a major incident, making decisions and delegating tasks so responders can focus on fixing the problem. This role is distinct from whoever is deep in the technical debugging. Clear IC assignment avoids confusion during high-pressure incidents.