Intermediate Numbers & Data #observability #monitoring #dashboards #metrics #alerting

Interpreting Monitoring Dashboards

5 exercises on describing what you see on monitoring dashboards in professional IT English.

Dashboard narration vocabulary

Spike: a sharp short-duration increase — "a spike in latency at 14:32"
Trend: a directional change over time — "memory usage trended up over 6 hours"
Correlation: two metrics moving together — "correlates with the deployment at 14:15"
Anomaly: a departure from baseline — "outside the normal operating range"

0 / 5 completed

1 / 5

A Grafana panel shows CPU usage jumping from 30% to 92% at 14:32 and returning to 35% by 14:45. How do you describe this in an incident summary?

2 / 5

A monitoring dashboard shows memory usage increasing steadily from 4 GB to 7.8 GB over 6 hours, with the OOM killer triggering at hour 7. What type of pattern is this, and what does it suggest?

3 / 5

An alert fires for high error rate (threshold: 1%). The graph shows error rate at 0.8% for the past hour but jumped to 1.4% in the last 5 minutes. A colleague asks: "Is this a real incident?" What is the best professional response?

4 / 5

After a deployment, the dashboard shows p99 latency increasing from 180ms to 340ms. The p50 latency is unchanged at 45ms. What does this pattern indicate?

5 / 5

A dashboard shows request rate dropping 40% at 02:00 UTC and recovering at 08:00 UTC. There are no alerts and no incidents. How would you describe this in a daily standup?