Intermediate Numbers & Data #latency #percentiles #performance

📉 Latency Percentiles

5 exercises — practice reading and explaining p50/p95/p99 percentile latency, tail latency, and the "99th percentile latency" pattern in monitoring and SLA contexts.

0 / 5 completed

Key percentile vocabulary for performance discussions

"p50 = median: half of requests are faster, half are slower."
"p99 = tail latency: 99% of requests are under this value; the slowest 1% take longer."
"At 1,000 RPS, p99 = 10 users per second experiencing worst-case latency."
"Fat tail: when p99.9 is 100x higher than p50, you have a bimodal distribution with a specific slow mode."
"Percentiles are not additive: p95 of A + p95 of B ≠ p95 of A+B (requires measurement)."

1 / 5

A monitoring alert fires: "p99 latency: 4,200ms." The p50 is 45ms. What does this tell you?

2 / 5

Why do teams monitor p99 (or p99.9) latency rather than just p50 (median)?

3 / 5

An SLA states: "p95 response time under 200ms." A new microservice call adds 80ms to the critical path. What's the impact?

4 / 5

A dashboard shows: "p50: 12ms, p95: 45ms, p99: 210ms, p99.9: 3,200ms." Which statement correctly characterises the system's latency profile?

5 / 5

How should you describe p50/p95/p99 latency metrics to a non-technical product manager?