Performance Metrics Vocabulary
Latency, throughput, p95/p99, baseline, degradation — core vocabulary for discussing performance metrics. Intermediate
0 / 5 completed
1 / 5
A monitoring dashboard shows: "The p99 latency for the checkout API is 2.3 seconds."
What does "p99 latency" mean?
Percentile latency is more meaningful than average because it captures the tail of the distribution — the slowest requests that real users actually experience. A fast average can hide a badly slow 1%.
| Percentile | Meaning |
|---|---|
| p50 | Median — 50% of requests are faster than this |
| p95 | 95% of requests complete faster than this — only 5% are slower |
| p99 | 99% of requests complete faster — captures "tail latency" affecting 1% of requests |
| p999 | 99.9th percentile — used for high-traffic services where 0.1% still = many users |
Key vocabulary: tail latency, percentile distribution, mean vs. percentile (mean hides outliers), latency SLO defined at p99.