Intermediate Numbers #metrics #performance #SRE

Performance Benchmarks

5 exercises on the language of performance — reading p99/p95 metrics, expressing uptime in "nines", describing improvements, and calculating error rates.

Key performance metric vocabulary
  • p50 / p95 / p99 — percentile latency (say: "p-ninety-nine")
  • three nines = 99.9% uptime = ~8h 46m downtime/year
  • RPS / TPS / QPS — throughput units (requests/transactions/queries per second)
  • error rate = failed requests ÷ total requests × 100%
  • Describe changes: "decreased/dropped from X to Y — an N% reduction"
0 / 5 completed
1 / 5
A monitoring dashboard shows: "p99 latency: 420ms".
What does this metric mean to a backend engineer?