Intermediate SRE / DevOps #latency #throughput #collocations

Latency & Throughput Collocations

5 exercises on the language engineers use to describe how fast and how much: latency, percentiles, throughput, round-trip time, and the hot path.

Key patterns

reduce / cut / shave latency; measure p95 / p99 tail latency
increase / boost / scale throughput, in requests per second
reduce round-trip time (RTT); collapse round trips
keep work off the hot path
latency = time per request; throughput = requests per time

0 / 5 completed

1 / 5

A platform engineer writes: "After adding the cache, we managed to ___ p99 latency from 800ms to 120ms."
Which verb is the standard collocation for bringing latency down?

2 / 5

A load-testing report states: "At peak load the service handled 12,000 ___ — well above our target."
Which phrase is the standard unit of throughput?

3 / 5

A network engineer explains a delay: "Most of the slowness is ___ — the packet has to cross the Atlantic and back for every call."
Which term names that out-and-back travel time?

4 / 5

During an optimisation review someone says: "This allocation runs on the ___, so even a tiny inefficiency is multiplied millions of times."
Which term describes the most frequently executed code route?

5 / 5

A capacity planner reports: "We need to ___ throughput to handle Black Friday — current limit is 8k RPS, we expect 20k."
Which verb best expresses raising the system's rate of work?