Intermediate Vocabulary #devops #backend #developer-tools

Tail Latency Percentiles Vocabulary

Build fluency in the vocabulary for measuring how slow the worst requests actually are, not just how slow the average request is.

1 / 5

At standup, a dev mentions reporting the response time that ninety-nine percent of requests fall under, rather than just the average response time across all requests. What metric is being described?
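To make the contrast in this scenario concrete, here is a minimal sketch that compares the average with the value that ninety-nine percent of requests fall under. The sample latencies are invented for illustration, and the nearest-rank method used here is only one of several common percentile definitions; monitoring systems often estimate percentiles from histograms instead.

```python
import math

def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # 1-based rank; multiply before dividing to keep integer inputs exact.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]

# Hypothetical data: 98 fast requests and 2 very slow ones (milliseconds).
latencies_ms = [20] * 98 + [2000] * 2

mean_ms = sum(latencies_ms) / len(latencies_ms)
p50_ms = percentile(latencies_ms, 50)
p99_ms = percentile(latencies_ms, 99)

print(f"mean={mean_ms:.1f} ms  p50={p50_ms} ms  p99={p99_ms} ms")
```

With this made-up data the mean sits near 60 ms, a value that no request actually experienced, while the 99th percentile exposes the 2000 ms outliers.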

2 / 5

During a design review, the team notes that a user-facing request that calls several downstream services in parallel is only as fast as its single slowest downstream call. Which concept describes this?
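The arithmetic behind this scenario can be sketched in a few lines. The function names are hypothetical, and the probability calculation assumes the downstream calls are statistically independent and that each one exceeds its own 99th-percentile latency 1% of the time; real services are often correlated, so treat the numbers as a rough guide rather than a prediction.

```python
def fanout_latency_ms(call_latencies_ms):
    """A request that waits on parallel calls finishes when the slowest one does."""
    return max(call_latencies_ms)

def prob_any_call_slow(n_calls, per_call_slow_prob=0.01):
    """Chance that at least one of n independent calls lands in its slow tail.

    per_call_slow_prob=0.01 corresponds to 'slower than that call's p99'.
    """
    return 1 - (1 - per_call_slow_prob) ** n_calls

# Hypothetical latencies for four parallel downstream calls (milliseconds).
print(fanout_latency_ms([12, 15, 480, 9]))

for n in (1, 10, 100):
    print(n, round(prob_any_call_slow(n), 3))
```

Under these assumptions, a rare slow response from any single dependency becomes a common experience for the user-facing request as the number of parallel calls grows.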

3 / 5

In a code review, a dev notices an alerting rule is configured against the p99 latency rather than the average latency, specifically to catch a growing tail-latency problem before it drags the average up too. What does this represent?

4 / 5

An incident report shows that a growing number of users experienced slow page loads for days before anyone noticed, because the team's dashboards and alerts were built entirely around average latency, which barely moved even as the tail worsened significantly. What practice would prevent this?
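The failure mode in this incident can be reproduced with a small sketch. Everything here is hypothetical: the sample data, the 20% tolerance, and the `breaches` helper stand in for whatever rule a real alerting system would evaluate.

```python
import math

def nearest_rank(samples, pct):
    """Nearest-rank percentile over a non-empty list of samples."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]

def summarize(samples):
    """Return the two statistics a dashboard might plot."""
    return {
        "mean": sum(samples) / len(samples),
        "p99": nearest_rank(samples, 99),
    }

def breaches(baseline, current, tolerance=0.20):
    """Hypothetical alert rule: fire when current exceeds baseline by more than tolerance."""
    return current > baseline * (1 + tolerance)

# Made-up data (milliseconds): 1.5% of requests degrade from 100 ms to 1000 ms.
healthy = [100] * 1000
degraded = [100] * 985 + [1000] * 15

before = summarize(healthy)
after = summarize(degraded)

mean_alert = breaches(before["mean"], after["mean"])
p99_alert = breaches(before["p99"], after["p99"])

print(f"mean: {before['mean']:.1f} -> {after['mean']:.1f} ms, alert={mean_alert}")
print(f"p99:  {before['p99']} -> {after['p99']} ms, alert={p99_alert}")
```

In this sketch the mean rises by about 13% and stays under the tolerance, while the 99th percentile grows tenfold and trips the same rule.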

5 / 5

During a PR review, a teammate asks why the team reports and alerts on a tail-latency percentile instead of just relying on the average response time across all requests. What is the reasoning?