How to Discuss Load Testing Results in English
Learn the English vocabulary and phrases for presenting load and performance testing results clearly to engineering and non-technical teams.
Presenting load testing results well is as important as running the test itself — a poorly explained report can hide a real risk or trigger unnecessary panic. Engineers need to summarize throughput, latency, and failure points in a way that both technical peers and less technical stakeholders can act on. This post covers the vocabulary and phrasing you need to present load test findings clearly, in status updates, readiness reviews, or postmortems.
Key Vocabulary
Throughput — the number of requests or transactions a system can successfully process per unit of time under load. “We sustained a throughput of 2,400 requests per second before latency began to degrade.”
Latency percentile (p95/p99) — a statistical measure showing the response time below which a given percentage of requests fall, used to describe tail performance rather than averages. “Average latency looked fine at 80ms, but the p99 spiked to 1.2 seconds under peak load.”
Breaking point — the load level at which the system’s performance degrades sharply or fails, identifying the practical ceiling of current capacity. “We found the breaking point at around 5,000 concurrent users, where error rates jumped from 0.1% to 12%.”
Ramp-up period — the phase of a load test during which simulated traffic gradually increases to the target level, rather than starting at full load immediately. “We used a 10-minute ramp-up period to avoid triggering false alarms from cold caches.”
Bottleneck — the specific component or resource that limits overall system performance, identified as the primary cause of degradation under load. “Database connection pooling turned out to be the bottleneck, not the application servers themselves.”
Soak test — a load test run over an extended period at a sustained, realistic load to detect issues like memory leaks that only appear over time. “The soak test ran for six hours and revealed a slow memory leak in the caching layer.”
Error budget — the acceptable margin of failed or degraded requests during a load test, used as a pass/fail threshold against the system’s reliability target. “We stayed within our error budget of 0.5% throughout the test, so we’re calling this a pass.”
Common Phrases
- “The system held steady up to 3,000 concurrent users, with p95 latency under 200ms.”
- “We identified the database as the primary bottleneck during peak load.”
- “Error rates stayed within acceptable thresholds until we hit roughly 80% of target capacity.”
- “We recommend scaling the connection pool before the next load test.”
- “These results indicate we’re ready for the projected launch traffic with some margin.”
- “This is a blocking finding — we should not go live until the bottleneck is resolved.”
Example Sentences
Announcing load test results to the team: “Load testing for the checkout service is complete. We sustained the target of 1,500 requests per second with p95 latency at 180ms and zero errors. We’re confident this meets the capacity requirements for Friday’s launch.”
Escalating a concerning finding: “During today’s load test, we saw error rates climb to 9% once concurrent users passed 4,000, well below our projected peak of 6,000. Root cause appears to be connection pool exhaustion on the payments service. I’d like to treat this as a blocker for launch until we’ve resolved it and re-tested.”
Following up after remediation: “We re-ran the load test after increasing the connection pool size and adding a circuit breaker. Error rates now stay under 0.2% up to 7,000 concurrent users, comfortably above our projected peak. Attaching the full report for reference.”
Professional Tips
- Report both the average and the p95/p99 latency — averages alone can hide serious tail-latency problems that affect real users.
- State the breaking point explicitly in relation to expected production traffic, so stakeholders can judge the safety margin at a glance.
- Use “blocker” deliberately when a finding should stop a release — reserving the word for genuine blockers keeps it meaningful.
- When presenting to non-technical stakeholders, translate technical metrics into business impact, e.g. “the system can handle roughly 3x our expected Black Friday traffic.”
Practice Exercise
- Write a two-sentence summary of a load test that passed comfortably, including one specific metric.
- Draft an escalation message describing a load test that revealed a bottleneck, and recommend it as a blocker for release.
- Explain, in two sentences, why p95 latency is more useful than average latency when discussing load test results.