IntermediateVocabulary#devops#architecture#developer-tools

Horizontal Pod Autoscaling Vocabulary

Build fluency in the vocabulary of automatically adjusting replica count based on an observed metric.

0 / 5 completed

1 / 5

At standup, a dev mentions a Kubernetes controller that automatically adjusts the number of running pod replicas based on an observed metric like CPU utilization. What is this controller called?

2 / 5

During a design review, the team wants the HPA configured with a specific target, like 70 percent average CPU utilization, that it tries to maintain by adding or removing replicas. Which capability supports this?

3 / 5

In a code review, a dev notices the HPA is configured with a stabilization window that prevents it from scaling down again too soon after a recent scale-up, avoiding rapid oscillation. What does this represent?

4 / 5

An incident report shows a service's replica count oscillated rapidly up and down for hours, adding churn and occasional latency spikes during each scale-up, because no stabilization window had been configured on its HPA. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team relies on an HPA instead of just setting a fixed replica count sized for the service's expected peak traffic. What is the reasoning?