API Rate Limiting & Tiers Vocabulary

Covers rate limits, quotas, burst limits, throttling, free vs. paid tiers, rate limit headers (Retry-After, X-RateLimit-*), quota resets, and tier upgrades.

Key vocabulary

Rate limit — the maximum number of API requests a client can make in a given time window (e.g., 100 requests per minute).
Quota — a longer-term usage ceiling, often monthly (e.g., 10,000 calls/month on the free tier).
Burst limit — the size of the short spike of requests a client may send above the sustained rate limit before throttling kicks in.
Throttle — to slow down or temporarily block requests that exceed the rate limit; a throttled API typically returns HTTP 429 Too Many Requests.
X-RateLimit-Remaining — a response header telling the client how many requests it has left in the current window.
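
The terms above can be tied together in a minimal sketch of a token bucket, one common way to enforce a sustained rate while still allowing a burst. This is an illustration under stated assumptions, not how any particular API works: the names TokenBucket, check, and seconds_to_wait are hypothetical, time is passed in explicitly as seconds, and only the numeric form of Retry-After is handled.

```python
import math
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass
class TokenBucket:
    """Hypothetical limiter: `rate` requests/second sustained, `capacity` as the burst limit."""
    rate: float          # sustained refill rate, in requests per second
    capacity: int        # burst limit: most requests allowed back to back
    tokens: float = 0.0  # tokens currently available
    updated: float = 0.0 # time of the last check, in seconds

    def __post_init__(self) -> None:
        self.tokens = float(self.capacity)  # start with a full bucket

    def check(self, now: float) -> Tuple[int, Dict[str, str]]:
        """Return (HTTP status, headers) for a request arriving at time `now`."""
        # Refill for the time elapsed since the last check, capped at capacity.
        elapsed = now - self.updated
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated = now

        if self.tokens >= 1:
            self.tokens -= 1
            return 200, {"X-RateLimit-Remaining": str(int(self.tokens))}

        # Throttled: report how many whole seconds until one token is available.
        wait = math.ceil((1 - self.tokens) / self.rate)
        return 429, {"Retry-After": str(wait), "X-RateLimit-Remaining": "0"}


def seconds_to_wait(status: int, headers: Dict[str, str]) -> int:
    """Client side: how long to pause before retrying (assumes numeric Retry-After)."""
    if status != 429:
        return 0
    return int(headers.get("Retry-After", "1"))  # fall back to 1 second if absent
```

With TokenBucket(rate=1.0, capacity=3), three calls to check(0.0) return 200 and a fourth returns 429 with Retry-After set to 1; a call at check(1.0) succeeds again because one token has been refilled.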

1 / 5

A developer receives HTTP 429 Too Many Requests. What has happened?

2 / 5

What is the difference between a rate limit and a quota?

3 / 5

An API allows 60 requests/minute sustained, but permits up to 200 requests in the first 10 seconds before throttling. The 200-request allowance is called a:

4 / 5

A response header X-RateLimit-Remaining: 23 tells a developer:

5 / 5

A SaaS API offers a free tier and a paid tier. A user has exhausted their free monthly quota. What options does the API typically offer?