How to Explain a Message Queue Backlog in English

Learn the English vocabulary and phrases needed to explain a growing message queue backlog to your team, including why consumers can't keep up and what's being done about it.

A message queue backlog is a slow-moving incident — nothing crashes outright, but a growing gap between how fast messages arrive and how fast they’re processed quietly turns into delayed notifications, stale data, or missed deadlines. Explaining it clearly in English, especially the distinction between a temporary blip and a structural capacity problem, helps a team decide how urgently to respond.

Key Vocabulary

Backlog (queue depth) — the number of messages waiting to be processed in a queue, which grows whenever the rate of incoming messages exceeds the rate consumers can process them. “Queue depth has been climbing steadily for the last two hours — we’re not draining faster than messages are arriving.”

Consumer lag — the delay between when a message is produced and when it’s actually processed by a consumer, a key indicator of whether a backlog is affecting real-time behavior. “Consumer lag is now sitting at twenty minutes, which means anything relying on near-real-time processing is noticeably behind.”

Throughput — the rate at which a consumer can process messages, typically measured in messages per second, and the number that ultimately determines whether a backlog grows or shrinks. “Our consumer’s throughput dropped by half after the last deploy, which is exactly when the backlog started climbing.”

Dead letter queue — a separate queue where messages that repeatedly fail processing are routed, so they don’t block the main queue or get silently dropped. “About five percent of these messages are landing in the dead letter queue because of a malformed payload, and we need to look at those separately from the backlog itself.”

Horizontal scaling (of consumers) — adding more consumer instances to process messages in parallel, one of the primary ways to increase throughput and drain a backlog faster. “We’re horizontally scaling the consumer group from four to twelve instances temporarily, which should let us drain this backlog within the hour.”

Explaining the Root Cause

  • “Queue depth has been growing since our last deploy, and it lines up with a drop in consumer throughput, not an increase in incoming messages.”
  • “This isn’t a queue failure — the queue is doing exactly what it’s supposed to, buffering messages faster than we’re currently able to process them.”
  • “About five percent of the backlog is actually malformed messages repeatedly failing and should be routed to the dead letter queue instead of retried indefinitely.”

Communicating What Needs to Change

  • “I’m scaling the consumer group horizontally right now to increase throughput and start draining the backlog faster than it’s growing.”
  • “We need to fix whatever regressed in the last deploy that dropped consumer throughput, or scaling out will just be a temporary patch.”
  • “Let’s route the repeatedly failing messages to the dead letter queue so they stop consuming retry attempts that could go toward healthy messages.”

Verifying the Fix Together

  • “Can we watch queue depth over the next thirty minutes and confirm it’s actually shrinking, not just growing more slowly?”
  • “Let’s check consumer lag specifically, since queue depth alone doesn’t tell us whether real-time processing has recovered.”
  • “Once this is drained, let’s review whether we need permanent additional consumer capacity or if this was a one-off spike.”

Professional Tips

  1. Distinguish queue depth from consumer lag. They measure related but different things, and reporting the wrong one can make an incident look either more or less urgent than it actually is for anything depending on real-time processing.
  2. State whether the cause is more messages or slower processing. These require completely different fixes — the first needs more capacity or upstream throttling, the second needs a code or infrastructure fix — and conflating them wastes time.
  3. Mention the dead letter queue explicitly when relevant. Separating genuinely stuck, malformed messages from the healthy backlog prevents a team from assuming the entire backlog will drain once throughput improves.

Practice Exercise

  1. Write two sentences explaining to a teammate the difference between queue depth and consumer lag.
  2. Describe, in one sentence, why horizontally scaling consumers might not fix a backlog if the root cause is a deploy that reduced per-consumer throughput.
  3. Draft a short message explaining that a portion of a backlog is landing in the dead letter queue and won’t drain along with the rest.