Advanced Interview Prep #kafka #flink #streaming #real-time

Streaming Data Engineer Interview Questions

5 exercises — practice structured English answers for streaming data engineering interviews covering Kafka internals, Flink processing, delivery semantics, late data, and pipeline testing.

How to structure streaming data interview answers
  • Delivery semantics: at-most-once → at-least-once → exactly-once → trade-offs and idempotent consumers
  • Late data: event time vs. processing time → watermarks → allowed lateness → side outputs
  • Kafka internals: partitions, consumer groups, offsets, replication, ISR, retention
  • Consumer lag: root causes (slow consumers, batch spikes) → monitoring → remediation
  • Stream testing: unit (transformation logic) → integration (Kafka test containers) → end-to-end
0 / 5 completed
1 / 5
The interviewer asks: "Explain the difference between at-least-once, at-most-once, and exactly-once delivery in a streaming pipeline."
Which answer is most precise?