Advanced Interview #databases #storage-engines #postgresql #interview-prep

Database Internals Engineer Interview Questions

5 exercises — choose the best-structured answer to common Database Internals Engineer interview questions. Focus on storage engine trade-offs, MVCC, WAL recovery, buffer management, and query optimisation.

Structure for database internals interview answers

Name the trade-off dimensions: read/write/space amplification, latency vs. throughput
Explain failure recovery: what guarantees hold after a crash and how they are achieved
Give concrete parameters: WAF numbers, buffer sizes, XID limits, tuning knobs
Cover the failure modes: bloat, wraparound, thrashing, over-indexing

0 / 5 completed

1 / 5

The interviewer asks: "Compare LSM-trees and B-trees for a write-heavy workload — explain write amplification, read amplification, and space amplification for each."
Which answer best covers the trade-off analysis?

2 / 5

The interviewer asks: "Explain how MVCC (Multi-Version Concurrency Control) works in PostgreSQL — how are versions stored, how are they cleaned up, and what can go wrong?"
Which answer demonstrates the deepest understanding?

3 / 5

The interviewer asks: "Explain the WAL (Write-Ahead Log) recovery process in a crash scenario — what are the ARIES phases and how does PostgreSQL implement them?"
Which answer best covers crash recovery?

4 / 5

The interviewer asks: "How does a database buffer pool work, and what are the key eviction policy trade-offs between LRU and clock-sweep?"
Which answer best covers buffer pool internals?

5 / 5

The interviewer asks: "Design an index advisor for a relational database — what signals would you collect, how would you recommend indexes, and how do you avoid over-indexing?"
Which answer demonstrates the most complete design?