AdvancedVocabulary#algorithms#backend#distributed-systems

LSM Tree Vocabulary

Build fluency in the vocabulary of buffering writes in memory and flushing them as immutable sorted files merged later.

0 / 5 completed

1 / 5

A teammate explains that a storage engine buffers writes in an in-memory sorted structure, periodically flushes it to disk as an immutable sorted file, and later merges those files in the background, so writes stay sequential and fast even though reads may need to check several files. What data structure is being described?

2 / 5

During a design review, the team chooses an LSM-tree-based storage engine for a write-heavy time-series database, specifically because writes only need to append to an in-memory buffer and a sequential flush file rather than performing random-access disk writes. Which capability does this provide?

3 / 5

In a code review, a dev notices a storage engine performs an in-place random-access disk write for every incoming data point in a write-heavy time-series workload, instead of buffering writes in memory and flushing them sequentially as an LSM tree would. What does this represent?

4 / 5

An incident report shows write throughput collapsed under load because every incoming data point triggered an in-place random-access disk write, and disk seek time dominated the write path once concurrent writers scaled up. What practice would prevent this?

5 / 5

During a PR review, a teammate asks why the team reaches for an LSM tree instead of a B-tree, given that a B-tree gives more predictable single-key read latency without needing to check multiple files. What is the reasoning?