B-Trees vs. LSM-Trees: Choosing Your Storage Engine
At the heart of every database is a storage engine that decides how data is laid out on disk. The two dominant architectures are B-Trees and LSM-Trees, and understanding the difference is key to picking the right database for your workload.
1. B-Trees: The Read-Optimized Classic
Used by: PostgreSQL, MySQL, MongoDB, Oracle.
- The Structure: A B-Tree is a self-balancing tree that stores data in fixed-size blocks (pages).
- The Process: When you update a record, the B-Tree finds the specific page and overwrites it in place.
- Pros: Fast, predictable read performance (O(log N) page accesses). Great for range scans and applications where data is read more often than written.
- Cons: Write amplification. Every small update requires reading and rewriting an entire page (usually 4KB or 8KB).
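The read-modify-write cycle behind that write amplification can be sketched in a few lines. This is a minimal illustration, not a real B-Tree: it just shows that changing a handful of bytes inside a fixed-size page means reading and rewriting the whole page.

```python
PAGE_SIZE = 4096  # typical B-Tree page size (4 KB)

def update_record(path: str, page_no: int, offset: int, new_bytes: bytes) -> None:
    """Overwrite a few bytes inside a page, in place.

    Logically only len(new_bytes) bytes change, but the engine reads
    and rewrites the entire 4 KB page: this is write amplification.
    """
    with open(path, "r+b") as f:
        f.seek(page_no * PAGE_SIZE)
        page = bytearray(f.read(PAGE_SIZE))          # read the whole page
        page[offset:offset + len(new_bytes)] = new_bytes
        f.seek(page_no * PAGE_SIZE)
        f.write(page)                                # rewrite the whole page
```

Real engines add a write-ahead log (WAL) entry on top of this, and may also rewrite the page again at checkpoint time, which amplifies writes further.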
2. LSM-Trees: The Write-Optimized Modernist
Used by: Cassandra, RocksDB (developed at Meta), LevelDB, DynamoDB.
- The Structure: A Log-Structured Merge Tree doesn't update data in place. Instead, it turns all writes into sequential appends.
- The Process:
- Writes go to an in-memory Memtable.
- When the Memtable is full, it's flushed to disk as an immutable SSTable.
- In the background, a Compaction process merges these files.
- Pros: Incredible write throughput. Writes are sequential, making them extremely fast on both HDD and SSD.
- Cons: Read amplification. To find a key, the engine may have to check the Memtable and multiple SSTables. (Though Bloom Filters help mitigate this).
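The Memtable/SSTable/Compaction pipeline above can be captured in a toy sketch. This is a deliberately simplified model (the class name, the dict-backed Memtable, and the linear SSTable scan are all illustrative; real engines binary-search sorted files and consult Bloom filters first):

```python
class MiniLSM:
    """Toy LSM engine: mutable in-memory Memtable + immutable SSTable runs."""

    def __init__(self, memtable_limit: int = 4):
        self.memtable = {}           # absorbs all writes in RAM
        self.sstables = []           # immutable sorted runs, newest last
        self.memtable_limit = memtable_limit

    def put(self, key, value):
        self.memtable[key] = value                  # writes never touch old data
        if len(self.memtable) >= self.memtable_limit:
            self.flush()

    def flush(self):
        # Freeze the Memtable as a sorted, immutable SSTable.
        self.sstables.append(sorted(self.memtable.items()))
        self.memtable = {}

    def get(self, key):
        # Read amplification: check the Memtable, then SSTables newest-first.
        if key in self.memtable:
            return self.memtable[key]
        for run in reversed(self.sstables):
            for k, v in run:
                if k == key:
                    return v
        return None

    def compact(self):
        # Merge all runs into one; the newest value for each key wins.
        merged = {}
        for run in self.sstables:
            merged.update(dict(run))
        self.sstables = [sorted(merged.items())]
```

Note how get() may have to search every run before concluding a key is absent; that is exactly the read amplification compaction and Bloom filters exist to reduce.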
3. The Trade-off: Read vs. Write Amplification
- Read Amplification: One logical read requires multiple physical disk reads. High in LSM-Trees.
- Write Amplification: One logical write requires multiple physical disk writes. High in B-Trees (a small update forces a full-page write, plus WAL overhead).
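A back-of-envelope calculation makes the B-Tree side of the trade-off concrete. The 4 KB page and 100-byte row below are illustrative numbers, not measurements from any particular engine:

```python
PAGE_SIZE = 4096   # bytes: one B-Tree page
ROW_SIZE = 100     # bytes: the logical update

# Updating one 100-byte row in place forces a full-page write,
# and the change is also logged once in the WAL.
physical_bytes = PAGE_SIZE + ROW_SIZE
write_amplification = physical_bytes / ROW_SIZE
print(f"B-Tree write amplification: ~{write_amplification:.0f}x")
```

An LSM-Tree turns that same update into a small sequential append, though it pays some of the cost back later when compaction rewrites the data at each level.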
Summary
- Choose B-Trees (Postgres/Mongo) for standard CRUD apps, content management systems, and relational data where read query latency is the primary concern.
- Choose LSM-Trees (Cassandra/RocksDB) for logging, telemetry, IoT data, and high-frequency messaging systems where you need to ingest millions of events per second.
By matching your data access patterns to the underlying storage structure, you can avoid costly performance bottlenecks as your system scales.
