System Design: Solving the Top K Problem
The "Top K" problem (or Heavy Hitters) is about finding the K most frequent items in a massive stream of data. For example:
- YouTube: The top 10 trending videos in the last hour.
- Twitter: The top 50 trending hashtags globally.
- E-commerce: The top-selling products across all categories.
1. Core Requirements
- Real-time: Results must be updated almost instantly.
- High Volume: Handling millions of events per second.
- Accuracy vs. Efficiency: At scale, 100% accuracy is often too expensive; we need efficient probabilistic solutions.
2. Naive Approach: Hash Map
Maintain a Hash Map of item_id -> count.
- Problem: If you have billions of items, the hash map won't fit in RAM. If you store it on disk, it's too slow for real-time updates.
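As a baseline, the naive approach can be sketched with Python's built-in `Counter` (a minimal illustration that is perfectly fine when the distinct-item set fits in memory):

```python
from collections import Counter

def top_k_naive(stream, k):
    """Count every item exactly, then return the k items with the largest counts."""
    counts = Counter(stream)  # item_id -> exact count, held entirely in RAM
    return counts.most_common(k)

# Example: a small event stream of video IDs (hypothetical data)
events = ["v1", "v2", "v1", "v3", "v1", "v2"]
print(top_k_naive(events, 2))  # [('v1', 3), ('v2', 2)]
```

This is exact and simple, which is precisely why it is the right mental baseline: everything that follows trades some of this accuracy for bounded memory.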
3. The Scalable Solution: Count-Min Sketch
The Count-Min Sketch is a probabilistic data structure that estimates the frequency of items in a stream using a constant amount of memory.
- How it works:
  - An array of W columns and D rows is created, all initialized to 0, and D different hash functions are chosen (one per row).
  - When an item arrives, it is hashed by each function to find its position in each row, and that counter is incremented.
- Query: To find the frequency of an item, hash it again and take the minimum value from its positions across the D rows.
- Benefit: It uses fixed memory and provides an "upper bound" estimate with a very small error margin (hash collisions can only inflate a counter, never decrease it, so the minimum is the least-contaminated estimate).
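The mechanics above can be sketched in Python. This is a minimal illustration, not a production implementation: the width/depth values and the salted-MD5 trick for deriving D "independent" hash functions are assumptions made for brevity.

```python
import hashlib

class CountMinSketch:
    """Minimal Count-Min Sketch: D rows of W counters."""

    def __init__(self, width=1000, depth=5):
        self.width = width   # W columns per row
        self.depth = depth   # D rows, one hash function each
        self.table = [[0] * width for _ in range(depth)]

    def _positions(self, item):
        # Derive D positions by salting one hash with the row index
        # (an illustrative shortcut for D independent hash functions).
        for row in range(self.depth):
            digest = hashlib.md5(f"{row}:{item}".encode()).hexdigest()
            yield row, int(digest, 16) % self.width

    def add(self, item):
        for row, col in self._positions(item):
            self.table[row][col] += 1

    def estimate(self, item):
        # The minimum across rows is the least-inflated counter,
        # so the estimate is an upper bound on the true count.
        return min(self.table[row][col] for row, col in self._positions(item))

sketch = CountMinSketch()
for video in ["v1", "v1", "v2"]:
    sketch.add(video)
print(sketch.estimate("v1"))  # at least 2; may be higher if rows collide
```

Note that memory is fixed at W × D counters regardless of how many distinct items the stream contains, which is the whole point.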
4. Distributed Architecture
To handle millions of events per second:
- Ingestion: Events land in Apache Kafka.
- Aggregation: Apache Flink or Spark Streaming workers process partitions of the stream.
- Local Top K: Each worker maintains its own local Count-Min Sketch and Top K list.
- Global Top K: A central aggregator merges the local lists from all workers to produce the final global Top K.
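The aggregation step can be sketched as follows. This is a simplified illustration: in practice workers report periodically rather than once, and the aggregator must account for the fact that an item just below the local cutoff on every worker could be missed globally (local lists are usually kept larger than K to mitigate this).

```python
import heapq
from collections import Counter

def merge_top_k(local_lists, k):
    """Merge per-worker (item, count) lists into a single global Top K."""
    totals = Counter()
    for partial in local_lists:          # one list per worker
        for item, count in partial:
            totals[item] += count        # sum counts for items seen on several workers
    return heapq.nlargest(k, totals.items(), key=lambda kv: kv[1])

# Hypothetical local Top K lists from two Flink workers
worker_a = [("v1", 120), ("v2", 90)]
worker_b = [("v2", 60), ("v3", 50)]
print(merge_top_k([worker_a, worker_b], 2))  # [('v2', 150), ('v1', 120)]
```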
5. Time-Series Windowing
Trending topics change over time. We use Sliding Windows (e.g., 60-minute window sliding every 5 minutes).
- Optimization: Use a Lossy Counting algorithm or a time-decayed counter where older events contribute less to the total score.
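A time-decayed counter can be sketched like this. It is an illustrative exponential-decay variant (the half-life parameter is an arbitrary assumption); Lossy Counting proper is a different bucket-based algorithm.

```python
import math

class DecayedCounter:
    """Scores decay exponentially with age, so older events count for less."""

    def __init__(self, half_life=3600.0):
        # After `half_life` seconds, an event's contribution halves.
        self.decay = math.log(2) / half_life
        self.scores = {}  # item -> (score, last_update_time)

    def add(self, item, now, amount=1.0):
        score, last = self.scores.get(item, (0.0, now))
        # Decay the stored score to the present, then add the new event.
        score *= math.exp(-self.decay * (now - last))
        self.scores[item] = (score + amount, now)

    def score(self, item, now):
        score, last = self.scores.get(item, (0.0, now))
        return score * math.exp(-self.decay * (now - last))
```

With this scheme nothing ever needs to be subtracted when a window slides; a stale item's score simply fades toward zero and falls out of the Top K on its own.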
6. Storage
- Real-time: The current Top K list is stored in Redis for instant access by the front-end API.
- Historical: Full logs are stored in a data lake like Amazon S3 for long-term auditing and precise offline analysis.
Summary
The Top K problem is a classic example of the trade-off between Accuracy and Scale. By using probabilistic data structures like Count-Min Sketch and a distributed stream processing engine, you can track global trends in real time without overwhelming your infrastructure.
