System Design: Designing a Video Conferencing System (Zoom / MS Teams)

How does Zoom handle 1,000 participants in a single call with low latency? A technical deep dive into WebRTC, SFU vs. MCU, and UDP vs. TCP.

Sachin Sarawgi • April 20, 2026 • 3 min read

#system-design #webrtc #video-conferencing #streaming #low-latency #scalability

Designing a real-time video conferencing system like Zoom or Microsoft Teams is fundamentally different from designing a video streaming service like YouTube. While YouTube prioritizes quality and high resolution and can buffer several seconds of video, Zoom prioritizes latency. Once one-way delay rises much beyond 150 ms, people start talking over each other and natural conversation breaks down.

1. Core Requirements

Real-time Video/Audio: Bi-directional streaming with sub-200ms latency.
Large Meetings: Supporting hundreds or thousands of participants.
Screen Sharing: Sharing a high-resolution, low-framerate stream.
Resilience: Handling varying network conditions (packet loss, low bandwidth).

2. The Protocol: UDP vs. TCP

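The Choice: UDP (User Datagram Protocol) is the default choice for real-time media. TCP is normally used only as a fallback, for example when a firewall blocks UDP.
Why? TCP guarantees reliable, in-order delivery by re-sending lost packets, and every packet behind a lost one has to wait for the retransmission (head-of-line blocking). In a call, it's better to lose a single frame of video (a minor glitch) than to pause the whole call to wait for that frame to arrive.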
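To make the trade-off concrete, here is a minimal sketch rather than a model of real TCP or UDP stacks. It assumes a fixed one-way delay and that a lost packet arrives one retransmission timeout later; the function name, parameters, and all numbers are illustrative assumptions.

```python
from typing import List, Optional, Set


def delivery_times(
    send_times: List[float],
    lost: Set[int],
    one_way_ms: float,
    rto_ms: float,
    reliable: bool,
) -> List[Optional[float]]:
    """Return when each packet is handed to the player (None = never)."""
    out: List[Optional[float]] = []
    last_delivered = 0.0
    for i, sent in enumerate(send_times):
        if i in lost:
            if not reliable:
                out.append(None)  # UDP-style: the frame is simply skipped
                continue
            # TCP-style: the packet is re-sent after a timeout (assumed model)
            arrival = sent + rto_ms + one_way_ms
        else:
            arrival = sent + one_way_ms
        if reliable:
            # In-order delivery: later packets wait behind the missing one
            arrival = max(arrival, last_delivered)
            last_delivered = arrival
        out.append(arrival)
    return out


# One packet every 20 ms, packet 1 is lost, 50 ms one-way delay, 200 ms timeout
sends = [0, 20, 40, 60]
udp_like = delivery_times(sends, {1}, 50, 200, reliable=False)
tcp_like = delivery_times(sends, {1}, 50, 200, reliable=True)
```

Under these assumptions the UDP-like path drops one packet and delivers the rest on schedule, while the TCP-like path holds packets 2 and 3 until the retransmission of packet 1 arrives.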

3. Communication Technology: WebRTC

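WebRTC is the standard for real-time communication in the browser. A WebRTC-based system relies on:

STUN/TURN Servers: STUN lets a client discover its public address so peers behind NATs can connect directly; TURN relays the media when no direct path works. ICE uses both to find the best path between peers.
Signaling: Exchanging metadata (like "I'm calling you", plus session descriptions and network candidates) before the media starts flowing. WebRTC does not define the signaling transport; WebSockets are a common choice.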
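The signaling exchange can be sketched as a small relay. This is a hypothetical in-memory stand-in for a WebSocket server, not a real WebRTC API: the class name, message fields, and placeholder payloads are all assumptions chosen for illustration.

```python
from typing import Callable, Dict, List

ALLOWED_TYPES = {"offer", "answer", "candidate"}


class SignalingRoom:
    """Hypothetical in-memory stand-in for a WebSocket signaling server."""

    def __init__(self) -> None:
        self._peers: Dict[str, Callable[[dict], None]] = {}

    def join(self, peer_id: str, on_message: Callable[[dict], None]) -> None:
        # In a real deployment on_message would write to the peer's socket.
        self._peers[peer_id] = on_message

    def send(self, sender: str, target: str, msg_type: str, payload: str) -> None:
        if msg_type not in ALLOWED_TYPES:
            raise ValueError(f"unsupported signaling message: {msg_type}")
        if target not in self._peers:
            raise KeyError(f"peer not in room: {target}")
        # The server only relays metadata; media never passes through it.
        self._peers[target]({"from": sender, "type": msg_type, "payload": payload})


room = SignalingRoom()
alice_inbox: List[dict] = []
bob_inbox: List[dict] = []
room.join("alice", alice_inbox.append)
room.join("bob", bob_inbox.append)

room.send("alice", "bob", "offer", "<SDP offer placeholder>")
room.send("bob", "alice", "answer", "<SDP answer placeholder>")
room.send("alice", "bob", "candidate", "<ICE candidate placeholder>")
```

Once the offer, answer, and candidates have been exchanged this way, the peers (or the peer and a media server) can set up the media path without further help from the signaling server.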

4. Scaling the Meeting: SFU vs. MCU

How do you deliver 100 video streams to 100 participants?

Option A: Peer-to-Peer (Mesh)

Every user sends their stream to every other user.

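Limit: Only works for very small calls (2-3 people). Each user uploads a separate copy of their stream to every other participant, so upload bandwidth grows with every participant and is quickly exhausted.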
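The arithmetic behind that limit is easy to check. The sketch below assumes every participant sends one video stream at a fixed bitrate; the 1,500 kbps figure and the function name are illustrative assumptions, not measurements.

```python
def mesh_load(participants: int, stream_kbps: int) -> dict:
    """Per-user bandwidth and total connection count in a full mesh."""
    peers = participants - 1
    return {
        # One copy of the local stream is uploaded to every other peer
        "upload_kbps_per_user": peers * stream_kbps,
        # One stream is downloaded from every other peer
        "download_kbps_per_user": peers * stream_kbps,
        # Each pair of users needs its own connection: n * (n - 1) / 2
        "total_connections": participants * peers // 2,
    }


small_call = mesh_load(3, 1500)
larger_call = mesh_load(10, 1500)
```

With these assumed numbers a 3-person call needs 3,000 kbps of upload per user, while a 10-person call needs 13,500 kbps, which is more than many home and mobile uplinks can sustain.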

Option B: MCU (Multipoint Control Unit)

The server receives all streams, mixes them into one single video (like a collage), and sends that one stream to everyone.

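Pros: Low bandwidth and low decoding cost for the client, which receives a single stream.
Cons: Extremely CPU-intensive for the server, which must decode, composite, and re-encode every stream. The extra processing also adds latency.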

Option C: SFU (Selective Forwarding Unit) - The Standard

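The server receives all streams but doesn't mix them. It simply forwards the relevant streams to each participant, which costs far less server CPU than mixing because nothing is decoded or re-encoded.

The Optimization: The SFU forwards only what each receiver needs. If a participant is muted and their camera is off, the SFU stops forwarding their data, and in a large meeting each client typically receives only the active speakers and the tiles currently on screen. This is how a service like Zoom can scale to 1,000 people.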
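The forwarding decision described above could look roughly like the following. This is a sketch under stated assumptions, not Zoom's actual logic: the `Publisher` fields, the audio-level ranking, and the `max_videos` cap are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Publisher:
    user_id: str
    camera_on: bool
    mic_on: bool
    audio_level: float  # hypothetical loudness score between 0.0 and 1.0


def streams_to_forward(
    publishers: List[Publisher], receiver_id: str, max_videos: int
) -> List[str]:
    """Pick whose video the SFU forwards to one receiver."""
    # Never echo a user's own video back, and skip cameras that are off
    candidates = [
        p for p in publishers if p.user_id != receiver_id and p.camera_on
    ]
    # Loudest unmuted speakers first; muted users rank as silent.
    # The sort is stable, so ties keep their original (join) order.
    candidates.sort(
        key=lambda p: p.audio_level if p.mic_on else 0.0, reverse=True
    )
    return [p.user_id for p in candidates[:max_videos]]


meeting = [
    Publisher("a", camera_on=True, mic_on=True, audio_level=0.9),
    Publisher("b", camera_on=False, mic_on=False, audio_level=0.0),
    Publisher("c", camera_on=True, mic_on=False, audio_level=0.5),
    Publisher("d", camera_on=True, mic_on=True, audio_level=0.2),
]
for_d = streams_to_forward(meeting, "d", max_videos=2)
for_a = streams_to_forward(meeting, "a", max_videos=1)
```

Capping the number of forwarded videos per receiver keeps each client's download roughly constant as the meeting grows, which is the property that lets the design scale.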

5. Handling Network Jitter (Adaptive Bitrate)

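Simulcast: The client encodes its video at several quality levels at once (typically three: High, Medium, Low) and sends all of them to the SFU. The SFU then forwards the High-quality version to users with fast connections and the Low-quality version to users on slow mobile data, switching layers as each receiver's available bandwidth changes.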
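A per-receiver layer choice can be sketched in a few lines. The layer bitrates and the 80% headroom factor below are assumptions for illustration; a real SFU would derive the bandwidth estimate from congestion-control feedback.

```python
# Assumed simulcast layers, ordered from best to worst quality (name, kbps)
LAYERS = [("high", 2500), ("medium", 800), ("low", 200)]


def pick_layer(estimated_downlink_kbps: float, headroom: float = 0.8) -> str:
    """Choose the best layer that fits within a fraction of the estimate."""
    # Leave some headroom so audio and bandwidth fluctuations still fit
    budget = estimated_downlink_kbps * headroom
    for name, kbps in LAYERS:
        if kbps <= budget:
            return name
    # Nothing fits: fall back to the lowest layer instead of sending nothing
    return LAYERS[-1][0]


fast_wifi = pick_layer(5000)
home_dsl = pick_layer(1200)
weak_mobile = pick_layer(300)
```

Because the choice is made per receiver, one participant on a weak connection does not drag down the quality that everyone else sees.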

6. Global Scalability

Video servers must be placed in data centers geographically close to participants to minimize the "Speed of Light" delay.

Geo-routing: If users in London are talking, the meeting should be hosted on a server in London, not New York.
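One simple way to sketch geo-routing is to host the meeting in the region that minimizes the distance to the farthest participant. The region list, coordinates, and selection rule are assumptions for illustration; a production system would more likely rely on measured round-trip times than on geographic distance.

```python
import math
from typing import Dict, List, Tuple

Coord = Tuple[float, float]  # (latitude, longitude) in degrees

# Hypothetical media-server regions
REGIONS: Dict[str, Coord] = {
    "london": (51.5074, -0.1278),
    "new_york": (40.7128, -74.0060),
    "singapore": (1.3521, 103.8198),
}


def haversine_km(a: Coord, b: Coord) -> float:
    """Great-circle distance between two points on Earth in kilometers."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    h = (
        math.sin((lat2 - lat1) / 2) ** 2
        + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2
    )
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def pick_region(participants: List[Coord]) -> str:
    """Choose the region whose farthest participant is closest."""
    return min(
        REGIONS,
        key=lambda name: max(haversine_km(REGIONS[name], p) for p in participants),
    )


paris = (48.8566, 2.3522)
boston = (42.3601, -71.0589)
europe_call = pick_region([REGIONS["london"], paris])
us_call = pick_region([boston])
```

Minimizing the worst-case distance is only one possible rule; minimizing the average would favor the majority location when participants are spread unevenly.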

Summary

The engineering of video conferencing is a masterclass in low-latency networking. By leveraging UDP, SFU architectures, and Simulcast for adaptive quality, you can build a platform that makes global communication feel close to a face-to-face meeting.
