Question 1

What is database sharding and why do teams use it?

Accepted Answer

Sharding splits a large dataset across multiple database instances so that no single database becomes a bottleneck. Each shard owns a disjoint slice of the data (e.g. user IDs 0–33M on Shard 0 and 33M–66M on Shard 1, with each boundary value belonging to exactly one shard). This lets a team scale to billions of rows without hitting the storage or throughput limits of a single server.
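As a rough illustration of how a range slice like the one above could be looked up, here is a minimal Python sketch. The boundary values, the third shard, and the name `shard_for_range` are assumptions made for the example, not part of any particular database.

```python
from bisect import bisect_right

# Hypothetical exclusive upper bounds of each shard's user_id range:
# shard 0 holds [0, 33M), shard 1 holds [33M, 66M), shard 2 holds [66M, 99M).
UPPER_BOUNDS = [33_000_000, 66_000_000, 99_000_000]


def shard_for_range(user_id: int) -> int:
    """Return the index of the shard whose range contains user_id."""
    if user_id < 0 or user_id >= UPPER_BOUNDS[-1]:
        raise ValueError(f"user_id {user_id} is outside every shard range")
    # bisect_right counts how many upper bounds are <= user_id,
    # which is exactly the index of the owning shard.
    return bisect_right(UPPER_BOUNDS, user_id)
```

Treating each upper bound as exclusive means a boundary ID such as 33,000,000 belongs to exactly one shard (Shard 1 here).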

Question 2

How do you decide which shard a request goes to?

Accepted Answer

A common approach is hash-based: compute hash(user_id) mod shard_count. With a well-mixed hash function this spreads data roughly evenly, and it is deterministic: the same user always maps to the same shard. Other strategies include range-based sharding (e.g. user_id 0–1M on Shard 0) and geographic sharding (e.g. US on Shard 0, EU on Shard 1).
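The hash-based rule can be sketched in a few lines of Python. This is only an illustration: the function name `shard_for_user` and the choice of SHA-256 are assumptions. A cryptographic digest is used here because it gives the same result on every machine and process, whereas Python's built-in `hash()` is randomized per process for strings.

```python
import hashlib


def shard_for_user(user_id: int, shard_count: int) -> int:
    """Map a user_id to a shard index in [0, shard_count)."""
    if shard_count <= 0:
        raise ValueError("shard_count must be positive")
    # Hash the decimal form of the ID so the result is stable across runs.
    digest = hashlib.sha256(str(user_id).encode("utf-8")).digest()
    # Interpret the first 8 bytes as an unsigned integer, then take the modulo.
    return int.from_bytes(digest[:8], "big") % shard_count
```

Calling `shard_for_user(42, 4)` repeatedly returns the same index each time, which is what lets any application server route a request without a lookup table.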

Question 3

What happens when you add or remove a shard?

Accepted Answer

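Adding or removing a shard changes shard_count, so under hash(user_id) mod shard_count most keys map to a different shard than before (going from 4 to 5 shards moves about 80% of keys). Moving that data is an expensive migration. Teams handle this either with consistent hashing, which remaps only about 1/N of the keys when the cluster has N shards, or by accepting a planned migration window. Choosing a generous shard count up front reduces how often reshuffling is needed.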
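To make the difference concrete, the following Python sketch measures how many keys change shards when a fifth shard is added, first with plain modulo routing and then with a simple consistent-hash ring. It is a sketch under stated assumptions: the `HashRing` class, the shard names `s0` to `s4`, the 100 virtual nodes per shard, and the sample keys are all invented for illustration and do not reflect any specific library.

```python
import bisect
import hashlib


def h(value: str) -> int:
    """Stable 64-bit hash of a string."""
    return int.from_bytes(hashlib.sha256(value.encode("utf-8")).digest()[:8], "big")


def modulo_shard(key: str, shard_count: int) -> int:
    return h(key) % shard_count


class HashRing:
    """Minimal consistent-hash ring with virtual nodes (illustrative only)."""

    def __init__(self, shards, vnodes=100):
        # Each shard is placed at `vnodes` pseudo-random points on the ring.
        self._points = sorted(
            (h(f"{shard}#{i}"), shard) for shard in shards for i in range(vnodes)
        )
        self._hashes = [point for point, _ in self._points]

    def shard_for(self, key: str) -> str:
        # A key belongs to the first ring point clockwise from its hash,
        # wrapping around to the start of the ring if necessary.
        idx = bisect.bisect_right(self._hashes, h(key)) % len(self._hashes)
        return self._points[idx][1]


def moved_fraction(before, after, keys) -> float:
    """Fraction of keys whose shard differs between two routing functions."""
    return sum(1 for k in keys if before(k) != after(k)) / len(keys)


keys = [f"user-{i}" for i in range(10_000)]

# Plain modulo: going from 4 to 5 shards is expected to move about 80% of keys.
modulo_moved = moved_fraction(
    lambda k: modulo_shard(k, 4), lambda k: modulo_shard(k, 5), keys
)

# Consistent hashing: only keys captured by the new shard move (around 1/5 here).
ring4 = HashRing(["s0", "s1", "s2", "s3"])
ring5 = HashRing(["s0", "s1", "s2", "s3", "s4"])
ring_moved = moved_fraction(ring4.shard_for, ring5.shard_for, keys)
```

With the ring, every key that moves lands on the new shard, so the existing shards only give data away and never exchange data among themselves. The exact fractions depend on the hash function and the number of virtual nodes.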

Question 4

What are the tradeoffs of sharding?

Accepted Answer

Pros: horizontal scalability, and each shard stays small and fast. Cons: more complex application logic (routing), no cross-shard transactions without a distributed protocol such as two-phase commit or a saga, and operational overhead (for example, backups per shard). Start with a single database, add caching to reduce query load, and shard only once single-database options such as replication with read replicas are exhausted.

