Question 1

What is a zero-downtime database migration?

Accepted Answer

A zero-downtime migration moves data from an old database schema to a new one without shutting down your service. The trick is to run both schemas in parallel: your code writes to both (dual-write) while a backfill copies existing rows into the new schema, reads keep coming from the old one, and a side-by-side comparison verifies that the new data matches the old (shadow-read). Once the two agree, you switch reads to the new schema and clean up the old one. If anything goes wrong mid-migration, you roll back by pointing reads at the old schema again, which is still complete because dual-write never stopped.
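The dual-write idea can be sketched in a few lines of Python. This is a minimal illustration, not a production pattern: the two dictionaries stand in for the old and new schemas, and `DualWriteStore`, `to_new_schema`, and the name-splitting transformation are hypothetical names invented for the example.

```python
from dataclasses import dataclass, field


def to_new_schema(old_row: dict) -> dict:
    """Hypothetical transformation: split 'name' into first and last name."""
    first, _, last = old_row["name"].partition(" ")
    return {"first_name": first, "last_name": last}


@dataclass
class DualWriteStore:
    """Writes go to both stores; reads come from whichever one is active."""
    old: dict = field(default_factory=dict)  # stands in for the old schema
    new: dict = field(default_factory=dict)  # stands in for the new schema
    read_from_new: bool = False              # flipped at cutover

    def write(self, key: int, old_row: dict) -> None:
        # Every write lands in both schemas, so neither falls behind.
        self.old[key] = old_row
        self.new[key] = to_new_schema(old_row)

    def read(self, key: int) -> dict:
        if self.read_from_new:
            return self.new[key]
        return self.old[key]


store = DualWriteStore()
store.write(1, {"name": "Ada Lovelace"})
before_cutover = store.read(1)  # served from the old schema
store.read_from_new = True      # cutover: reads switch to the new schema
after_cutover = store.read(1)   # served from the new schema
```

Because the old store keeps receiving every write, setting `read_from_new` back to `False` is all a rollback needs in this sketch.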

Question 2

Why can't we just dump the old database and load the new one?

Accepted Answer

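Because your service is running and users are actively writing data. If you pause writes for the duration of the dump and load, the service is effectively down and you lose revenue. If you dump while writes continue, every write that lands after the dump's snapshot is missing from the new database, so the copy is already stale or inconsistent by the time it loads. A migration that does not coordinate with live writes therefore has to accept either data loss or downtime. Dual-write keeps the two schemas consistent even as writes arrive during the backfill.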
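One way to picture a backfill that coexists with live writes is the sketch below. It assumes, purely for illustration, that both schemas are tables in a single in-memory SQLite database; the table names `users_old` and `users_new` and the helpers `dual_write` and `backfill_batch` are hypothetical. In a real migration the two schemas may live in separate databases, where a single transaction across both is not available.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE users_old (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE users_new (id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT);
    INSERT INTO users_old VALUES (1, 'Ada Lovelace'), (2, 'Alan Turing'), (3, 'Grace Hopper');
""")


def split_name(name):
    first, _, last = name.partition(" ")
    return first, last


def dual_write(conn, user_id, name):
    """Live write path: update both tables in one transaction."""
    first, last = split_name(name)
    with conn:
        conn.execute("INSERT OR REPLACE INTO users_old VALUES (?, ?)", (user_id, name))
        conn.execute(
            "INSERT OR REPLACE INTO users_new VALUES (?, ?, ?)", (user_id, first, last)
        )


def backfill_batch(conn, after_id, batch_size):
    """Copy one batch of old rows; return the last id seen, or None when done."""
    rows = conn.execute(
        "SELECT id, name FROM users_old WHERE id > ? ORDER BY id LIMIT ?",
        (after_id, batch_size),
    ).fetchall()
    with conn:
        for user_id, name in rows:
            first, last = split_name(name)
            # OR IGNORE: never overwrite a row that a live dual-write already produced.
            conn.execute(
                "INSERT OR IGNORE INTO users_new VALUES (?, ?, ?)", (user_id, first, last)
            )
    return rows[-1][0] if rows else None


last_id = backfill_batch(conn, 0, 2)      # backfill copies ids 1 and 2
dual_write(conn, 3, "Grace B. Hopper")    # live update arrives mid-backfill
dual_write(conn, 4, "Edsger Dijkstra")    # live insert arrives mid-backfill
while last_id is not None:                # backfill resumes where it stopped
    last_id = backfill_batch(conn, last_id, 2)

new_rows = conn.execute(
    "SELECT id, first_name, last_name FROM users_new ORDER BY id"
).fetchall()
```

The design choice worth noting is that the backfill only fills gaps. Rows written by the live path are treated as authoritative, so a slow backfill cannot replace fresh data with an older copy.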

Question 3

What does 'shadow-read' mean and why is it critical in this diagram?

Accepted Answer

Shadow-read means reading from both the old and new databases, comparing the results, and alerting if they differ. The caller still gets the old database's answer, so a mismatch never reaches users. This is your safety net: if the backfill missed rows or the schema transformation lost data, shadow-read catches it before you cut over. Running shadow-read for 30 minutes before cutover catches most bugs that would otherwise turn into customer-visible data loss.
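A shadow-read check might look roughly like the following. It is a sketch under stated assumptions: the stores are plain dictionaries, `to_new_schema` is a hypothetical transformation, and mismatches are logged and collected in a list rather than sent to a real alerting system.

```python
import logging

logger = logging.getLogger("shadow_read")


def to_new_schema(old_row):
    """Hypothetical transformation: split 'name' into first and last name."""
    first, _, last = old_row["name"].partition(" ")
    return {"first_name": first, "last_name": last}


def shadow_read(key, old_store, new_store, mismatches):
    """Serve the old row; compare it against the new store on the side."""
    old_row = old_store[key]
    expected = to_new_schema(old_row)
    actual = new_store.get(key)  # None if the backfill missed this row
    if actual != expected:
        mismatches.append(key)
        logger.warning(
            "shadow-read mismatch for key %s: expected %r, got %r",
            key, expected, actual,
        )
    return old_row  # callers always get the old schema's answer


old_store = {
    1: {"name": "Ada Lovelace"},
    2: {"name": "Alan Turing"},
    3: {"name": "Grace Hopper"},
}
new_store = {
    1: {"first_name": "Ada", "last_name": "Lovelace"},
    2: {"first_name": "Alan", "last_name": ""},  # transformation lost data
    # key 3 is absent: the backfill missed it
}

mismatches = []
results = [shadow_read(k, old_store, new_store, mismatches) for k in sorted(old_store)]
```

Here `mismatches` ends up holding keys 2 and 3, one for each failure mode named above, while every caller still received the old row.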

Question 4

What happens if the cutover fails and I need to roll back?

Accepted Answer

Rollback is a configuration change, not a data restore: switch your code back to reading from the old schema, keep dual-writing to both, and investigate why the new schema differs. Once you fix the issue (usually a data mismatch or a schema bug), resume shadow-read verification, then retry the cutover. This safety net is why zero-downtime migrations take time, but they never take the system down.
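The cutover, rollback, and retry sequence can be modeled as a small piece of state. This is an illustrative sketch only: `MigrationState`, `try_cutover`, and `rollback` are hypothetical names, and the `mismatches` counter stands in for whatever your shadow-read verification reports.

```python
from dataclasses import dataclass
from enum import Enum


class ReadSource(Enum):
    OLD = "old"
    NEW = "new"


@dataclass
class MigrationState:
    read_source: ReadSource = ReadSource.OLD
    dual_write: bool = True
    mismatches: int = 0  # count reported by shadow-read verification


def try_cutover(state):
    """Switch reads to the new schema only if verification is clean."""
    if state.mismatches > 0:
        return False
    state.read_source = ReadSource.NEW
    return True


def rollback(state):
    """Point reads back at the old schema; dual-write stays on."""
    state.read_source = ReadSource.OLD
    state.dual_write = True


state = MigrationState()
first_attempt = try_cutover(state)       # verification was clean, reads move to NEW
rollback(state)                          # a problem shows up after cutover
source_after_rollback = state.read_source
state.mismatches = 3                     # investigation finds mismatched rows
blocked = try_cutover(state)             # cutover refuses while mismatches remain
state.mismatches = 0                     # issue fixed, shadow-read is clean again
retried = try_cutover(state)             # retry succeeds
```

Nothing in `rollback` touches data. It only changes where reads go, which is why it is cheap to do and safe to repeat.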

Zero-downtime database migration

Related templates

Database migration flow

CI/CD pipeline

Data pipeline (ETL)