Question 1

What is a request timeout and retry pattern?

Accepted Answer

It's a fault-tolerance strategy for unreliable networks and overloaded services. When a request doesn't complete within a deadline or returns a server error, the client waits an increasing amount of time before retrying — up to a maximum retry count. This prevents thundering-herd problems and gives transient failures time to recover.

Question 2

Why use exponential backoff instead of retrying immediately?

Accepted Answer

Immediate retries hammer an already-stressed service and make things worse. Exponential backoff (wait 100ms, 200ms, 400ms, 800ms…) gives the service time to drain its queue and recover. It also prevents your client from becoming a denial-of-service attack on itself.

Question 3

How do I adapt this for my API?

Accepted Answer

Set the timeout based on your SLA: 5s for most APIs, 10-30s for file uploads. Start backoff at 100-500ms and double it each retry. Set max retries to 3-5 (including the original attempt). Add jitter (random ±10%) to prevent multiple clients retrying simultaneously. Visual edits let you adjust the decision thresholds without rewriting code.

Question 4

Should I retry all errors or only some?

Accepted Answer

Retry only idempotent requests on transient errors: 408 (timeout), 429 (rate limit), 502/503/504 (server error). Never retry 4xx client errors (401, 403, 400) or non-idempotent POST requests unless you've confirmed idempotency. Add this logic to the 'Status code ok?' diamond with more granular branches.

Request timeout and retry pattern

When to use this template

How to adapt it

Mermaid code

Frequently asked questions

Related templates

API error handling flow

Error handling and recovery flow

Database reconnect retry pattern