Question 1

What does a load balancer do?

Accepted Answer

A load balancer sits between clients and your backend servers, distributing incoming requests across instances to prevent any single server from being overwhelmed. It performs health checks to route traffic only to healthy servers, applies routing algorithms like round-robin or least-connections, and maintains session affinity so users stay on the same server when needed.

Question 2

What is session affinity or sticky sessions?

Accepted Answer

Session affinity ensures that once a user hits a particular server, all their subsequent requests go to that same server. This is critical when server memory holds user session state (shopping carts, login status). Without sticky sessions, a second request might land on a different server that doesn't have that user's session data, forcing them to log in again.

Question 3

How is a load balancer different from a service mesh?

Accepted Answer

A load balancer handles traffic distribution at the network edge — between clients and servers. A service mesh (like Istio) is deployed inside your cluster and manages communication between your microservices. Service meshes offer finer-grained control (per-RPC routing, canary deployments) but add overhead; load balancers are simpler and more efficient for client-facing traffic.

Question 4

Why do we need health checks?

Accepted Answer

Without health checks, the load balancer might send requests to a server that has crashed or become unresponsive, causing user errors. Health checks — periodic pings or HTTP requests to a /health endpoint — let the load balancer detect failures in seconds and route traffic away from broken instances automatically.

Load balancer request routing

When to use this template

How to adapt it

Mermaid code

Frequently asked questions

Related templates

Network topology diagram

Auto-scaling decision tree

Database migration flow