Question 1

What is an incident state machine?

Accepted Answer

It's a diagram showing every distinct phase an incident passes through, from the moment an alert fires through resolution and the postmortem. Each state represents who is doing what, and the transitions show what event or decision moves you to the next phase. It makes incident management predictable and trainable.

Question 2

Why does this template separate Investigating from Mitigating?

Accepted Answer

Because they are fundamentally different: investigating means finding the root cause (which can take hours), while mitigating means reducing customer impact immediately (which should take minutes). Separating them clarifies whether your team is still hunting for the cause or already executing a fix.

Question 3

What if an incident gets downgraded during Escalated?

Accepted Answer

Add a transition from Escalated back to Investigating or Mitigating — not every incident turns into a P1. A state machine is a policy, and it should match your real decision tree. If you sometimes stand down, draw that path.

Question 4

How do I adapt this for my incident severity levels?

Accepted Answer

Expand the Investigating state into p1-investigating and p2-investigating, each with different escalation thresholds and personnel. Or add a Decision state after Detected that branches based on severity. The visual editor lets you add states and transitions without writing YAML, so you can build your exact process visually.

Incident state machine

When to use this template

How to adapt it

Mermaid code

Frequently asked questions

Related templates

Deployment rollback decision tree

Order fulfillment process

Support escalation path