Approval Stakes vs Reversibility Matrix

The Idea

When agents take unsolicited actions — doing things on their own rather than responding to explicit requests — you need to decide how much autonomy to grant. The framing is a 2x2: stakes (low/high) by reversibility (easy/hard).

StakesReversibilityPatternExample
LowEasyAuto-applyOrganizing files
LowHardQuick confirmPublishing to feed
HighEasySuggest + applyCode changes
HighHardExplicit approvalSending emails

The matrix gives you a default for each cell. Tune from there, but start with these.

The Crucial Caveat

This applies to unsolicited agent actions only. If the user explicitly asks the agent to do something (“send that email”), that’s already approval — the agent just does it. Wrapping every explicit request in a confirmation dialog is friction without value; it trains users to dismiss approvals without reading them, which is worse than no approval at all.

The article flags this framework as one that emerged during building but hasn’t been battle-tested. Treat it as a starting heuristic, not a settled standard.