Frame · Oversight
The confirm button is not oversight
Most human oversight of AI comes down to a button. A button that is only ever pressed measures nothing. If the person approves every time and changes nothing, you have not added oversight. You have added a stamp.
The machine proposes, a person clicks approve, the decision ships. On an org chart that looks like control. So ask one question about the last thousand times that person clicked approve: how many times did they click something else? If the honest answer is almost never, the button was never oversight. It was a formality with a keyboard shortcut.
Why a button that always says yes carries no information
A control tells you something only when it can come back two ways. A smoke alarm that cannot stay silent is not detecting smoke, it is just on. An approval that is never withheld is the same. The variation is the signal, and when the variation is gone the click stops measuring anything, no matter how many people perform it or how senior they are.
Does clicking approve count as oversight?
Not on its own. Oversight is the live capacity to change an outcome and have the change stand. A click counts only if refusing was a real, available, survivable option in that moment, and that needs three things at once: the time to actually consider the case, the context to judge it, and the standing to say no without it costing the person who says it. Strip any one of the three and approve becomes the only move the system left open. Then the click is consent theater.
Why this gets worse, not better, as the tooling improves
Faster pipelines mean less time per item. Stronger models mean the answer is usually right, so refusing starts to feel like friction. Cleaner dashboards show the polished output and hide the messy inputs a human would need in order to disagree. Every improvement that makes the click smoother also makes it emptier. The interface gets better and the oversight gets thinner, at the same time, from the same work.
What turns a button back into oversight
The fix is not removing the button. It is making the no real. Give the reviewer enough time and enough of the inputs to form an actual judgment. Make refusal cheap and safe rather than a career risk. And measure the thing that matters: not how many approvals you collected, but how often a human changed an outcome and the change held. If that number sits near zero, your oversight is a stamp, and at least now you can see it and decide whether you are comfortable with that.
Read on
This is one face of a larger defect: that most human oversight of AI is mostly theater. The measure that separates the real thing from the stamp is the Meaningful Override Rate. If you want to see what I build instead, start here.