Blog post
Planning Is Where AI Work Becomes Reviewable
Why planning is less about agent autonomy and more about making AI-assisted work inspectable before it becomes expensive.
Planning Is Where AI Work Becomes Reviewable
The risk with AI coding is not only that the agent might be wrong.
The deeper risk is that it can be wrong quickly, across many files, before I have reviewed the direction.
That changes the role of planning.
Planning is not just a way to make the agent more autonomous. For me, it is a way to make the work reviewable before the implementation becomes expensive.
The problem with immediate execution
A direct prompt often produces direct action.
"Implement this feature" becomes new files, changed functions, updated components, maybe tests, maybe a migration.
Sometimes that is useful.
Sometimes it is too much too soon.
The agent may choose an architecture before I have agreed with it. It may place logic in the wrong layer. It may assume a data model. It may add client state where the server should own the truth. It may treat a permission check as a UI concern instead of a backend guarantee.
By the time I review the diff, the real decision has already been made.
Planning moves that decision earlier.
A plan exposes assumptions
A useful plan is not a long ceremony.
It should make the hidden assumptions visible.
What files will change?
Which layer owns the behavior?
What data model is being assumed?
What trust boundary is involved?
What can be done safely now?
What needs human judgment before implementation?
A plan gives me something to inspect before the agent starts producing code.
That is the main value.
Reviewability beats autonomy
It is tempting to measure agent quality by how far it can go without interruption.
I care more about whether I can intervene at the right moment.
For trivial work, the agent can proceed.
For architectural work, I want a checkpoint.
For tenant scope, authorization, data migrations, or cross-domain changes, I want the decision surfaced before execution.
For product behavior, I want to see the trade-off, not just the implementation.
The goal is not to slow everything down. The goal is to slow down the moments where being wrong is expensive.
What a useful planning checkpoint contains
A good checkpoint does not need to be verbose. It needs to be inspectable.
For example:
- intended outcome;
- files or modules likely to change;
- assumptions;
- risks;
- proposed sequence;
- verification steps;
- questions requiring human judgment.
That is enough to catch many failures early.
If the agent proposes putting business rules in a route, I can stop it.
If it assumes tenant scope comes from the client, I can stop it.
If it wants to add a broad abstraction for a small feature, I can stop it.
If it misunderstands the product intent, I can stop it before the diff becomes large.
Multi-agent work makes this stricter, not looser
Planning matters even more when the work is delegated.
As soon as multiple agents are involved, ambiguity stops being local. A vague task does not just produce one vague implementation. It produces several partial interpretations moving in parallel.
That is why I do not think of planning as pre-work for orchestration. I think of it as the condition that makes orchestration safe enough to use at all.
If the work boundaries are unclear, multi-agent execution does not create leverage. It creates faster opacity.
Planning also creates memory
A good plan is not only useful before execution.
It is useful after execution because it explains why the work took a particular path.
The final code shows what changed.
The plan shows what was intended.
When those diverge, review becomes easier.
This matters especially across sessions. If another agent or a future version of me returns to the work, the plan becomes a small piece of durable context.
Without that, the codebase remembers the diff but forgets the reasoning.
The principle
Planning is where AI work becomes reviewable.
The purpose is not to make the agent autonomous.
The purpose is to make the work inspectable before it becomes expensive.
Good planning does not eliminate human judgment. It moves human judgment to the point where it still has leverage.
Continue exploring
Follow the same line of thought through themes, tags, or a broader local search across the archive.
Keep following the thread.
The Problem Is Not Prompting. It Is Context Decay.
Why AI-assisted development breaks down when useful reasoning disappears between sessions, and why durable project context matters more than clever prompts.
Agent Tools Need Product Design Too
Why tools for AI agents should be designed as working surfaces, not just exposed as function calls.
Local Knowledge Beats Always-On Tools in AI-Assisted Development
Why the biggest gain in AI-assisted development comes from durable local knowledge and disciplined workflow structure, not from keeping every external tool always active.