Major conceptual refactoring based on Dex Horty's principle: "Subagents are not for anthropomorphizing roles, they are for controlling context" ### Added (1 new section) - Agent Anti-Patterns section (§9.17, line 3662) - Wrong vs Right table (anthropomorphizing vs context control) - When to use agents (context isolation, parallel processing, scope limitation) - When NOT to use agents (fake teams, roleplaying, mimicking org structure) ### Changed (18 files, 200+ lines) - Section rename: "Split-Role Sub-Agents" → "Scope-Focused Agents" - Agent definitions: "Specialized role" → "Context isolation tool" - 8 custom agent examples refactored (guide + examples/agents/) - 10+ prompt examples with explicit scope boundaries - 4 workflow files updated (agent-teams, TDD, iterative refinement) - Terminology replacements: * "Specialized agents" → "Scope-focused agents" * "Expert personas" → "Context boundaries" * "Multi-domain expertise" → "Multi-scope analysis" ### Fixed - Methodologies: Clarification note for BMAD role-based naming Breaking change: Conceptual shift from role-based to scope-based agent usage. All examples now demonstrate context isolation instead of persona simulation. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
13 KiB
Iterative Refinement
Confidence: Tier 2 — Validated pattern observed across many Claude Code users.
Prompt, observe, reprompt until satisfied. The core loop of effective AI-assisted development.
Table of Contents
- TL;DR
- The Loop
- Feedback Patterns
- Autonomous Loops
- Integration with Claude Code
- Script Generation Workflow
- Iteration Strategies
- Anti-Patterns
- See Also
TL;DR
1. Initial prompt with clear goal
2. Claude produces output
3. Evaluate against criteria
4. Specific feedback: "Change X because Y"
5. Repeat until done
Key insight: Specific feedback > vague feedback
The Loop
Step 1: Initial Prompt
Start with clear intent and constraints:
Create a React component for a user profile card.
- Show avatar, name, bio
- Include edit button
- Use Tailwind CSS
- Mobile-responsive
Step 2: Evaluate Output
Claude produces code. Evaluate:
- Does it meet requirements?
- What's missing?
- What's wrong?
- What could be better?
Step 3: Specific Feedback
Provide targeted corrections:
Good start. Changes needed:
1. Avatar should be circular, not square
2. Edit button should only show for own profile (add isOwner prop)
3. Bio should truncate after 3 lines with "Show more"
Step 4: Repeat
Continue until satisfied:
Better. One more thing:
- Add loading skeleton state for when data is fetching
Feedback Patterns
Effective Feedback
| Pattern | Example |
|---|---|
| Specific location | "Line 23: change === to ==" |
| Clear action | "Add error boundary around the form" |
| Reason given | "Remove the console.log because it leaks user data" |
| Priority marked | "Critical: fix the SQL injection. Nice-to-have: add pagination." |
Ineffective Feedback
| Anti-Pattern | Why It Fails | Better Alternative |
|---|---|---|
| "Make it better" | No direction | "Improve readability by extracting the validation logic" |
| "This is wrong" | No specifics | "The date format should be ISO 8601, not Unix timestamp" |
| "I don't like it" | Subjective | "Use functional components instead of class components" |
| "Fix the bugs" | Too vague | "Fix: 1) null check on line 12, 2) off-by-one in loop" |
Autonomous Loops
Claude can self-iterate with clear completion criteria.
The Ralph Wiggum Pattern
Named after the self-improvement loop pattern:
Keep improving the code quality until:
1. All tests pass
2. No TypeScript errors
3. ESLint shows zero warnings
After each iteration, run the checks and fix any issues.
Stop when all criteria are met.
Completion Criteria Examples
Iterate until:
- Response time < 100ms for 95th percentile
- Test coverage > 80%
- All accessibility checks pass
- Bundle size < 200KB
Iteration Limits
Always set limits to prevent infinite loops:
Improve the algorithm performance.
Maximum 5 iterations.
Stop early if improvement < 5% between iterations.
Integration with Claude Code
With TodoWrite
Track refinement iterations:
TodoWrite:
- [x] Initial implementation
- [x] Fix: handle empty arrays
- [x] Fix: add input validation
- [ ] Optimization: memoize expensive calculations
With Hooks
Auto-validate after each change:
# .claude/hooks.yaml
post_edit:
- command: "npm run lint && npm test"
on_failure: "report"
Claude sees failures and can self-correct.
With /compact
When context grows during iterations:
/compact
Continue refining the search algorithm.
We've made good progress, focus on the remaining issues.
Checkpointing
After significant progress:
Good progress. Let's checkpoint:
- Commit what we have
- List remaining issues
- Continue with the next priority
Script Generation Workflow
Script and automation generation delivers the highest ROI for iterative refinement—70-90% time savings in practitioner reports. Scripts are self-contained, testable in isolation, and yield immediate value.
The 3-7 Iteration Pattern
Most production-ready scripts emerge after 3-7 iterations:
| Iteration | Focus | Prompt Pattern |
|---|---|---|
| 1 | Basic functionality | "Create a script that [goal]" |
| 2-3 | Constraints + edge cases | "Add [constraint]. Handle [edge case]." |
| 4-5 | Hardening | "Add error handling, logging, input validation" |
| 6-7 | Polish | "Optimize for [metric]. Add usage docs." |
Example: Kubernetes Pod Manager (PowerShell)
Iteration 1 — Basic
Create a PowerShell function to list pods in a Kubernetes namespace.
Iteration 2 — Add filtering
Add: filter by label selector and pod status.
Show: pod name, status, age, restarts.
Iteration 3 — Add actions
Add: ability to delete pods matching filter.
Require: confirmation before deletion.
Iteration 4 — Error handling
Handle: kubectl not found, invalid namespace, permission denied.
Add: verbose logging with -Verbose flag.
Iteration 5 — Production ready
Add: dry-run mode, output to JSON for piping, help documentation.
Ensure: works on Windows, Linux, macOS.
Common Pitfalls
| Pitfall | Example | Mitigation |
|---|---|---|
| Hallucinated commands | apt-get on macOS |
Specify OS: "Ubuntu 22.04 only" |
| Security gaps | No input validation | Always request: "validate all user inputs" |
| Over-engineering | Adds unnecessary libs | Request: "minimal dependencies, stdlib preferred" |
| Context drift | Forgets requirements after iteration 5 | Checkpoint prompt: "Recap current requirements before next change" |
| Platform assumptions | Assumes bash features in sh | Specify: "POSIX-compliant" or "bash 4+" |
Script Iteration Template
Current script: [paste or reference]
Iteration goal: [specific improvement]
Constraints:
- Must preserve: [existing behavior to keep]
- Must not: [things to avoid]
- Target environment: [OS, shell, runtime]
Success criteria: [how to verify this iteration works]
Iteration Strategies
Breadth-First
Fix all issues at same level before going deeper:
First pass: Fix all type errors
Second pass: Fix all lint warnings
Third pass: Improve test coverage
Fourth pass: Optimize performance
Depth-First
Complete one area fully before moving on:
1. Perfect the authentication flow (all aspects)
2. Then move to user management
3. Then move to settings
Priority-Based
Address by importance:
Iterate in this order:
1. Security issues (critical)
2. Data integrity bugs (high)
3. UX problems (medium)
4. Code style (low)
Anti-Patterns
Moving Target
# Wrong
"Actually, let's change the approach entirely..."
(Repeated 5 times)
# Right
Commit to an approach, iterate within it.
If approach is wrong, explicitly restart.
Perfectionism Loop
# Wrong
Keep improving forever
# Right
Set clear "good enough" criteria:
- Tests pass
- Handles main use cases
- No critical issues
→ Ship it, improve later
Lost Context
# Wrong
After 50 iterations, forget what the goal was
# Right
Periodically restate the goal:
"Reminder: we're building a rate limiter.
Current state: basic implementation works.
Next: add Redis backend."
Review Auto-Correction Loop
Specialized iterative pattern for code review where Claude reviews → fixes → re-reviews until convergence.
Pattern
┌─────────────────────────────────────────┐
│ Review Auto-Correction Loop │
│ │
│ Review (identify issues) │
│ ↓ │
│ Fix (apply corrections) │
│ ↓ │
│ Re-Review (verify fixes) │
│ ↓ │
│ Converge (minimal changes) → Done │
│ ↑ │
│ └──── Repeat (max iterations) │
└─────────────────────────────────────────┘
Prompt Template
Review this PR with auto-correction:
1. Multi-agent review (3 scope-focused agents)
2. Fix all 🔴 Must Fix issues
3. Re-review to verify fixes didn't introduce new issues
4. Fix all 🟡 Should Fix issues
5. Re-review one final time
6. Stop when only 🟢 Can Skip remain
Max iterations: 3
Stop early if iteration produces <5 lines changed
Safeguards
| Safeguard | Purpose | Implementation |
|---|---|---|
| Max iterations | Prevent infinite loops | Hard limit: 3 iterations |
| Quality gates | Ensure fixes are valid | Run tsc && lint before each iteration |
| Protected files | Prevent risky changes | Skip auto-fix for: package.json, migrations, .env |
| Change threshold | Stop when converged | Exit if iteration changes <5 lines |
| Rollback capability | Recover from bad fixes | Git commit before each iteration |
Example Session
Iteration 1: Initial Review
Claude: Found 8 issues:
- 🔴 3 Must Fix (SQL injection, empty catch, missing auth)
- 🟡 4 Should Fix (DRY violations, N+1 query)
- 🟢 1 Can Skip (naming style)
Iteration 2: Fix Must Fix + Re-Review
Claude: Fixed 3 Must Fix issues.
Re-review: All 🔴 resolved. No new issues introduced.
Remaining: 4 🟡 Should Fix, 1 🟢 Can Skip
Iteration 3: Fix Should Fix + Re-Review
Claude: Fixed 4 Should Fix issues.
Re-review: All 🟡 resolved. No new issues.
Remaining: 1 🟢 Can Skip (optional improvement)
Convergence
Claude: Converged. Only optional improvements remain.
Changes this iteration: 2 lines (below threshold).
Review complete. ✅
Comparison: One-Pass vs Convergence Loop
| Aspect | One-Pass Review | Convergence Loop |
|---|---|---|
| Detection | Find all issues once | Find issues → fix → verify → repeat |
| Follow-up awareness | Check git log for "Co-Authored-By: Claude" | Each iteration is aware of previous |
| False positives | Can suggest fixes for already-fixed code | Re-review catches this |
| Confidence | Single validation | Multiple validation passes |
| Time cost | Fastest (1 review) | Slower (3+ reviews) |
| Quality | Good for experienced devs | Better for critical code |
When to use:
- One-pass: Simple PRs, experienced team, time-sensitive
- Convergence loop: Security-critical code, junior team, high-stakes production
Integration with Multi-Agent Review
Combine convergence loop with multi-agent review for maximum quality:
Each iteration:
├─ Agent 1: Consistency Auditor
├─ Agent 2: SOLID Principles Analyst
└─ Agent 3: Defensive Code Auditor
↓
Fix issues
↓
Re-run 3 agents
↓
Verify fixes + check for new issues
↓
Repeat until convergence
Convergence Criteria
Stop iterating when ANY of these is true:
- No issues remaining (ideal outcome)
- Max iterations reached (3 iterations default)
- Change threshold (iteration changed <5 lines)
- Quality gate failure (tsc/lint fails after fix)
- Manual stop (user requests halt)
Anti-Patterns in Review Loops
| Anti-Pattern | Problem | Solution |
|---|---|---|
| Infinite loop | No convergence criteria | Set max iterations + change threshold |
| Scope creep | Each iteration adds new requirements | Lock scope before starting loop |
| Breaking fixes | Fix introduces new bugs | Re-review after each fix + quality gates |
| Protected file changes | Modifies package.json, migrations | Explicit skip list for protected files |
| Context loss | Forgets original issues after iteration 3 | Maintain issue tracker across iterations |
Example Session
Initial Request
Create a debounce function in TypeScript.
Iteration 1
Looks good. Add:
- Generic type support for any function signature
- Option to execute on leading edge
Iteration 2
Better. Issues:
- The return type should preserve the original function's return type
- Add cancellation support
Iteration 3
Almost there. Final polish:
- Add JSDoc comments
- Export the types separately
- Add unit tests
Completion
Perfect. Commit this as "feat: add debounce utility with full TypeScript support"
See Also
- exploration-workflow.md — Explore alternatives before iterating
- tdd-with-claude.md — TDD is iterative refinement with tests
- plan-driven.md — Plan before iterating
- ../methodologies.md — Iterative Loops methodology