- Add examples/scripts/smart-suggest-roi.py: stdlib-only analyzer correlating suggestion log with session JSONL files to measure command acceptance rate. 4 acceptance signals, tier breakdown, daily trend, --json/--since/--no-sessions CLI. - Tune Aristote smart-suggest hook: tighten 5 over-firing triggers (/tech:commit, /tech:sonarqube, /tech:dupes, /check-conventions a11y, /tech:worktree) - Guide: identity re-injection hook, context engineering maturity grid, code review workflow, 1M context window GA update, Spring Break promo, security audit patterns - Resource evaluations: Nick Tune hooks (3/5), VicKayro security audit (2/5), Karl Mazier CLAUDE.md templates, Paul Rayner ContextFlow, Siddhant agent trace, Andrew Yng context hub, JP Caparas 1M context window Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
6.4 KiB
Resource Evaluation: Paul Rayner — "Will AI Kill Refactoring?" (LinkedIn)
Date: 2026-03-16 Evaluator: Claude (automated via /eval-resource) Source type: LinkedIn post (text provided) Author: Paul Rayner, CEO & Principal Consultant @ Virtual Genius; author of The EventStorming Handbook; founder/chair of Explore DDD Published: ~March 2, 2026 (2 weeks before eval date) Repository: https://github.com/virtualgenius/contextflow Score: 3/5
Summary
Paul Rayner built ContextFlow (a DDD context mapping tool) entirely with Claude Code and analyzed 519 commits to answer whether AI makes refactoring obsolete. Key findings:
- Full commit breakdown: 30% feat, 22% fix, 23% docs, 14% tidy (refactoring), 5.4% config, 2.3% test, 2.3% other
- Code-only commits: 44% feat, 32% fix, 21% refactoring, 3% test — meaning 1 in 5 code commits is pure structural work
- Main argument: AI doesn't eliminate refactoring, it lowers its cost enough to do it more often, in smaller batches, before problems compound
- New mechanism: large incoherent files degrade context window quality — refactoring keeps AI productive
- The design skill AI can't replace: knowing when structure no longer fits the problem and what better structure looks like
- Includes a usable git prompt for analyzing any conventional commits repo by commit type distribution
Comparatif
| Aspect | This resource | Guide coverage |
|---|---|---|
| Refactoring patterns | Frequency rationale (new angle) | Section at ~line 16990 (incremental, boundary patterns) |
| Context window degradation via code structure | ✅ Original insight | ❌ Not explicitly linked to refactoring |
| Real-world Claude Code case study | ✅ Practitioner + data | 4 others (Mergify, Airbnb, Boris Cherny, Fountain) |
| Commit analysis prompt | ✅ Reusable tool | ❌ Not present |
| Conventional commits conventions | Referenced | ✅ Covered at lines 8600, 15564 |
| DDD methodology | Context for the project | Mentioned as semantic anchor at lines 3875, 3908, 16849 |
Score: 3/5
Justification: Two distinct artifacts of real value — a context window insight worth adding to the refactoring section, and a git analysis prompt worth adding to git best practices. The case study narrative itself is weaker: n=1, self-reported, no external corroboration, LinkedIn-published. The guide already holds case studies to a higher evidence standard (Mergify has a sourced blog post; Airbnb data is corroborated by academic research). Presenting the commit percentages (44/32/21) without a baseline for non-AI projects also limits what conclusions can be drawn — you can't distinguish "AI accelerates refactoring discipline" from "Rayner is personally disciplined about refactoring."
Integration Recommendations
Split the two artifacts. Treat them independently.
1. Context window degradation insight → Refactoring section (~line 17025)
Add one paragraph as an additional rationale within the incremental/boundary patterns explanation. The link between code cohesion and context quality is a distinct mechanism not currently in the guide. Attribute as a practitioner observation, note it's a single project.
Example framing:
"Refactoring also protects your context window. Large, incoherent files that accumulate
without structural cleanup force Claude to process more irrelevant content per request.
Keeping modules small and well-scoped is not just a quality practice — it's a practical
token efficiency strategy."
2. Git commit analysis prompt → Git best practices (~line 15564)
Add alongside existing commit conventions as a companion diagnostic tool. This is immediately actionable for any team using conventional commits and has standalone value regardless of the case study narrative.
Example placement: after the commit format section, as a "Analyze your commit distribution" sidebar.
3. Case study bullet → Skip
The data quality doesn't support adding it alongside Mergify and Airbnb. If Rayner publishes a proper blog post with methodology, revisit.
Priority: Low-Medium. The git prompt is the quickest win (15 minutes to add). The context window paragraph requires more care to integrate without duplicating existing content.
Challenge (technical-writer agent)
The agent pushed back on score (3/5 confirmed, not 4/5) for two reasons:
- Data provenance: n=1, self-reported on LinkedIn, no external validation. Bumping to 4 would imply evidence quality it hasn't earned.
- Integration plan was misaligned: original plan proposed adding to case studies section. Agent correctly redirected both artifacts to their natural homes (refactoring section + git best practices), not a case study bullet.
Additional issues flagged:
- No baseline comparison (are 21% refactoring commits high or low vs. non-AI projects?) — weakens the thesis
- Git prompt underweighted in original plan — it's the highest-value artifact, needs explicit placement
- Risk of not integrating: Low to medium — context window link is worth capturing, git prompt adds direct reader value, but nothing is irreplaceable given existing guide depth
Fact-Check
| Claim | Status | Notes |
|---|---|---|
| Paul Rayner is CEO @ Virtual Genius, EventStorming Handbook author | ✅ Verified | Consistent with LinkedIn bio in the post |
| ContextFlow built entirely with Claude Code | ⚠️ Unverifiable | Author's stated claim, no commit metadata to confirm |
| "519 commits" in the repo | ⚠️ Minor discrepancy | GitHub shows 552 commits at eval time (post written ~2 weeks earlier) — timing explains the gap |
| Commit breakdown percentages (30/22/23/14/5.4/2.3/2.3) | ✅ Internally consistent | Screenshot shows Claude's analysis output; numbers sum to ~99.3% (rounding). Verifiable by running the git prompt on the repo |
| Code-only breakdown (44/32/21/3) | ✅ Internally consistent | Matches the full-breakdown numbers when non-code commits excluded |
| ContextFlow is a DDD context mapping tool | ✅ Verified | GitHub confirms: TypeScript/React, 140 stars, MIT, maps bounded contexts/value streams/Wardley |
No hallucinations detected. Minor discrepancy on commit count explained by post timing.
Decision
- Score: 3/5
- Action: Integrate partially — git prompt (high priority) + context window paragraph (medium priority). Skip case study bullet.
- Confidence: High on scope/placement; medium on data (n=1 limitation acknowledged)