Commit graph

149 commits

Author SHA1 Message Date
Florian BRUNIAUX
0bdb34b2a4 docs: external orchestration frameworks, pr-triage skill, GitHub Actions templates
Added:
- guide/third-party-tools.md: External Orchestration Frameworks section
  (Ruflo + Athena Flow) with architectural distinction from multi-instance tools
- examples/skills/pr-triage/: 3-phase PR backlog management skill
  (audit, deep review via parallel agents, validated comment posting)
- examples/github-actions/: claude-code-review.yml + .coderabbit.yaml +
  prompts/code-review.md — AI-powered PR review GitHub Actions workflow
- docs/resource-evaluations/073-athena-flow-workflow-runtime.md (2/5 Watch)
- docs/resource-evaluations/074-ruflo-multi-agent-orchestration.md (3/5 Pertinent)

Updated:
- examples/README.md + examples/github-actions/README.md: new templates indexed
- machine-readable/reference.yaml: new entries for github-actions + pr-triage

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-05 09:47:16 +01:00
Florian BRUNIAUX
37d9d70ea2 docs: tech leads section, straude, session-naming, cowork updates
- guide/learning-with-ai.md: §12 For Tech Leads & Engineering Managers
  (4-week onboarding, real growth metrics, scalable mentoring,
  team warning signs, AI policy template)
- guide/third-party-tools.md: straude entry (social dashboard for CC, security analysis)
- examples/claude-md/session-naming.md: session auto-rename template
- guide/cowork.md: content and comparison updates
- docs/resource-evaluations/: +2 evaluations (eveillard, straude)
- README.md + examples/README.md: template counts 175→176
- machine-readable/reference.yaml: new entries

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-04 17:29:06 +01:00
Florian BRUNIAUX
29e8b62679 docs: update Claude Code releases to v2.1.66 (v3.30.1)
- Update latest tracked version: 2.1.63 → 2.1.66
- Add v2.1.66: reduced spurious error logging
- Add v2.1.63: HTTP hooks, worktree config sharing, /simplify & /batch, memory leaks wave
- Add v2.1.62/61: prompt cache + Windows config fixes
- Bump guide: 3.30.0 → 3.30.1
- Update 2.1.x series date range (January-March 2026)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-04 09:30:52 +01:00
Florian BRUNIAUX
d9187ba17b release: v3.30.0 - 10 advanced patterns documentation
5 new files (plan-challenger, adr-writer, audit-codebase, first-principles, event-driven-agents),
4 workflow files enriched (iterative-refinement, agent-teams, ultimate-guide x3 sections),
reference.yaml updated with 9 new entries. Fact-checked via 9 Perplexity searches (March 2026).

Patterns covered: modular CLAUDE.md architecture, session invariants, auto-ADR, adversarial
plan review, worktree dependency coordination, auto-fix loops (Ralph Loop), Linear/Kanban
event-driven agents, codebase audit scoring, deployment automation (Vercel + Infisical).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-03 06:27:28 +01:00
Florian BRUNIAUX
2195622b50 release: v3.29.2 - /simplify + /batch commands documentation
Added dedicated sections for the two bundled slash commands
introduced in Claude Code v2.1.63: /simplify (over-engineering
detection) and /batch (batch processing). Built-in Commands
table updated in §1.3 and §6.1.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-02 17:30:58 +01:00
Florian BRUNIAUX
252148fe75 release: v3.29.1 - Git MCP + GitHub MCP catalog entries
Add Git MCP Server (12 tools, uvx setup) and GitHub MCP Server
(Issues/PRs/Projects, remote Copilot + self-hosted PAT-only) to §8.2
MCP Server Catalog. Document real-world fix for Incompatible auth
server error via gh auth token + manual header injection.

Also ships: CC v2.1.63 tracking, HTTP hooks, observability quality
patterns, config lifecycle §9.23, terminal personalization, tool
comparison table extensions, MCP server 3 new tools.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-02 16:10:19 +01:00
Florian BRUNIAUX
314f872f1f docs: update reference.yaml + CHANGELOG for mcp v1.0.3
- reference.yaml: add 4 missing tools to mcp_server.tools list
  (compare_versions, search_examples, get_threat, list_threats)
  + correct bundle_size 123KB→132KB
- CHANGELOG [Unreleased] Fixed: document content path bug fix (1.0.3)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-28 21:32:43 +01:00
Florian BRUNIAUX
e62af76767 docs: spinner verbs + tips personalization — new example + guide section
- New section §3.3 Terminal Personalization Settings (ultimate-guide.md:4978)
  spinnerVerbs (mode replace/add) + spinnerTipsOverride (excludeDefault)
- New examples/config/settings-personalization.json — 19 verbs, 113+ tips, 13 categories
- reference.yaml: new spinner_personalization entry + line number fix
- CHANGELOG updated

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-28 10:00:58 +01:00
Florian BRUNIAUX
8e63d84b47 docs: factual audit + reference sync — 260 findings corrected
Parallel 6-agent audit against official Anthropic docs (llms-full.txt).
Key corrections applied across permissions, hooks, MCP, security, privacy, reference.yaml.

Highlights:
- Fix MCP config path (~/.claude.json), mcpServers key, variable substitution syntax
- Fix permission modes (5 not 3), :* syntax (×6), Stop event description
- Fix hook JSON field names (hook_event_name, tool_name, tool_input, session_id)
- Fix filesystem restriction docs (permission rules, not settings.json keys)
- Fix data-privacy: 4-tier retention, /bug 5yr warning, ZDR conditions, 5 telemetry opt-out vars
- Add official llms.txt/llms-full.txt references to CLAUDE.md + machine-readable/llms.txt
- Reference.yaml: 375 entries re-synced (92% had wrong line numbers — guide grew 15K→21K lines)
- New script: scripts/resync-reference-yaml.py for automated line number sync
- Quiz: corrected answers for hooks (07), memory settings (03), MCP servers (08)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-26 12:10:14 +01:00
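The hook JSON field names corrected in the commit above (hook_event_name, tool_name, tool_input, session_id) can be illustrated with a minimal Python sketch. It assumes a hook receives its payload as a JSON object in text form. Only the four field names come from the commit message. The `HookEvent` type, the `parse_hook_payload` function, the sample values, and the choice to treat the two tool fields as optional are hypothetical.

```python
import json
from dataclasses import dataclass, field
from typing import Any, Dict, Optional

# Assumption: every event carries these two fields, while the tool
# fields only appear on tool-related events.
REQUIRED_FIELDS = ("hook_event_name", "session_id")


@dataclass
class HookEvent:
    """Hypothetical container for the four fields named in the commit."""
    hook_event_name: str
    session_id: str
    tool_name: Optional[str] = None
    tool_input: Dict[str, Any] = field(default_factory=dict)


def parse_hook_payload(raw: str) -> HookEvent:
    """Parse a JSON payload and reject it if a required field is absent."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("hook payload must be a JSON object")
    missing = [name for name in REQUIRED_FIELDS if name not in data]
    if missing:
        raise ValueError("missing hook fields: " + ", ".join(missing))
    return HookEvent(
        hook_event_name=data["hook_event_name"],
        session_id=data["session_id"],
        tool_name=data.get("tool_name"),
        tool_input=data.get("tool_input") or {},
    )


if __name__ == "__main__":
    # Illustrative payload; the values are made up for this sketch.
    sample = json.dumps({
        "hook_event_name": "PreToolUse",
        "session_id": "example-session",
        "tool_name": "Bash",
        "tool_input": {"command": "ls"},
    })
    print(parse_hook_payload(sample))
```

Validating the field names up front makes a misspelled key fail loudly instead of silently yielding an empty value, which is the kind of drift the audit corrected.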
Florian BRUNIAUX
5e893f3ccd docs: add Remote Control (§9.22) — mobile access feature documentation
- New section 9.22 in ultimate-guide.md with full Remote Control coverage
- Cheatsheet: dedicated section + Features Méconnues + slash commands
- security-hardening.md: Part 7 Remote Control Security threat model
- machine-readable/reference.yaml: /rc, /remote-control, /mobile, subcommand
- Resource evaluation: 2026-02-25 (score 4/5, community feedback included)

Key original content vs official Anthropic docs:
- Slash commands (/new, /compact) broken in remote UI — undocumented
- tmux multi-session workaround for 1-session limit
- Community security analysis (RCE surface, CISO implications)

Research Preview — Pro/Max plans only (v2.1.51+)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-25 18:45:41 +01:00
Florian BRUNIAUX
97f9167a61 docs: add Kairn memory MCP + resource evaluations + guide updates
- guide/ultimate-guide.md §10.2: Add Kairn (knowledge graph memory with biological decay)
  - Typed relationships (depends-on, resolves, causes), 18 MCP tools
  - Updated comparison table: Serena / grepai / doobidoo / Kairn
  - Added decision routing for long-term memory + causality tracking
- guide/ultimate-guide.md §5.1: Add real-world CLAUDE.md migration example (Avo, 600-line → 15 path-scoped files)
- guide/ai-ecosystem.md: Minor update
- machine-readable/reference.yaml: Add Kairn entries
- examples/config/mcp.json: Add Kairn MCP config
- docs/resource-evaluations/: Add 2 new evaluations (context-evaluator-packmind, kairn-memory-mcp)
- docs/resource-evaluations/agents-md-empirical-study: Add community reception section
- docs/resource-evaluations/2026-02-23-agentsview: Minor fix

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-25 17:39:20 +01:00
Florian BRUNIAUX
79157ac077 release: v3.29.0
- Observability: complete MLflow Tracing section (CLI + SDK, LLM-as-judge)
- Skills §5.5: UI UX Pro Max (33.7k stars, design reasoning engine BM25)
- Diagram #41: AI Fluency High vs Low paths (Anthropic research)
- 3 empirical AI Fluency Index callouts (Rev Engine 5.6×, CLAUDE.md 30%)
- Hooks docs v2.1.47-2.1.50: WorktreeCreate/Remove, ConfigChange, --from-pr
- CC releases tracking: v2.1.50 → v2.1.52
- 4 new resource evaluations (AgentsView, AI Fluency, UX Pro Max)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-24 13:58:09 +01:00
Florian BRUNIAUX
b6a1e63f94 fix: rename duplicate key security_gate_hook → security_gate_hook_line in reference.yaml
2026-02-22 19:24:53 +01:00
Florian BRUNIAUX
dbb62306d7 release: v3.28.1 - Visual Diagrams Series (40 Mermaid diagrams)
guide/diagrams/: new directory with 40 interactive Mermaid diagrams
- 10 thematic files: foundations, context/sessions, configuration,
  architecture, MCP ecosystem, dev workflows, multi-agent patterns,
  security/production, cost/optimization, adoption/learning
- Each diagram: Mermaid (GitHub-native) + ASCII fallback + source link
- Bold Guy palette (6-color system) consistent across all diagrams
- README with index, visual palette legend, navigation by use case

Also includes (backlog from v3.28.0→v3.28.1):
- guide/ultimate-guide.md: Managing Large MCP Server Sets, AI Code
  Disclosure Policy, claude-mem Gemini alternative, observability
- guide/workflows/plan-driven.md: Boris Tane custom markdown plans (+172L)
- guide/security-hardening.md: Part 4 PR security review workflow
- examples/agents/security-patcher.md: new security agent
- examples/hooks/bash/security-gate.sh: PreToolUse security hook
- guide/observability.md: activity monitoring, external tools, proxying
- docs/resource-evaluations/: 4 new evaluations (Boris Cherny, Moigneu,
  Boris Tane, Aristote AI instructions)
- README.md: Visual Diagrams section in "What Makes This Guide Unique"

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-22 15:51:21 +01:00
Florian BRUNIAUX
9218ab37d6 feat: security scanning workflow (auditor + patcher + gate hook)
- security-hardening.md Part 4: PR security review workflow
  3-agent pipeline: scan → data flow trace → patch
  Table by change type (auth, DB, upload, deps)
  Git pre-push hook to flag sensitive files
- security-patcher agent: applies the security-auditor's findings
  Proposes before writing, never autonomously (human approval gate)
  Clear separation of detect vs patch
- security-gate.sh hook: PreToolUse, 7 vulnerable patterns blocked
  SQLi, XSS innerHTML, hardcoded secrets, dynamic eval(),
  weak hashing (MD5/SHA1 password), command injection, path traversal
  Complements dangerous-actions-blocker.sh (system operations)
- Claude Code Security (research preview) documented in security-hardening.md
  Comparison: Security Auditor Agent vs the Anthropic feature
- reference.yaml: 4 new entries indexed

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-22 15:21:35 +01:00
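The seven pattern categories that the commit above attributes to security-gate.sh can be sketched as a small content scanner. This is a minimal Python illustration, not the contents of the actual bash hook: the category names follow the commit message, while every regular expression and the `find_vulnerable_patterns` function are assumptions made for this example. How a real hook reports a block back to Claude Code is left out.

```python
import re
from typing import Dict, List, Pattern

# Hypothetical regexes, one per category listed in the commit message.
# They are deliberately simple and will produce false positives.
PATTERNS: Dict[str, Pattern[str]] = {
    # SQL keyword followed on the same line by a quote and a "+"
    "sql_injection": re.compile(
        r"""\b(select|insert|update|delete)\b[^\n]*["']\s*\+""",
        re.IGNORECASE,
    ),
    # Direct assignment to innerHTML
    "xss_inner_html": re.compile(r"\.innerHTML\s*="),
    # Secret-looking name assigned a quoted literal of 8+ characters
    "hardcoded_secret": re.compile(
        r"""\b(api[_-]?key|secret|password|token)\b\s*[:=]\s*["'][^"']{8,}["']""",
        re.IGNORECASE,
    ),
    # Any call to eval(
    "dynamic_eval": re.compile(r"\beval\s*\("),
    # Mention of MD5 or SHA1 as a standalone word
    "weak_hash": re.compile(r"\b(md5|sha1)\b", re.IGNORECASE),
    # Python subprocess call that goes through a shell
    "command_injection": re.compile(r"shell\s*=\s*True"),
    # Parent-directory segment inside a path
    "path_traversal": re.compile(r"\.\./"),
}


def find_vulnerable_patterns(content: str) -> List[str]:
    """Return the sorted names of every category that matches content."""
    return sorted(
        name for name, pattern in PATTERNS.items() if pattern.search(content)
    )


if __name__ == "__main__":
    snippet = 'element.innerHTML = userInput;\nresult = eval(user_code)'
    hits = find_vulnerable_patterns(snippet)
    if hits:
        print("blocked:", ", ".join(hits))
    else:
        print("ok")
```

Keeping the scanner a pure function over the proposed content makes it easy to unit-test each pattern before wiring it into a hook.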
Florian BRUNIAUX
6049bd99c2 release: v3.28.0 - Section 2.5 Model Selection & Thinking Guide
Consolidated canonical section for model selection:
- Section 2.5 (decision table, effort levels with prompts, model-per-agent patterns, thinking guide)
- 3 new agents: planner (Opus), implementer (Haiku), architecture-reviewer (Opus)
- 7 new quiz questions (09-037→09-043, intermediate→power)
- 3 redundant tables replaced by cross-refs to Section 2.5

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-21 18:25:50 +01:00
Florian BRUNIAUX
31802b61b1 release: v3.27.9
- Update Claude Code releases tracking (v2.1.49 → v2.1.50)
- Cut versioned release for Talk Preparation Pipeline, AGENTS.md research,
  Profile-Based Module Assembly (Section 3.5), ATDD expansion,
  Spec-First Task Granularity, team-config templates, and Mergify case study

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-21 17:27:32 +01:00
Florian BRUNIAUX
00cb973bdb docs: add Talk Preparation Pipeline workflow + skill templates
6-stage pipeline: raw material → conference talk → Kimi AI slides.

New files:
- guide/workflows/talk-pipeline.md — Full workflow guide (REX vs Concept
  modes, stage-by-stage breakdown, CHECKPOINT mechanics, Kimi handoff,
  real-world DevWithAI example, 5 design patterns documented)
- examples/skills/talk-pipeline/ — 7 SKILL.md files + orchestrator + 2
  templates (feedback-draft.md, kimi-prompt-template.md)

Updated:
- README.md — badges 164→172 templates, Feb 20 date, 13→14 skills
- guide/workflows/README.md — Talk Pipeline entry under Design & Content
- machine-readable/reference.yaml — 16 new entries for pipeline components
- CHANGELOG.md — [Unreleased] entry

Design patterns showcased: skill chaining + file-based state, tool
permission scoping (Bash only Stage 2), human-in-the-loop CHECKPOINT
(Stage 4), AI-to-AI handoff (Claude → Kimi), dual execution modes
(REX/Concept).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-20 15:51:29 +01:00
Florian BRUNIAUX
6d847d24de docs: add Profile-Based Module Assembly pattern (Section 3.5)
- Section 3.5 "Team Configuration at Scale" in ultimate-guide.md:
  profiles YAML + shared modules + skeleton + assembler script;
  59% context token reduction measured on 5-dev production team;
  includes CI drift detection, 5-step replication guide, trade-offs
- New workflow: guide/workflows/team-ai-instructions.md (6 phases,
  scaling thresholds, troubleshooting table)
- New templates: examples/team-config/ (profile-template.yaml,
  claude-skeleton.md, sync-script.ts)
- reference.yaml: 9 new entries for team_ai_instructions_*
- README: templates count 161 → 164, date Feb 19 → Feb 20
- CHANGELOG [Unreleased]: resource evaluations (AGENTS.md ETH Zürich
  4/5, Sylvain Chabaud 3/5), spec-first Task Granularity section,
  methodologies ATDD expansion

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-20 15:04:29 +01:00
Florian BRUNIAUX
46b5f39f52 docs: update examples README and reference.yaml for claude-code-review.yml
- examples/README.md: github-actions count 3→4, add new workflow row
- machine-readable/reference.yaml: 6 new entries (workflow, prompt, pattern, tools, auth, cost)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-19 14:58:27 +01:00
Florian BRUNIAUX
9b75b5125e release: v3.27.8 - prompt-based GitHub Actions code review workflow
New examples/github-actions/claude-code-review.yml with externalized prompt,
anti-hallucination protocol, /claude-review on-demand trigger. Templates 116→161.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-19 14:23:15 +01:00
Florian BRUNIAUX
895ace49f7 docs: add Borg et al. 2025 RCT on AI code maintainability (v3.27.7)
- Resource eval: arXiv:2507.00788 "Echoes of AI" (151 devs, 95% pros,
  2-phase blind RCT) — 30.7% faster median, ~55.9% habitual users,
  no significant downstream maintainability impact
- guide/learning-with-ai.md: citation + "On maintainability fear" note
- guide/ultimate-guide.md: nuance blockquote in §1.7 Trust Calibration
- machine-readable/reference.yaml: 4 new RCT/maintainability entries
- docs/resource-evaluations/: evaluation file with technical-writer audit

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-19 09:59:50 +01:00
Florian BRUNIAUX
267ce0ba86 chore: update guide content and add new examples
- Update ultimate guide and cheatsheet
- Update llms.txt and reference YAML
- Add velocity-governor hook and cc-sessions script

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-18 18:48:19 +01:00
Florian BRUNIAUX
c3da456d3a release: v3.27.6 - Sonnet 4.6 default + 200K vs 1M context guide
- Pricing table: Sonnet 4.6 now default (Feb 2026)
- New section: 200K vs 1M context decision guide (MRCR bench, cost table, use cases)
- threat-db.yaml v2.1.0: CVE-2026-23744, Slopsquatting T009, OWASP Agentic AI Top 10

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-18 09:33:55 +01:00
Florian BRUNIAUX
4b15bdb137 release: v3.27.5 - Claude Code v2.1.45 tracking
- Bump guide: v3.27.4 → v3.27.5
- Update Claude Code releases: v2.1.44 → v2.1.45
  - Sonnet 4.6 model support
  - spinnerTipsOverride setting
  - SDK SDKRateLimitInfo/SDKRateLimitEvent types
  - Agent Teams fix on Bedrock/Vertex/Foundry

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-17 21:12:53 +01:00
Florian BRUNIAUX
1f720f7d74 release: v3.27.4 - template count update + worktree docs
- Templates 113 → 116 (review-plan, git-worktree-status/remove/clean, rules)
- Commands 23 → 26 (worktree lifecycle suite)
- Git worktree section expanded with lifecycle commands table
- Version sync across all docs

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 11:15:32 +01:00
Florian BRUNIAUX
0d6a0c656e docs: add git-worktree suite, security kill switch, update reference.yaml + CC releases
- Git worktree commands: overhauled main + 3 new (status, remove, clean)
- Security hardening: AI Kill Switch & Containment Architecture (§3.5)
- DevOps SRE: cross-reference to security-hardening for AI incidents
- CC releases: v2.1.43-v2.1.44 tracking
- reference.yaml: 12 new entries, evaluations count 67 → 74

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 10:20:57 +01:00
Florian BRUNIAUX
4cf1bf3cec docs: v3.27.3 — track Claude Code v2.1.42 + Google Antigravity section
- Claude Code releases: v2.1.41 → v2.1.42 (startup perf, prompt cache, Opus 4.6 effort callout)
- New AI ecosystem section: Google Antigravity agent-first IDE comparison
- Version sync across all docs (3.27.2 → 3.27.3)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-16 09:38:31 +01:00
Florian BRUNIAUX
0f4b1837c5 docs: v3.27.2 — YAML frontmatter on 90 markdown files
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-15 19:24:07 +01:00
Florian BRUNIAUX
d1182af4cf docs: v3.27.1 — fact-check corrections, grepai docs, RTK overhaul
Fact-check (README positioning):
- Template count: 120/123 → 108 (ground truth recount)
- Ratio: 14× → 24× (19,000 ÷ 784 = 24.2×)
- everything-cc stars: 31.9k → 45k+ (verified Feb 15)
- Commands count: 20 → 23, hooks: 30 → 31

Added:
- Grepai MCP documentation (semantic search, call graphs)
- 3 hook templates (rtk-baseline, session-summary, session-summary-config)
- 2 resource evaluations (system-prompts update, qmd token savings)

Changed:
- RTK documentation overhaul (v0.7.0 → v0.16.0, rtk-ai org)
- Exports deprecated (kimi.pdf, notebooklm.pdf → deprecated/)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-15 18:41:45 +01:00
Florian BRUNIAUX
d72905e9ba docs: integrate Entire CLI across guide (v3.27.0)
Major integration of Entire CLI, an agent-native platform launched
Feb 2026 by Thomas Dohmke (ex-GitHub CEO) with $60M funding. Provides
rewindable checkpoints, approval gates, and audit trails for AI sessions.

## Added (7 guide files + 3 meta files)

- **ai-traceability.md**: Replace git-ai 404 with Entire CLI (section 5.1)
- **third-party-tools.md**: Fill "Session replay" gap + add tool section
- **observability.md**: Add session portability alternative
- **ai-ecosystem.md**: Add governance-first orchestration (section 8.1.5)
- **ultimate-guide.md**: Enrich multi-instance section 9.17
- **security-hardening.md**: Add compliance audit trails (section 3.4)
- **cheatsheet.md**: Add Community Tools quick reference
- **README.md**: Update structure tree with third-party-tools mention
- **CHANGELOG.md**: Document v3.27.0 release
- **docs/resource-evaluations/entire-cli.md**: Formal evaluation (5/5)

## Fixed

- git-ai references (404 repo) replaced with working alternative
- "Session replay" Known Gap now marked as FILLED

## Key Features Documented

- Rewindable checkpoints (prompts + reasoning + tool usage)
- Governance layer (approval gates, permissions, audit trails)
- Multi-agent handoffs (Claude → Gemini with context)
- Compliance-ready (SOC2, HIPAA, FedRAMP)
- Session portability (path-agnostic vs native --resume)

## Positioning

- vs git-ai: Replaces non-existent tool (404)
- vs claude-code-viewer: Active replay vs read-only history
- vs Gas Town: Governance sequential vs parallel coordination

Files modified: 10 (7 content + 3 meta)
Words added: ~2,500
Version: 3.26.0 → 3.27.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-12 23:33:16 +01:00
Florian BRUNIAUX
971a297db3 feat(security): add threat intelligence DB, security commands, and cheatsheet audit fixes (v3.26.0)
- Add threat-db.yaml v2.0.0 with 63 malicious skills, 22 CVEs, 4 campaigns
- Add /security-check, /security-audit, /update-threat-db slash commands
- Add Snyk ToxicSkills evaluation (58th resource evaluation)
- Fix cheatsheet: add Alt+T to keyboard shortcuts table, add /fast and /debug commands
- Update Features Meconnues table with Agent Teams and Auto-Memories
- Clean up cheatsheet.md.bak
- Bump version to 3.26.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-11 16:12:36 +01:00
Florian BRUNIAUX
ef7cdd899e release: v3.24.0 - Agent Evaluation Framework
Major addition: Complete agent evaluation framework with production-ready template.

## Added

- **Resource Evaluation**: nao framework (score 3/5)
  - Identified critical gap: agent evaluation not documented
  - Technical challenge adjusted score 2/5 → 3/5
  - All claims fact-checked (TypeScript 58.9%, Python 38.5%)

- **Guide Section**: Agent Evaluation (guide/agent-evaluation.md, ~3K tokens)
  - Metrics: response quality, tool usage, performance, satisfaction
  - Patterns: logging hooks, unit tests, A/B testing, feedback loops
  - Example: analytics agent with built-in metrics
  - Tools: nao framework reference, Claude Code hooks integration

- **AI Ecosystem**: Section 8.2 Domain-Specific Agent Frameworks
  - nao (Analytics Agents): Database-agnostic, built-in evaluation
  - Transposable patterns: context builder, evaluation hooks, DB integrations

- **Template**: Analytics Agent with Evaluation (5 files, ~1K lines)
  - README: setup, usage, troubleshooting
  - Agent: SQL generator with evaluation criteria, safety rules
  - Hook: automated metrics logging (safety, performance, errors)
  - Script: analysis with stats, safety reports, recommendations
  - Report template: monthly evaluation format

## Changed

- Agent Evaluation Guide: updated template references, verified links
- Landing Site: templates count 110 → 114
- Version: 3.23.5 → 3.24.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 11:52:13 +01:00
Florian BRUNIAUX
1fb783ebb8 release: v3.23.5 - Aristote claude-mem analysis prompt
Added comprehensive 6-phase analysis framework for claude-mem integration
targeting the Aristote project (EdTech, Next.js/tRPC/Prisma).

Key features:
- Audit current memory stack (Serena, grepai, 9 hooks)
- Cost/ROI estimation ($3.75/month API, 50-100x ROI factor)
- Integration plan with hooks synergy (activity-logger, serena-sync)
- Go/No-Go decision criteria
- Self-contained prompt template in claudedocs/prompts/

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 11:03:52 +01:00
Florian BRUNIAUX
d5c3a82cac docs: add claude-mem plugin documentation (automatic session memory)
Integrate claude-mem (thedotmack/claude-mem) into the guide as Section 8.2.5.
Score: 4/5 (High Value - automatic session capture fills documentation gap).

Added:
- Section 8.2.5: claude-mem plugin (automatic session memory)
  * Automatic capture via lifecycle hooks
  * AI compression + progressive disclosure (10x tokens)
  * Web dashboard at localhost:37777
  * Natural language search
  * Privacy controls (<private> tags)
  * Cost analysis ($0.15/100 obs)
  * AGPL-3.0 licensing considerations

- Memory Tools Decision Matrix (claude-mem vs Serena vs grepai)
  * 4-layer memory stack pattern
  * Integrated workflow examples
  * When to use automatic vs manual memory

- Plugin template: examples/plugins/claude-mem.md
  * Installation, configuration, troubleshooting
  * Advanced features (progressive disclosure, endless mode)
  * Export/import, cost optimization

- Resource evaluation: docs/resource-evaluations/claude-mem-evaluation.md
  * Technical analysis (fact-checked stats)
  * Comparison to existing tools
  * Integration recommendations

- reference.yaml: 14 new claude-mem entries

Changed:
- Updated search tools comparison (5 tools: rg, grepai, Serena, ast-grep, claude-mem)
- Extended feature matrix with "Auto capture" and "Web dashboard" rows

Stats (verified 2026-02-10):
- 26.5k GitHub stars, 1.8k forks
- 181 releases, 46 contributors
- Latest: v9.1.1 (Feb 7, 2026)
- License: AGPL-3.0 + PolyForm Noncommercial

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 08:47:17 +01:00
Florian BRUNIAUX
89084c89ec docs: integrate Anthropic 2026 Agentic Coding Trends Report
Integration strategy: cross-cutting distribution (~450 lines across 5 files)
instead of monolithic Section 9.21 (rejected after technical-writer review).

Evaluation: 4/5 score (high value, but lacks concrete code examples)
Source: https://resources.anthropic.com/hubfs/2026%20Agentic%20Coding%20Trends%20Report.pdf

Changes:
1. Created evaluation report (docs/resource-evaluations/)
   - Summary, gap analysis, challenge results, fact-check
   - Justification: industry validation, benchmarks, anti-patterns

2. Modified guide/ultimate-guide.md (3 insertions, ~270 lines)
   - Section 9 intro: Industry context callout with adoption data
   - Section 9.17 Multi-Instance: ROI benchmarks ($500-1K/month validation)
   - Section 9.11: Enterprise Anti-Patterns section (5 detailed patterns)

3. Modified guide/workflows/agent-teams.md (~80 lines)
   - Industry adoption data with case studies
   - Timeline: 3-6 months, success rates by phase
   - Real-world performance metrics (Fountain 50%, Rakuten 7h, TELUS 500K hours)

4. Modified machine-readable/reference.yaml (~40 lines)
   - Added agentic_trends_2026_* metadata section
   - Research data, case studies, benchmarks, anti-patterns references

5. Modified README.md (~8 lines)
   - Added "Research & Industry Reports" section
   - Link to Anthropic report with evaluation details

Stats validated: 60% AI usage, 0-20% full delegation, 67% more PRs/day,
27% new work, 7 case studies (Fountain, Rakuten, CRED, TELUS, Legora, Zapier, Augment).

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 17:18:52 +01:00
Florian BRUNIAUX
191ff42741 release: v3.23.4 - Agent Anti-Patterns & Scope-Focused Refactoring
Major conceptual refactoring based on Dex Horthy's principle:
"Subagents are not for anthropomorphizing roles, they are for controlling context"

### Added (1 new section)
- Agent Anti-Patterns section (§9.17, line 3662)
  - Wrong vs Right table (anthropomorphizing vs context control)
  - When to use agents (context isolation, parallel processing, scope limitation)
  - When NOT to use agents (fake teams, roleplaying, mimicking org structure)

### Changed (18 files, 200+ lines)
- Section rename: "Split-Role Sub-Agents" → "Scope-Focused Agents"
- Agent definitions: "Specialized role" → "Context isolation tool"
- 8 custom agent examples refactored (guide + examples/agents/)
- 10+ prompt examples with explicit scope boundaries
- 4 workflow files updated (agent-teams, TDD, iterative refinement)
- Terminology replacements:
  * "Specialized agents" → "Scope-focused agents"
  * "Expert personas" → "Context boundaries"
  * "Multi-domain expertise" → "Multi-scope analysis"

### Fixed
- Methodologies: Clarification note for BMAD role-based naming

Breaking change: Conceptual shift from role-based to scope-based agent usage.
All examples now demonstrate context isolation instead of persona simulation.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 10:29:59 +01:00
Florian BRUNIAUX
17846b1179 docs: complete Wasp fullstack essentials integration
Complete all 4 action items from wasp-fullstack-essentials-eval.md resource
evaluation (score 3/5). Framework-agnostic insights only, promotional content
excluded.

Changes (3 sections):

1. Background tasks workflow (Section 9.5)
   - New subsection: "Background Tasks for Fullstack Development"
   - When to background tasks (5 scenarios table)
   - Fullstack workflow pattern with examples
   - Context rot prevention strategies
   - Limitations and workarounds
   - Integration with teleportation
   - /tasks monitoring guide
   - ~100 lines added to Section 9.5 "Tight Feedback Loops"

2. Chrome DevTools MCP (mcp-servers-ecosystem.md)
   - New server entry in "Browser Automation" section
   - Official Anthropic server (not community)
   - Comparison table vs Playwright MCP (debugging vs testing)
   - Setup and configuration
   - Use cases and limitations
   - Updated stats: 3 browser servers (was 2), 6 official servers (was 5)
   - ~60 lines added to Browser Automation section

3. Convention-over-config for AI (Section 9.18.1)
   - New subsection: "Convention-Over-Configuration for AI Agents"
   - Why opinionated frameworks reduce agent cognitive load
   - Comparison table: custom vs opinionated architectures
   - Examples: Next.js, Rails, Phoenix, Django
   - Real-world impact on agent productivity
   - Trade-offs analysis
   - Connection to CLAUDE.md sizing (token reduction)
   - ~60 lines added to Section 9.18.1

Total additions: ~220 lines (workflow patterns + MCP server + AX framework)

Source evaluation: docs/resource-evaluations/wasp-fullstack-essentials-eval.md
Primary sources: llmstxt.org (llms.txt), official docs (background tasks, Chrome
DevTools MCP), existing Section 9.18 (Marmelab/AX framework)

Related commits:
- 783c43b: llms.txt conceptual documentation (completed earlier)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 10:00:53 +01:00
Florian BRUNIAUX
783c43baed docs: add llms.txt conceptual documentation to Section 9.18
Add comprehensive llms.txt documentation based on Wasp fullstack essentials
resource evaluation (score 3/5). Sourced from llmstxt.org spec, not the
promotional article.

Changes:
- New section 9.18.4: Documentation Formats for Agents (llms.txt)
- Explains llms.txt standard, format, and use cases
- Clarifies complementarity with Context7 MCP (not opposition)
- Provides minimal and advanced examples with line numbers
- Integration patterns with CLAUDE.md
- References this repo's own llms.txt implementation
- Updated section numbering (9.18.4-9.18.11)
- Updated Section 9.18 TL;DR with new principle
- Added reference.yaml entries for llms.txt

Resource evaluation:
- File: docs/resource-evaluations/wasp-fullstack-essentials-eval.md
- Source: Wasp DevRel blog (framework-agnostic insights extracted)
- Score: 3/5 (partial integration, promotional content excluded)
- Gap identified: Embarrassing to have llms.txt file without explaining concept
- Primary source: llmstxt.org specification

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 09:46:50 +01:00
Florian BRUNIAUX
734a1cbef7 docs: add Zadig&Voltaire case study + awesome-claude-skills taxonomy
- Ecosystem: Zadig&Voltaire migration path (Cursor → Windsurf → Claude Code)
  with team productivity data (7 → 27 MRs/week, 4x acceleration)
  Source: Benjamin Calef, tech.zadig-et-voltaire.com (Feb 2026)
- Ecosystem: awesome-claude-skills (BehiSecc) - 62 skills across 12 categories
  Evaluation: 3/5, complementary to awesome-claude-code

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-08 15:34:46 +01:00
Florian BRUNIAUX
a68a1bd52b docs: add Agent Teams Quick Start Guide (v3.23.2)
Added practical 8-10 min guide for using agent teams in real projects.

Added:
- guide/workflows/agent-teams-quick-start.md (580 lines)
  - 5-minute setup walkthrough
  - 4 copy-paste patterns (Guide + RTK projects)
  - Decision matrix (10+ scenarios)
  - Success metrics framework
  - Red flags section

Updated:
- guide/workflows/agent-teams.md: Link to quick start
- guide/ultimate-guide.md: Section 9.20 with quick start link
- machine-readable/reference.yaml: agent_teams_quick_start entry
- CHANGELOG.md: Release v3.23.2
- VERSION: 3.23.1 → 3.23.2

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-08 11:47:40 +01:00
Florian BRUNIAUX
36880caf29 docs: update Claude Code releases to v2.1.37
- Update latest tracked version: v2.1.33 → v2.1.37
- v2.1.36: Fast mode now available for Opus 4.6
- v2.1.34: Security fix for sandbox-excluded commands bypass
- v2.1.37: Fixed /fast availability after /extra-usage
- Fix reference.yaml line numbers for architecture.md

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-08 11:25:05 +01:00
Florian BRUNIAUX
4c0e4b6ac6 docs: integrate Gur Sannikov ADR workflow + native capabilities audit (4/5)
- Add ADR-Driven Development pattern to methodologies.md (~60 lines)
  - Pattern: ADR → skill → native execution
  - Example ADR template (database migration)
  - Complete bash workflow with benefits
- Add Native Capabilities Audit checklist to architecture.md (~50 lines)
  - 11 native capabilities with internal links
  - Onboarding tip for comprehension audit
- Add Dynamic Model Switching pattern to cheatsheet.md (~40 lines)
  - Pattern: Sonnet → Opus → Sonnet
  - Cost comparison table and best practices
- Add Community Validation to architecture.md (~15 lines)
  - External validation of 'less scaffolding, more model' approach
  - Cursor power user adopting Agent Skills standard
- Track evaluation in docs/resource-evaluations/ (full methodology)
- Update evaluations count: 24 → 55 (README + reference.yaml)
- Update CHANGELOG.md with integration details

Source: https://www.linkedin.com/posts/gursannikov_claudecode-embeddedengineering-aiagents-activity-7423851983331328001-DrFb
Score: 4/5 (HIGH VALUE) - fills ADR workflow gap + onboarding checklist

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-07 16:12:53 +01:00
Florian BRUNIAUX
b48d95c024 feat: add agent/skill quality audit tooling + Grenier evaluation
AUDIT TOOLING (3 templates):
- Command: /audit-agents-skills (quick project audits)
  - 16-criteria framework (Identity 3x, Prompt 2x, Validation 1x, Design 2x)
  - Weighted scoring: 32 pts (agents/skills), 20 pts (commands)
  - Production grading (A-F, 80% threshold)
  - Fix mode with actionable suggestions
- Skill: audit-agents-skills (advanced audits)
  - 3 modes: Quick (top-5), Full (all 16), Comparative (vs templates)
  - JSON + Markdown output for CI/CD
- Scoring grids: criteria.yaml (externalized for reuse)
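
A minimal sketch of how the weighted scoring above could add up. The category weights, the 32-point maximum, and the 80% threshold come from this message; the split of 4 criteria per category is an assumption chosen because it reproduces the 32-point total, and the function names are hypothetical, not the contents of criteria.yaml.

```python
# Category weights as listed in the commit message.
WEIGHTS = {"identity": 3, "prompt": 2, "validation": 1, "design": 2}

# Assumption: 4 criteria per category (16 total), which yields the
# 32-point maximum quoted for agents/skills: 4 * (3 + 2 + 1 + 2) = 32.
CRITERIA_PER_CATEGORY = 4

# "80% threshold" for production grading, from the commit message.
PRODUCTION_THRESHOLD = 0.80


def max_score() -> int:
    """Highest reachable weighted score under the assumed criteria split."""
    return CRITERIA_PER_CATEGORY * sum(WEIGHTS.values())


def weighted_score(passed: dict) -> int:
    """Sum weight * criteria met; `passed` maps category -> count (0..4)."""
    total = 0
    for category, weight in WEIGHTS.items():
        count = passed.get(category, 0)
        if not 0 <= count <= CRITERIA_PER_CATEGORY:
            raise ValueError(f"{category}: expected 0..{CRITERIA_PER_CATEGORY}, got {count}")
        total += weight * count
    return total


def is_production_ready(passed: dict) -> bool:
    """True when the weighted score reaches the 80% threshold."""
    return weighted_score(passed) / max_score() >= PRODUCTION_THRESHOLD


if __name__ == "__main__":
    audit = {"identity": 4, "prompt": 3, "validation": 2, "design": 4}
    print(weighted_score(audit), max_score(), is_production_ready(audit))
```

Under these assumptions, an agent that misses one Prompt criterion and two Validation criteria scores 28 of 32 (87.5%) and still clears the threshold, because Identity carries the most weight.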

EVALUATION:
- Grenier agent/skill quality (3/5 - Moderate Value)
  - Gap: 29.5% deploy without evaluation (LangChain 2026)
  - Integration: Created audit command + skill + criteria
  - Industry context: 18% cite agent bugs as top challenge

DOCUMENTATION:
- Guide refs: 2 strategic call-outs (after Agent/Skill validation)
- CHANGELOG: New "Added" section + evaluation details
- README: Templates 106→107, Evaluations 49→24 (count corrections)
- reference.yaml: 10 new audit entries + updated counts

SYNC:
- Landing index.html: Templates 107, Evals 24, Quiz 257
- Landing examples/index.html: Templates 107

FILES: 14 changed, 4148 insertions (+1250 lines new audit content)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-07 15:40:18 +01:00
Florian BRUNIAUX
99cca4f498 docs: update Claude Code releases to v2.1.33 + guide v3.23.1
Claude Code v2.1.33 (2026-02-06):
- Agent teams fixes (tmux sessions, availability warnings)
- TeammateIdle and TaskCompleted hook events for multi-agent workflows
- Agent frontmatter: memory field (user/project/local), Task(agent_type) syntax
- Plugin name in skill descriptions and /skills menu
- VSCode: Remote sessions, branch/message count in session picker
- Multiple stability fixes and improved error messages

Guide v3.23.1:
- Updated Claude Code releases tracking (YAML + Markdown)
- Version synced across all documentation files

Files updated:
- machine-readable/claude-code-releases.yaml (latest: 2.1.33)
- guide/claude-code-releases.md (v2.1.33 section + footer date)
- CHANGELOG.md ([3.23.1] entry)
- VERSION (3.23.0 → 3.23.1)
- Auto-synced: README.md, cheatsheet.md, ultimate-guide.md, reference.yaml

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-06 11:07:52 +01:00
Florian BRUNIAUX
c81180aec7 feat: adaptive onboarding architecture v2.0.0 (v3.23.0)
Major overhaul of onboarding system with adaptive topic selection based on
user context and keywords. Addresses 8 critical gaps identified by the
technical-writer agent challenge.

Core Changes:
- Adaptive matrix: core topics (always) + adaptive topics (keyword-triggered)
- Security-first: moved sandbox_native_guide to beginner_5min (before commands)
- Time budget validation: all 18 profiles validated at 6-8 min/topic
- Quiz integration: positioned as exit activity in Phase 4 wrap-up
- New learn_security goal with 2 profiles (beginner_15min, advanced_60min)
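
The core-plus-adaptive selection described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the onboarding matrix itself: only `sandbox_native_guide` and the idea of a commands topic appear in this message, while the adaptive topic names, their keywords, and `select_topics` are hypothetical.

```python
import re

# Core topics are always included; sandbox_native_guide comes first,
# mirroring the security-first ordering in the commit message.
CORE_TOPICS = ["sandbox_native_guide", "commands"]

# Hypothetical adaptive topics and trigger keywords (assumptions).
ADAPTIVE_TOPICS = {
    "hooks": {"hook", "hooks", "automation"},
    "mcp_servers": {"mcp", "server", "servers"},
}


def select_topics(user_text: str, max_topics: int) -> list:
    """Return core topics plus keyword-triggered topics, capped at max_topics."""
    words = set(re.findall(r"[a-z0-9_]+", user_text.lower()))
    selected = list(CORE_TOPICS)
    for topic, keywords in ADAPTIVE_TOPICS.items():
        if len(selected) >= max_topics:
            break  # time budget exhausted; skip remaining adaptive topics
        if keywords & words:
            selected.append(topic)
    return selected


if __name__ == "__main__":
    print(select_topics("I want to set up an MCP server", max_topics=4))
```

In this sketch the cap only limits adaptive topics, so the core topics survive even under a tight time budget.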

Technical Improvements:
- Added onboarding_matrix_meta for version tracking and maintenance triggers
- Created validation script (validate-onboarding.sh) with 6 automated checks
- Created automation script (detect-new-onboarding-topics.sh) for monthly reviews
- Fixed 8 missing deep_dive keys (rules, workflow, fix, architecture, etc.)
- Removed duplicate deep_dive section causing validation failures

Documentation:
- README.md: version 3.23.0, harmonized counts (106 templates, 49 evaluations)
- CHANGELOG.md: comprehensive v3.23.0 entry with all changes
- Onboarding-prompt.md: updated Phase 1.5, 2, 4 with adaptive logic
- reference.yaml: 180+ lines added for adaptive architecture

Validation:
- All 18 profiles pass time budget constraints (30-50% buffer maintained)
- All deep_dive keys verified (no missing references)
- Version synchronized across 6 files via sync-version.sh

Challenge: technical-writer agent identified 8 gaps in initial analysis
Result: Full adaptive approach implemented, all gaps addressed

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-05 22:19:58 +01:00
Florian BRUNIAUX
de4b438a72 docs: update Claude Code releases to v2.1.32 + guide v3.22.1
Claude Code v2.1.32 (2026-02-05):
- Opus 4.6 now available
- Agent teams research preview (experimental, token-intensive)
- Automatic memory recording and recall
- "Summarize from here" for partial conversation summaries
- Skills auto-load from .claude/skills/ in --add-dir directories
- Fixes: @ file completion paths, bash heredoc JS template literals
- [VSCode] Slash commands execution fix, conversation loading spinner

Guide updates:
- Bumped guide version: 3.22.0 → 3.22.1 (patch)
- Updated CHANGELOG with v2.1.32 highlights
- Synced version across all documentation files
- Updated machine-readable tracking (YAML + Markdown)

Synchronization:
- Landing version badge updated: v2.1.31 → v2.1.32
- Landing footer version: v3.21.1 → v3.22.1
- Landing releases timeline updated with Opus 4.6 features
- All sync checks passing (templates, quiz, lines, stars)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-05 21:04:55 +01:00
Florian BRUNIAUX
f830b1c836 chore: bump version to 3.22.0
Release dual-instance planning pattern (Jon Williams).

## Changes

- VERSION: 3.21.1 → 3.22.0
- CHANGELOG.md: Move [Unreleased] → [3.22.0] - 2026-02-04
  - Dual-instance planning pattern entry (~15 bullet points)
  - Documentation, Added section, full integration details
- Synced across: README.md, guide/cheatsheet.md, guide/ultimate-guide.md, reference.yaml

## Release Content

**Section 9.17.1**: Alternative Pattern: Dual-Instance Planning (~350 lines)
**Workflow**: guide/workflows/dual-instance-planning.md (~750 lines)
**Evaluation**: docs/resource-evaluations/jon-williams-dual-instance-pattern.md (~300 lines)
**References**: 15 new entries in reference.yaml

## Stats

- Total lines added: ~1,400
- Files modified: 10
- Evaluation score: 4/5 (High Value)
- Pattern: Vertical separation (planner ↔ implementer)
- Target audience: Solo devs, spec-heavy work, $100-200/month

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-04 10:41:55 +01:00
Florian BRUNIAUX
9c5d030b11 docs: add dual-instance planning pattern (Jon Williams)
Add vertical separation pattern (planner/implementer) as complement to horizontal scaling (Boris pattern).

## Changes

**Main guide (ultimate-guide.md)**:
- New Section 9.17.1: "Alternative Pattern: Dual-Instance Planning" (~350 lines)
  - When to use (solo devs, spec-heavy, $100-200/month)
  - Setup instructions (2 Claude instances, Plans/ directory)
  - Complete workflow (5 phases: planning, review, implementation, verification, archive)
  - Comparison table (Boris horizontal vs Jon vertical scaling)
  - Cost analysis (2 instances vs correction loops)
  - Agent-ready plan best practices
  - Limitations and tips

**Workflow file (workflows/dual-instance-planning.md)**:
- Full workflow guide (~750 lines)
- Complete example (JWT auth implementation)
- Plan template (ready to copy-paste)
- Cost breakdown and decision matrix
- Troubleshooting and bash aliases

**References updated**:
- machine-readable/reference.yaml: 15 new entries
  - dual_instance_planning, dual_instance_workflow, etc.
  - Line numbers, source attribution, metadata
- guide/workflows/plan-driven.md: Link in See Also section
- README.md: Update evaluation count (46 → 47)

**Evaluation documented**:
- docs/resource-evaluations/jon-williams-dual-instance-pattern.md
  - Full methodology (fetch, analyze, challenge, fact-check)
  - Score progression (2-3/5 → 4/5 after technical-writer challenge)
  - Gap analysis, comparison, integration rationale

## Source
LinkedIn post by Jon Williams (Product Designer, UK)
Date: 2026-02-03
URL: https://www.linkedin.com/posts/thatjonwilliams_ive-been-using-cursor-for-six-months-now-activity-7424481861802033153-k8bu

Context: Transition from Cursor (6 months) to Claude Code with Opus 4.5
Pattern: Vertical separation (Claude Zero: planning/review, Claude One: implementation)
Distinction: Orthogonal to Boris pattern (vertical vs horizontal scaling)

## Stats
- Lines added: ~1,400
- Files modified: 4
- Files created: 2 (workflow + evaluation)
- References added: 15 (reference.yaml)
- Evaluation score: 4/5 (High Value)
- Integration time: ~2.5 hours

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-04 10:38:10 +01:00
Florian BRUNIAUX
b15647d57f docs: add Git MCP Server (Official) comprehensive documentation
Integration:
- New section "Version Control (Official Servers)" in mcp-servers-ecosystem.md (~1600 words)
- Decision matrix: Git MCP vs GitHub MCP vs Bash tool (11 operations)
- 12 tools documented with setup, config, use cases, limitations
- Resource evaluation file created (git-mcp-server-evaluation.md)
- Machine-readable index updated (11 new entries)
- Evaluation count corrected: 36 → 46 (actual file count)

Score: 5/5 (CRITICAL) after technical-writer challenge
Gap filled: Official Git server 0% documented → 100% comprehensive

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-03 17:20:12 +01:00