Commit graph

204 commits

Author SHA1 Message Date
Florian BRUNIAUX
2874eee382 docs(diagrams): add budget modifier to model selection decision flow
The diagram implicitly assumed an unlimited budget (Max/API).
Added a budget modifier table and a community pattern based on
feedback from Frédéric Camblor (Teams Standard $25/mo: Sonnet plan → Haiku impl).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-23 11:33:23 +01:00
Florian BRUNIAUX
ce9e8718df feat(diagrams): add per-node click hrefs to all 40 flowchart diagrams
Each flowchart node now links to the specific guide section it represents
via Mermaid `click NodeId href "URL"` directives. sequenceDiagram blocks
(10 total) are skipped as Mermaid does not support click directives for them.
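
As a rough illustration of the `click NodeId href "URL"` directives mentioned above, the sketch below generates one directive per node from a mapping. The node IDs and anchor URLs are hypothetical and are not taken from the actual diagram files.

```python
def click_directives(node_links: dict[str, str]) -> list[str]:
    """Build one Mermaid click directive per flowchart node."""
    return [f'    click {node_id} href "{url}"' for node_id, url in node_links.items()]


# Hypothetical node IDs and anchors, for illustration only.
links = {
    "ChooseModel": "../ultimate-guide.md#model-selection",
    "CheckBudget": "../ultimate-guide.md#cost-optimization",
}

diagram = "\n".join(
    ["flowchart TD", "    ChooseModel --> CheckBudget", *click_directives(links)]
)
print(diagram)
```

The directives are appended after the edges so every referenced node already exists when the directive is parsed.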

Files updated:
- 01-foundations.md through 10-adoption-and-learning.md
- All flowchart nodes linked to their respective guide anchors
- Architecture, MCP, security, workflows, multi-agent, cost, adoption sections

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-22 22:21:30 +01:00
Florian BRUNIAUX
24b464802e fix(diagrams): replace \n with <br/> in all Mermaid node labels
Mermaid does not support \n for line breaks — rendered literally.
Replaced all 276 occurrences with <br/> across 10 diagram files.
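
A minimal sketch of this kind of replacement, assuming the diagram source is available as a string. The function name is hypothetical, and unlike the commit it replaces the literal sequence anywhere in the source rather than only inside node labels.

```python
def replace_literal_newlines(mermaid_source: str) -> tuple[str, int]:
    """Swap the two-character sequence backslash + 'n' for <br/>.

    Real newline characters, which separate Mermaid statements, are untouched.
    """
    literal = "\\n"  # a backslash followed by the letter n, not a line break
    count = mermaid_source.count(literal)
    return mermaid_source.replace(literal, "<br/>"), count


# Hypothetical diagram; raw strings keep the backslash-n sequences literal.
source = "\n".join(
    [
        "flowchart TD",
        r'    A["Plan\nwith Sonnet"] --> B["Implement\nwith Haiku"]',
    ]
)

fixed, replaced = replace_literal_newlines(source)
print(replaced)
print(fixed)
```

Returning the count alongside the text makes it easy to report a total such as the one quoted in the commit message.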

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-22 16:17:52 +01:00
Florian BRUNIAUX
92643c1a6b docs(security): update threat-db v2.2.0 — CVE-2026-0755, mcp-run-python SSRF, 5 new scanners
New CVEs:
- CVE-2026-0755 (gemini-mcp-tool, CVSS 9.8, RCE, no fix yet)
- SNYK-PYTHON-MCPRUNPYTHON-15250607 (mcp-run-python SSRF via Deno sandbox)

New entries:
- Attack technique T010: Agent-to-Agent Communication Injection
- 5 new scanning tools: Proximity, Enkrypt AI, Cisco MCP Scanner, NeuralTrust, MCPScan.ai
- 1 new defensive resource: Anthropic Claude Code Security (2026-02-21)
- 4 new sources (Lakera, Penligent AI, Snyk, THN)

Updated security-hardening.md: added CVE-2026-0755 and mcp-run-python SSRF to CVE table

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-22 16:14:34 +01:00
Florian BRUNIAUX
8a75fd2fd2 fix(diagrams): add color:#333 to light-fill nodes for dark mode contrast
Nodes with fill:#F5E6D3 (beige), fill:#B8B8B8 (gray), fill:#7BC47F (green)
had no explicit text color — GitHub dark mode rendered white text on light
backgrounds, making them unreadable.

Added color:#333 to all 3 light fills across all 10 diagram files.
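
One way such a fix could be scripted is sketched below. It assumes the fills appear in `style` or `classDef` lines written exactly as `fill:#RRGGBB`, and it skips lines that already declare a text color. The helper name and the sample diagram are hypothetical.

```python
LIGHT_FILLS = ("#F5E6D3", "#B8B8B8", "#7BC47F")  # beige, gray, green


def add_text_color(mermaid_source: str, color: str = "#333") -> str:
    """Append an explicit text color to lines that use a light fill."""
    fixed_lines = []
    for line in mermaid_source.splitlines():
        uses_light_fill = any(f"fill:{fill}" in line for fill in LIGHT_FILLS)
        # Leave lines alone if they already set a text color.
        if uses_light_fill and "color:" not in line:
            for fill in LIGHT_FILLS:
                line = line.replace(f"fill:{fill}", f"fill:{fill},color:{color}")
        fixed_lines.append(line)
    return "\n".join(fixed_lines)


# Hypothetical diagram: node A has a light fill, node B already sets its color.
source = "\n".join(
    [
        "flowchart TD",
        "    A[Start] --> B[End]",
        "    style A fill:#F5E6D3,stroke:#999",
        "    style B fill:#1F2937,color:#fff",
    ]
)
print(add_text_color(source))
```

The match is case-sensitive, so a fill written as `#f5e6d3` would need normalizing first.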

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-22 15:59:07 +01:00
Florian BRUNIAUX
dbb62306d7 release: v3.28.1 - Visual Diagrams Series (40 Mermaid diagrams)
guide/diagrams/: new directory with 40 interactive Mermaid diagrams
- 10 thematic files: foundations, context/sessions, configuration,
  architecture, MCP ecosystem, dev workflows, multi-agent patterns,
  security/production, cost/optimization, adoption/learning
- Each diagram: Mermaid (GitHub-native) + ASCII fallback + source link
- Bold Guy palette (6-color system) consistent across all diagrams
- README with index, visual palette legend, navigation by use case

Also includes (backlog from v3.28.0→v3.28.1):
- guide/ultimate-guide.md: Managing Large MCP Server Sets, AI Code
  Disclosure Policy, claude-mem Gemini alternative, observability
- guide/workflows/plan-driven.md: Boris Tane custom markdown plans (+172L)
- guide/security-hardening.md: Part 4 PR security review workflow
- examples/agents/security-patcher.md: new security agent
- examples/hooks/bash/security-gate.sh: PreToolUse security hook
- guide/observability.md: activity monitoring, external tools, proxying
- docs/resource-evaluations/: 4 new evaluations (Boris Cherny, Moigneu,
  Boris Tane, Aristote AI instructions)
- README.md: Visual Diagrams section in "What Makes This Guide Unique"

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-22 15:51:21 +01:00
Florian BRUNIAUX
9218ab37d6 feat: security scanning workflow (auditor + patcher + gate hook)
- security-hardening.md Part 4: PR security review workflow
  3-agent pipeline: scan → data flow trace → patch
  Table by change type (auth, DB, upload, deps)
  Git pre-push hook to flag sensitive files
- security-patcher agent: applies the security-auditor's findings
  Proposes before writing, never acts autonomously (human approval gate)
  Clear separation of detect vs patch
- security-gate.sh hook: PreToolUse, blocks 7 vulnerable patterns
  SQLi, XSS innerHTML, hardcoded secrets, dynamic eval(),
  weak hashing (MD5/SHA1 password), command injection, path traversal
  Complements dangerous-actions-blocker.sh (system ops)
- Claude Code Security (research preview) documented in security-hardening.md
  Comparison of Security Auditor Agent vs the Anthropic feature
- reference.yaml: 4 new indexed entries
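
To give a feel for pattern-based gating of the kind described for security-gate.sh, here is a small sketch covering four of the seven categories. The real hook is a bash script whose rules are not shown here, so every regex and name below is an assumption chosen for illustration.

```python
import re

# Hypothetical patterns; the actual rules in security-gate.sh may differ.
PATTERNS = {
    "xss-innerhtml": re.compile(r"\.innerHTML\s*="),
    "dynamic-eval": re.compile(r"\beval\s*\("),
    "weak-hash": re.compile(r"\b(md5|sha1)\s*\(", re.IGNORECASE),
    "path-traversal": re.compile(r"\.\./"),
}


def find_violations(content: str) -> list[str]:
    """Return the sorted names of all patterns found in the content."""
    return sorted(name for name, pattern in PATTERNS.items() if pattern.search(content))


print(find_violations("el.innerHTML = userInput"))
print(find_violations("total = price * quantity"))
```

A gate built on this would refuse the write whenever the returned list is non-empty. Simple regexes like these produce false positives, which is one reason to pair them with a human approval step.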

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-22 15:21:35 +01:00
Florian BRUNIAUX
ac50ee7ad8 docs: add monitoring & activity audit sections to observability guide
- guide/observability.md: +3 sections (Activity Monitoring, External Tools, Proxying)
  - Activity Monitoring: JSONL tool_use audit, jq queries, sensitive pattern detection
  - External Tools: ccusage / claude-code-otel / Akto / MLflow / ccboard comparison
  - Proxying: NODE_EXTRA_CA_CERTS, ANTHROPIC_API_URL, mitmproxy, Python proxy
- docs: ccboard Activity module implementation plan (Tab 10, Rust models, SQLite cache)
- docs: Mergify cross-system support evaluation

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-21 20:29:05 +01:00
Florian BRUNIAUX
6049bd99c2 release: v3.28.0 - Section 2.5 Model Selection & Thinking Guide
Consolidated canonical section for model selection:
- Section 2.5 (decision table, effort levels with prompts, model-per-agent patterns, thinking guide)
- 3 new agents: planner (Opus), implementer (Haiku), architecture-reviewer (Opus)
- 7 new quiz questions (09-037→09-043, intermediate→power)
- 3 redundant tables replaced by cross-refs to Section 2.5

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-21 18:25:50 +01:00
Florian BRUNIAUX
31802b61b1 release: v3.27.9
- Update Claude Code releases tracking (v2.1.49 → v2.1.50)
- Cut versioned release for Talk Preparation Pipeline, AGENTS.md research,
  Profile-Based Module Assembly (Section 3.5), ATDD expansion,
  Spec-First Task Granularity, team-config templates, and Mergify case study

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-21 17:27:32 +01:00
Florian BRUNIAUX
520b8e6fdd docs: update Claude Code releases to v2.1.50
- WorktreeCreate/WorktreeRemove hook events for custom VCS setup/teardown
- isolation: worktree in agent definitions (declarative worktree isolation)
- claude agents CLI command to list configured agents
- Opus 4.6 fast mode now has full 1M context window
- CLAUDE_CODE_DISABLE_1M_CONTEXT env var
- Major memory leak fixes + headless startup performance improvements

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-21 15:32:55 +01:00
Florian BRUNIAUX
e110346d34 docs: update Claude Code releases to v2.1.49
- Updated latest tracked version: v2.1.47 → v2.1.49
- v2.1.49 highlights: --worktree/-w flag + subagent isolation:worktree,
  background:true for agents, ConfigChange hook event, simple mode file edit
- Updated CHANGELOG [Unreleased] with CC release tracking entry

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-20 17:51:33 +01:00
Florian BRUNIAUX
00cb973bdb docs: add Talk Preparation Pipeline workflow + skill templates
6-stage pipeline: raw material → conference talk → Kimi AI slides.

New files:
- guide/workflows/talk-pipeline.md — Full workflow guide (REX vs Concept
  modes, stage-by-stage breakdown, CHECKPOINT mechanics, Kimi handoff,
  real-world DevWithAI example, 5 design patterns documented)
- examples/skills/talk-pipeline/ — 7 SKILL.md files + orchestrator + 2
  templates (feedback-draft.md, kimi-prompt-template.md)

Updated:
- README.md — badges 164→172 templates, Feb 20 date, 13→14 skills
- guide/workflows/README.md — Talk Pipeline entry under Design & Content
- machine-readable/reference.yaml — 16 new entries for pipeline components
- CHANGELOG.md — [Unreleased] entry

Design patterns showcased: skill chaining + file-based state, tool
permission scoping (Bash only Stage 2), human-in-the-loop CHECKPOINT
(Stage 4), AI-to-AI handoff (Claude → Kimi), dual execution modes
(REX/Concept).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-20 15:51:29 +01:00
Florian BRUNIAUX
6d847d24de docs: add Profile-Based Module Assembly pattern (Section 3.5)
- Section 3.5 "Team Configuration at Scale" in ultimate-guide.md:
  profiles YAML + shared modules + skeleton + assembler script;
  59% context token reduction measured on 5-dev production team;
  includes CI drift detection, 5-step replication guide, trade-offs
- New workflow: guide/workflows/team-ai-instructions.md (6 phases,
  scaling thresholds, troubleshooting table)
- New templates: examples/team-config/ (profile-template.yaml,
  claude-skeleton.md, sync-script.ts)
- reference.yaml: 9 new entries for team_ai_instructions_*
- README: templates count 161 → 164, date Feb 19 → Feb 20
- CHANGELOG [Unreleased]: resource evaluations (AGENTS.md ETH Zürich
  4/5, Sylvain Chabaud 3/5), spec-first Task Granularity section,
  methodologies ATDD expansion

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-20 15:04:29 +01:00
Florian BRUNIAUX
9b75b5125e release: v3.27.8 - prompt-based GitHub Actions code review workflow
New examples/github-actions/claude-code-review.yml with externalized prompt,
anti-hallucination protocol, /claude-review on-demand trigger. Templates 116→161.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-19 14:23:15 +01:00
Florian BRUNIAUX
373b50cf77 docs: add 4 resource evaluations + tasks/lessons.md pattern (v3.27.7+)
- Add tasks/lessons.md variant to Fresh Context Pattern section — lightweight
  self-improvement loop where Claude captures behavioral rules after corrections
- Resource evaluations (4 new, 67 → 78 total):
  - SKILLMIND/Boris Cherny workflow (2/5): self-improvement loop concept
  - Augment Context Engine MCP (3/5): cloud alternative to grepai
  - LinkedIn context engineering post (2/5): marginal relevance
  - LinkedIn activity-7428930570451083264 (inaccessible)
- README: update evaluations count 67 → 78, threat DB badge 18/341 → 19/655
- CLAUDE.md: add Whitepaper Generation section (PDF + EPUB commands)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-19 13:47:38 +01:00
Florian BRUNIAUX
895ace49f7 docs: add Borg et al. 2025 RCT on AI code maintainability (v3.27.7)
- Resource eval: arXiv:2507.00788 "Echoes of AI" (151 devs, 95% pros,
  2-phase blind RCT) — 30.7% faster median, ~55.9% habitual users,
  no significant downstream maintainability impact
- guide/learning-with-ai.md: citation + "On maintainability fear" note
- guide/ultimate-guide.md: nuance blockquote in §1.7 Trust Calibration
- machine-readable/reference.yaml: 4 new RCT/maintainability entries
- docs/resource-evaluations/: evaluation file with technical-writer audit

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-19 09:59:50 +01:00
Florian BRUNIAUX
5083d80ed1 docs: update Claude Code releases to v2.1.47
- v2.1.47: VS Code plan preview auto-updates, ctrl+f kills background agents,
  last_assistant_message hook field, 70+ bug fixes (PDF compaction, Unicode
  curly quotes, parallel file edits, OSC 8 hyperlinks, Windows hooks, worktrees)
- v2.1.46: claude.ai MCP connectors support, orphaned process fix on macOS
- Update latest tracked: v2.1.45 → v2.1.47

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-19 08:37:31 +01:00
Florian BRUNIAUX
267ce0ba86 chore: update guide content and add new examples
- Update ultimate guide and cheatsheet
- Update llms.txt and reference YAML
- Add velocity-governor hook and cc-sessions script

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-18 18:48:19 +01:00
Florian BRUNIAUX
61d7bb5fad docs: enrich 200K vs 1M section with community findings
- Add community verdict: 1M is niche premium, not default
- Add Gemini 1.5 Pro comparison ($3.50/$10.50 vs Opus $10/$37.50)
- Expand decision table: full-repo audit vs RAG vs refacto
- Add Sonnet 4.6 community report on precision degradation at 1M
- Add "check context at 70%, open new session" workaround pattern (HN)
- Add links to HN validation test + independent MRCR analysis

Sources: Perplexity deep search Reddit/HN/blogs, Feb 18 2026

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-18 10:43:46 +01:00
Florian BRUNIAUX
8d6c50403d docs: restore 93% @ 256K with source + add HN community validation
- Restore Opus 4.6 MRCR 93% @ 256K (confirmed: independent analysis of Anthropic data)
- Add Harry Potter needle test reference (HN 46905735: 49/50 spells at 733K tokens)
- Source: Perplexity deep search cross-validation, Feb 18 2026

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-18 10:41:03 +01:00
Florian BRUNIAUX
78f4dc4b42 fix: correct 200K vs 1M context section after source verification
- Fix beta header: interleaved-thinking → context-1m-2025-08-07
- Add Sonnet 4.6 long-context premium ($3→$6 / $15→$22.50 above 200K)
- Correct cost estimates: Sonnet 1M ~$4.13 (was $2.25), Opus 1M ~$6.88 (was $8.75)
- Remove unverified 93% @ 256K MRCR figure
- Add usage tier 4 requirement
- Clarify "all tokens" billed at premium when input >200K

Sources: Anthropic pricing docs + context-windows docs (verified Feb 18 2026)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-18 10:06:21 +01:00
Florian BRUNIAUX
c3da456d3a release: v3.27.6 - Sonnet 4.6 default + 200K vs 1M context guide
- Pricing table: Sonnet 4.6 now default (Feb 2026)
- New section: 200K vs 1M context decision guide (MRCR bench, cost table, use cases)
- threat-db.yaml v2.1.0: CVE-2026-23744, Slopsquatting T009, OWASP Agentic AI Top 10

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-18 09:33:55 +01:00
Florian BRUNIAUX
4b15bdb137 release: v3.27.5 - Claude Code v2.1.45 tracking
- Bump guide: v3.27.4 → v3.27.5
- Update Claude Code releases: v2.1.44 → v2.1.45
  - Sonnet 4.6 model support
  - spinnerTipsOverride setting
  - SDK SDKRateLimitInfo/SDKRateLimitEvent types
  - Agent Teams fix on Bedrock/Vertex/Foundry

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-17 21:12:53 +01:00
Florian BRUNIAUX
88c9f310a7 docs: update Claude Code releases to v2.1.45
- Updated latest tracked version: v2.1.44 → v2.1.45
- Highlights: Sonnet 4.6 support, spinnerTipsOverride setting,
  SDK SDKRateLimitInfo/SDKRateLimitEvent types,
  Agent Teams fix on Bedrock/Vertex/Foundry
- Updated CHANGELOG with release tracking note

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-02-17 20:57:57 +01:00
Florian BRUNIAUX
1f720f7d74 release: v3.27.4 - template count update + worktree docs
- Templates 113 → 116 (review-plan, git-worktree-status/remove/clean, rules)
- Commands 23 → 26 (worktree lifecycle suite)
- Git worktree section expanded with lifecycle commands table
- Version sync across all docs

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 11:15:32 +01:00
Florian BRUNIAUX
0d6a0c656e docs: add git-worktree suite, security kill switch, update reference.yaml + CC releases
- Git worktree commands: overhauled main + 3 new (status, remove, clean)
- Security hardening: AI Kill Switch & Containment Architecture (§3.5)
- DevOps SRE: cross-reference to security-hardening for AI incidents
- CC releases: v2.1.43-v2.1.44 tracking
- reference.yaml: 12 new entries, evaluations count 67 → 74

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 10:20:57 +01:00
Florian BRUNIAUX
e4d9d9e825 fix: correct v2.1.39/v2.1.41 feature attributions + add claude auth CLI docs
3 features were incorrectly attributed to v2.1.39 instead of v2.1.41
(guard nested sessions, OTel speed attribute, Agent Teams model fix).
Verified against official CHANGELOG. Also adds claude auth login/status/logout
to the ultimate guide maintenance commands table.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-16 20:37:08 +01:00
Florian BRUNIAUX
c7c63a1bc4 docs: update session-summary install to recommend plugin system
- Add plugin install as recommended method in Implementation section
- Fix hook event: Stop → SessionEnd (correct event)
- Add plugin install option in Installation section
- Keep manual install as alternative

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-16 14:55:10 +01:00
Florian BRUNIAUX
4cf1bf3cec docs: v3.27.3 — track Claude Code v2.1.42 + Google Antigravity section
- Claude Code releases: v2.1.41 → v2.1.42 (startup perf, prompt cache, Opus 4.6 effort callout)
- New AI ecosystem section: Google Antigravity agent-first IDE comparison
- Version sync across all docs (3.27.2 → 3.27.3)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-16 09:38:31 +01:00
Florian BRUNIAUX
0f4b1837c5 docs: v3.27.2 — YAML frontmatter on 90 markdown files
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-15 19:24:07 +01:00
Florian BRUNIAUX
624b2e0425 docs(guide): add YAML frontmatter to 15 workflow guide files
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-15 19:20:35 +01:00
Florian BRUNIAUX
ac9b07a837 docs(guide): add YAML frontmatter to 24 top-level guide files
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-15 19:20:31 +01:00
Florian BRUNIAUX
d1182af4cf docs: v3.27.1 — fact-check corrections, grepai docs, RTK overhaul
Fact-check (README positioning):
- Template count: 120/123 → 108 (ground truth recount)
- Ratio: 14× → 24× (19,000 ÷ 784 = 24.2×)
- everything-cc stars: 31.9k → 45k+ (verified Feb 15)
- Commands count: 20 → 23, hooks: 30 → 31

Added:
- Grepai MCP documentation (semantic search, call graphs)
- 3 hook templates (rtk-baseline, session-summary, session-summary-config)
- 2 resource evaluations (system-prompts update, qmd token savings)

Changed:
- RTK documentation overhaul (v0.7.0 → v0.16.0, rtk-ai org)
- Exports deprecated (kimi.pdf, notebooklm.pdf → deprecated/)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-15 18:41:45 +01:00
Florian BRUNIAUX
d94f6bf62e docs: update Claude Code releases tracking (v2.1.38 → v2.1.41)
- Add v2.1.39: recursive launch guard, Agent Teams 3P model fix, OTel speed
- Add v2.1.41: claude auth CLI, Windows ARM64, /rename auto-name
- Fix markdown missing v2.1.38 entry (security: heredoc, sandbox skills)
- Add heredoc command smuggling to security breaking summary

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 11:06:07 +01:00
Florian BRUNIAUX
d72905e9ba docs: integrate Entire CLI across guide (v3.27.0)
Major integration of Entire CLI, an agent-native platform launched
Feb 2026 by Thomas Dohmke (ex-GitHub CEO) with $60M funding. Provides
rewindable checkpoints, approval gates, and audit trails for AI sessions.

## Added (7 guide files + 3 meta files)

- **ai-traceability.md**: Replace git-ai 404 with Entire CLI (section 5.1)
- **third-party-tools.md**: Fill "Session replay" gap + add tool section
- **observability.md**: Add session portability alternative
- **ai-ecosystem.md**: Add governance-first orchestration (section 8.1.5)
- **ultimate-guide.md**: Enrich multi-instance section 9.17
- **security-hardening.md**: Add compliance audit trails (section 3.4)
- **cheatsheet.md**: Add Community Tools quick reference
- **README.md**: Update structure tree with third-party-tools mention
- **CHANGELOG.md**: Document v3.27.0 release
- **docs/resource-evaluations/entire-cli.md**: Formal evaluation (5/5)

## Fixed

- git-ai references (404 repo) replaced with working alternative
- "Session replay" Known Gap now marked as FILLED

## Key Features Documented

- Rewindable checkpoints (prompts + reasoning + tool usage)
- Governance layer (approval gates, permissions, audit trails)
- Multi-agent handoffs (Claude → Gemini with context)
- Compliance-ready (SOC2, HIPAA, FedRAMP)
- Session portability (path-agnostic vs native --resume)

## Positioning

- vs git-ai: Replaces non-existent tool (404)
- vs claude-code-viewer: Active replay vs read-only history
- vs Gas Town: Governance sequential vs parallel coordination

Files modified: 10 (7 content + 3 meta)
Words added: ~2,500
Version: 3.26.0 → 3.27.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-12 23:33:16 +01:00
Florian BRUNIAUX
971a297db3 feat(security): add threat intelligence DB, security commands, and cheatsheet audit fixes (v3.26.0)
- Add threat-db.yaml v2.0.0 with 63 malicious skills, 22 CVEs, 4 campaigns
- Add /security-check, /security-audit, /update-threat-db slash commands
- Add Snyk ToxicSkills evaluation (58th resource evaluation)
- Fix cheatsheet: add Alt+T to keyboard shortcuts table, add /fast and /debug commands
- Update Features Meconnues table with Agent Teams and Auto-Memories
- Clean up cheatsheet.md.bak
- Bump version to 3.26.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-11 16:12:36 +01:00
Florian BRUNIAUX
deb518ceff fix(security): fact-check corrections across threat-db and hardening guide
- CVE-2025-53109/53110: fix version 0.6.4 → 0.6.3 (per NVD/Cymulate)
- CVE-2025-53967: CVSS 8.0 → 7.5 (per NVD)
- CVE-2026-25536: add missing fixed_in 1.26.0
- CVE-2026-25546: add missing fixed_in 0.1.1
- Rename pseudo-CVE "claude-code-v2.1.34" → ADVISORY-CC-2026-001
- Fix Flatt Security URL to specific blog post
- Fix SentinelOne URL to specific CVE page

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-11 15:11:13 +01:00
Florian BRUNIAUX
7b0d65cb09 fix(docs): correct effort parameter documentation based on official Anthropic docs
- Fix API syntax: `thinking={}` → `output_config={"effort": "medium"}`
- Clarify scope: effort controls entire response (text + tools + thinking)
- Add official descriptions for max/high/medium/low levels
- Update control table: Opus 4.5 vs 4.6 support
- New subsection: "Effort and Tool Use" with concrete examples
- Clarify effort/thinking/budget_tokens relationship

Source: https://platform.claude.com/docs/en/build-with-claude/effort
Section: guide/ultimate-guide.md lines 10396-10442

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 17:38:08 +01:00
Florian BRUNIAUX
5f67bd8957 docs: add Robin Lorenz attribution with LinkedIn links
- CHANGELOG: Added LinkedIn profile and comment links
- Guide §2.5: Added direct quote with attribution
2026-02-10 13:18:58 +01:00
Florian BRUNIAUX
c958738c20 docs: integrate AI fatigue symptom recognition (score 3/5)
Add session time-boxing guidance and nondeterminism stress recognition
to learning-with-ai.md across 3 strategic locations (~220 words total).

Changes:
- Red Flags Checklist: Add session fatigue warning with time-boxing mitigation
  (30 min limit, max 3 attempts before manual implementation)
- Productivity Reality: Add nondeterminism stress paragraph (identical prompts
  → varying outputs causes AI fatigue)
- UVAL Protocol: Add Step 2.5 checkpoint for fatigue signal recognition
  (session duration, retry count, frustration assessment)

Rationale:
- Score 3/5: Moderate relevance (90% overlap with existing content)
- Extracted only novel tactics: session time-boxing (distinct from weekly 70/30)
- Rejected contradictory recommendations (70% quality vs understand 100%)
- Full evaluation + technical-writer challenge: docs/resource-evaluations/

Source: Siddhant Khare, "AI Fatigue is Real and Nobody Talks About It"
        (Feb 2026, https://siddhantkhare.com/writing/ai-fatigue-is-real)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 13:13:32 +01:00
Florian BRUNIAUX
9088d68b9f feat: add "From Chatbot to Context System" mental model (v3.25.0)
New Section §2.5:
- 4-layer framework: CLAUDE.md + Skills + Hooks + Memory
- Before/After comparison (chatbot mode vs context system)
- Addresses gap: features documented separately, never unified
- Inspired by Robin Lorenz feedback (17-agent orchestration)

Changes:
- Mistake #8 rewritten: "Treating Claude Code Like a Chatbot"
- Key mindset shift updated: "structured context system"
- Cross-references added: §1.6, §1.8, §2.5, §9.10

Impact: Shifts from "write better prompts" to "build context that compounds"

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 13:06:49 +01:00
Florian BRUNIAUX
ef7cdd899e release: v3.24.0 - Agent Evaluation Framework
Major addition: Complete agent evaluation framework with production-ready template.

## Added

- **Resource Evaluation**: nao framework (score 3/5)
  - Identified critical gap: agent evaluation not documented
  - Technical challenge adjusted score 2/5 → 3/5
  - All claims fact-checked (TypeScript 58.9%, Python 38.5%)

- **Guide Section**: Agent Evaluation (guide/agent-evaluation.md, ~3K tokens)
  - Metrics: response quality, tool usage, performance, satisfaction
  - Patterns: logging hooks, unit tests, A/B testing, feedback loops
  - Example: analytics agent with built-in metrics
  - Tools: nao framework reference, Claude Code hooks integration

- **AI Ecosystem**: Section 8.2 Domain-Specific Agent Frameworks
  - nao (Analytics Agents): Database-agnostic, built-in evaluation
  - Transposable patterns: context builder, evaluation hooks, DB integrations

- **Template**: Analytics Agent with Evaluation (5 files, ~1K lines)
  - README: setup, usage, troubleshooting
  - Agent: SQL generator with evaluation criteria, safety rules
  - Hook: automated metrics logging (safety, performance, errors)
  - Script: analysis with stats, safety reports, recommendations
  - Report template: monthly evaluation format

## Changed

- Agent Evaluation Guide: updated template references, verified links
- Landing Site: templates count 110 → 114
- Version: 3.23.5 → 3.24.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 11:52:13 +01:00
Florian BRUNIAUX
1fb783ebb8 release: v3.23.5 - Aristote claude-mem analysis prompt
Added comprehensive 6-phase analysis framework for claude-mem integration
targeting the Aristote project (EdTech, Next.js/tRPC/Prisma).

Key features:
- Audit current memory stack (Serena, grepai, 9 hooks)
- Cost/ROI estimation ($3.75/month API, 50-100x ROI factor)
- Integration plan with hooks synergy (activity-logger, serena-sync)
- Go/No-Go decision criteria
- Self-contained prompt template in claudedocs/prompts/

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 11:03:52 +01:00
Florian BRUNIAUX
d5c3a82cac docs: add claude-mem plugin documentation (automatic session memory)
Integrate claude-mem (thedotmack/claude-mem) into the guide as Section 8.2.5.
Score: 4/5 (High Value - automatic session capture fills documentation gap).

Added:
- Section 8.2.5: claude-mem plugin (automatic session memory)
  * Automatic capture via lifecycle hooks
  * AI compression + progressive disclosure (10x tokens)
  * Web dashboard at localhost:37777
  * Natural language search
  * Privacy controls (<private> tags)
  * Cost analysis ($0.15/100 obs)
  * AGPL-3.0 licensing considerations

- Memory Tools Decision Matrix (claude-mem vs Serena vs grepai)
  * 4-layer memory stack pattern
  * Integrated workflow examples
  * When to use automatic vs manual memory

- Plugin template: examples/plugins/claude-mem.md
  * Installation, configuration, troubleshooting
  * Advanced features (progressive disclosure, endless mode)
  * Export/import, cost optimization

- Resource evaluation: docs/resource-evaluations/claude-mem-evaluation.md
  * Technical analysis (fact-checked stats)
  * Comparison to existing tools
  * Integration recommendations

- reference.yaml: 14 new claude-mem entries

Changed:
- Updated search tools comparison (5 tools: rg, grepai, Serena, ast-grep, claude-mem)
- Extended feature matrix with "Auto capture" and "Web dashboard" rows

Stats (verified 2026-02-10):
- 26.5k GitHub stars, 1.8k forks
- 181 releases, 46 contributors
- Latest: v9.1.1 (Feb 7, 2026)
- License: AGPL-3.0 + PolyForm Noncommercial

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 08:47:17 +01:00
Florian BRUNIAUX
89084c89ec docs: integrate Anthropic 2026 Agentic Coding Trends Report
Integration strategy: cross-cutting distribution (~450 lines across 5 files)
instead of monolithic Section 9.21 (rejected after technical-writer review).

Evaluation: 4/5 score (high value, but lacks concrete code examples)
Source: https://resources.anthropic.com/hubfs/2026%20Agentic%20Coding%20Trends%20Report.pdf

Changes:
1. Created evaluation report (docs/resource-evaluations/)
   - Summary, gap analysis, challenge results, fact-check
   - Justification: industry validation, benchmarks, anti-patterns

2. Modified guide/ultimate-guide.md (3 insertions, ~270 lines)
   - Section 9 intro: Industry context callout with adoption data
   - Section 9.17 Multi-Instance: ROI benchmarks ($500-1K/month validation)
   - Section 9.11: Enterprise Anti-Patterns section (5 detailed patterns)

3. Modified guide/workflows/agent-teams.md (~80 lines)
   - Industry adoption data with case studies
   - Timeline: 3-6 months, success rates by phase
   - Real-world performance metrics (Fountain 50%, Rakuten 7h, TELUS 500K hours)

4. Modified machine-readable/reference.yaml (~40 lines)
   - Added agentic_trends_2026_* metadata section
   - Research data, case studies, benchmarks, anti-patterns references

5. Modified README.md (~8 lines)
   - Added "Research & Industry Reports" section
   - Link to Anthropic report with evaluation details

Stats validated: 60% AI usage, 0-20% full delegation, 67% more PRs/day,
27% new work, 7 case studies (Fountain, Rakuten, CRED, TELUS, Legora, Zapier, Augment).

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 17:18:52 +01:00
Florian BRUNIAUX
191ff42741 release: v3.23.4 - Agent Anti-Patterns & Scope-Focused Refactoring
Major conceptual refactoring based on Dex Horty's principle:
"Subagents are not for anthropomorphizing roles, they are for controlling context"

### Added (1 new section)
- Agent Anti-Patterns section (§9.17, line 3662)
  - Wrong vs Right table (anthropomorphizing vs context control)
  - When to use agents (context isolation, parallel processing, scope limitation)
  - When NOT to use agents (fake teams, roleplaying, mimicking org structure)

### Changed (18 files, 200+ lines)
- Section rename: "Split-Role Sub-Agents" → "Scope-Focused Agents"
- Agent definitions: "Specialized role" → "Context isolation tool"
- 8 custom agent examples refactored (guide + examples/agents/)
- 10+ prompt examples with explicit scope boundaries
- 4 workflow files updated (agent-teams, TDD, iterative refinement)
- Terminology replacements:
  * "Specialized agents" → "Scope-focused agents"
  * "Expert personas" → "Context boundaries"
  * "Multi-domain expertise" → "Multi-scope analysis"

### Fixed
- Methodologies: Clarification note for BMAD role-based naming

Breaking change: Conceptual shift from role-based to scope-based agent usage.
All examples now demonstrate context isolation instead of persona simulation.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 10:29:59 +01:00
Florian BRUNIAUX
94b9ad4f89 docs: update metadata and statistics for v3.9.9
Update CHANGELOG, README, and workflow docs to reflect latest additions.

Changes:
1. CHANGELOG.md
   - 4 new entries under [Unreleased] → Added:
     * llms.txt Standard Documentation (Section 9.18.4)
     * Background Tasks Workflow (Section 9.5)
     * Chrome DevTools MCP Server (mcp-servers-ecosystem.md)
     * Convention-Over-Configuration for AI (Section 9.18.1)
   - Each entry with line numbers, resources, gaps addressed

2. README.md statistics
   - Guide lines: 16K → 19K (actual: 18,820 lines)
   - Templates: 107 → 111 (verified count)
   - Evaluations: 55 → 56 (wasp-fullstack-essentials-eval.md)
   - Updated in 8 locations: hero, badges, mermaid diagram, structure

3. guide/workflows/agent-teams-quick-start.md
   - Improved task delegation clarity (Content Scope, Index Scope labels)
   - Enhanced context boundaries in multi-file operations

Statistics verification:
- Templates: find examples -type f | wc -l → 111
- Guide lines: wc -l guide/ultimate-guide.md → 18,820
- Evaluations: docs/resource-evaluations/README.md → 56
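
The shell checks above can also be reproduced in a script. This is a minimal Python sketch, assuming it runs from the repository root; the helper names `count_files` and `count_lines` are illustrative and not part of the repository.

```python
from pathlib import Path


def count_files(root: str) -> int:
    """Count regular files under root, roughly like `find root -type f | wc -l`.

    Symlinks are skipped because `find -type f` does not count them either.
    """
    return sum(
        1 for p in Path(root).rglob("*") if p.is_file() and not p.is_symlink()
    )


def count_lines(path: str) -> int:
    """Count newline characters in a file, which is what `wc -l` reports."""
    total = 0
    with open(path, "rb") as fh:
        # Read in fixed-size chunks so large guides do not load fully in memory.
        for chunk in iter(lambda: fh.read(1 << 16), b""):
            total += chunk.count(b"\n")
    return total
```

Calling `count_files("examples")` and `count_lines("guide/ultimate-guide.md")` would then be compared against the figures recorded in README.md.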

Related commits:
- 783c43b: llms.txt conceptual documentation
- 17846b1: Background tasks + Chrome DevTools + convention-over-config

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 10:07:12 +01:00
Florian BRUNIAUX
baebe23816 docs: integrate Anthropic update 2-9 Feb 2026 (Opus 4.6, auto-memories, fast mode)
Comprehensive integration of Claude Opus 4.6 features and v2.1.32-2.1.37
releases based on the Anthropic release watch (8 items evaluated, score 5/5).

## Critical Updates (Phase 1)

### Opus 4.6 + Adaptive Thinking
- Extended Thinking section: Adaptive thinking replaces budget-based approach
- API syntax: thinking: {type: "adaptive", effort: "low|medium|high|max"}
- Deprecated: budget_tokens parameter no longer functional
- 8 references updated: "Opus 4.5" → "Opus 4.6" contextually

### Pricing Model
- Three-tier structure: Standard ($5/$25), 1M Context Beta ($10/$37.50),
  Fast Mode ($30/$150) per 1M tokens
- Default model: Sonnet 3.5 → Sonnet 4.5
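
To make the three-tier structure concrete, here is a small Python sketch that estimates the cost of a request from the per-1M-token prices listed above. The tier keys and the `estimate_cost` helper are hypothetical names chosen for illustration; the prices are copied from this commit message and are not checked against the official pricing page.

```python
# USD per 1M tokens as (input, output), copied from the commit message above.
# The dictionary keys are illustrative labels, not official tier identifiers.
PRICES = {
    "standard": (5.00, 25.00),
    "context_1m_beta": (10.00, 37.50),
    "fast": (30.00, 150.00),
}


def estimate_cost(input_tokens: int, output_tokens: int, tier: str = "standard") -> float:
    """Return the estimated USD cost of one request under the given tier."""
    in_price, out_price = PRICES[tier]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000
```

With these figures, a request with 200,000 input tokens and 10,000 output tokens comes to $1.25 at the standard tier and $7.50 in fast mode, a 6x difference.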

### Commands Table
- Added: /fast (toggle fast mode, 2.5x speed, 6x price, persists)
- Added: /debug (systematic troubleshooting)

## High-Value Features (Phase 2)

### Auto-Memories (v2.1.32+)
- Automatic context capture: decisions, preferences, patterns, gotchas
- Opt-in, per-project, separate from CLAUDE.md
- Comparative table: Manual CLAUDE.md vs automatic memories

### Context Management
- "Summarize from here": Right-click message → selective summarization
- More precise than full /compact

### Agent Teams Enhancements
- Hook events: TeammateIdle, TaskCompleted (events table)
- Agent frontmatter: memory field for pre-populated context

### Security
- CVE table: Claude Code v2.1.34 sandbox bypass fix
- Recommendation: Upgrade to v2.1.34+ immediately

## Contextual Additions (Phase 3)

### Xcode Integration (Feb 2026)
- Xcode 26.3 RC+: Native Claude Agent SDK support
- Same harness as Claude Code CLI
- Clarified: Agent SDK ≠ Claude Code (shared framework)

### API Breaking Changes (Opus 4.6)
- Removed: assistant-prefill (use system prompts instead)
- New: Fast mode API (speed: "fast" + beta header)

### Corrections
- PowerPoint capability: "No PPTX" → "Limited (add-in only)"

## Documentation

### Fact-Checking (Perplexity Pro)
- Opus 4.6 pricing: Verified via pricepertoken.com, eesel.ai
- Adaptive thinking: Verified syntax, effort parameter, API examples
- Fast mode: Confirmed 2.5x speed, 6x cost, persistence
- Auto-memories: Validated opt-in, per-project behavior

### machine-readable/reference.yaml
- 24 new entries: auto-memories, fast mode, debug, Opus 4.6, hooks
- Updated: 2026-02-09

### Metrics
- 14 files modified across 3 phases
- 11 sections updated in ultimate-guide.md
- Confidence: High (official sources + Perplexity validation)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 10:02:47 +01:00
Florian BRUNIAUX
17846b1179 docs: complete Wasp fullstack essentials integration
Complete all 4 action items from wasp-fullstack-essentials-eval.md resource
evaluation (score 3/5). Framework-agnostic insights only, promotional content
excluded.

Changes (3 sections):

1. Background tasks workflow (Section 9.5)
   - New subsection: "Background Tasks for Fullstack Development"
   - When to background tasks (5 scenarios table)
   - Fullstack workflow pattern with examples
   - Context rot prevention strategies
   - Limitations and workarounds
   - Integration with teleportation
   - /tasks monitoring guide
   - ~100 lines added to Section 9.5 "Tight Feedback Loops"

2. Chrome DevTools MCP (mcp-servers-ecosystem.md)
   - New server entry in "Browser Automation" section
   - Official Anthropic server (not community)
   - Comparison table vs Playwright MCP (debugging vs testing)
   - Setup and configuration
   - Use cases and limitations
   - Updated stats: 3 browser servers (was 2), 6 official servers (was 5)
   - ~60 lines added to Browser Automation section

3. Convention-over-config for AI (Section 9.18.1)
   - New subsection: "Convention-Over-Configuration for AI Agents"
   - Why opinionated frameworks reduce agent cognitive load
   - Comparison table: custom vs opinionated architectures
   - Examples: Next.js, Rails, Phoenix, Django
   - Real-world impact on agent productivity
   - Trade-offs analysis
   - Connection to CLAUDE.md sizing (token reduction)
   - ~60 lines added to Section 9.18.1

Total additions: ~220 lines (workflow patterns + MCP server + AX framework)

Source evaluation: docs/resource-evaluations/wasp-fullstack-essentials-eval.md
Primary sources: llmstxt.org (llms.txt), official docs (background tasks, Chrome
DevTools MCP), existing Section 9.18 (Marmelab/AX framework)

Related commits:
- 783c43b: llms.txt conceptual documentation (completed earlier)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 10:00:53 +01:00