Commit graph

129 commits

Author SHA1 Message Date
Florian BRUNIAUX
d72905e9ba docs: integrate Entire CLI across guide (v3.27.0)
Major integration of Entire CLI, an agent-native platform launched
Feb 2026 by Thomas Dohmke (ex-GitHub CEO) with $60M funding. Provides
rewindable checkpoints, approval gates, and audit trails for AI sessions.

## Added (7 guide files + 3 meta files)

- **ai-traceability.md**: Replace git-ai 404 with Entire CLI (section 5.1)
- **third-party-tools.md**: Fill "Session replay" gap + add tool section
- **observability.md**: Add session portability alternative
- **ai-ecosystem.md**: Add governance-first orchestration (section 8.1.5)
- **ultimate-guide.md**: Enrich multi-instance section 9.17
- **security-hardening.md**: Add compliance audit trails (section 3.4)
- **cheatsheet.md**: Add Community Tools quick reference
- **README.md**: Update structure tree with third-party-tools mention
- **CHANGELOG.md**: Document v3.27.0 release
- **docs/resource-evaluations/entire-cli.md**: Formal evaluation (5/5)

## Fixed

- git-ai references (404 repo) replaced with working alternative
- "Session replay" Known Gap now marked as  FILLED

## Key Features Documented

- Rewindable checkpoints (prompts + reasoning + tool usage)
- Governance layer (approval gates, permissions, audit trails)
- Multi-agent handoffs (Claude → Gemini with context)
- Compliance-ready (SOC2, HIPAA, FedRAMP)
- Session portability (path-agnostic vs native --resume)

## Positioning

- vs git-ai: Replaces non-existent tool (404)
- vs claude-code-viewer: Active replay vs read-only history
- vs Gas Town: Governance sequential vs parallel coordination

Files modified: 10 (7 content + 3 meta)
Words added: ~2,500
Version: 3.26.0 → 3.27.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-12 23:33:16 +01:00
Florian BRUNIAUX
971a297db3 feat(security): add threat intelligence DB, security commands, and cheatsheet audit fixes (v3.26.0)
- Add threat-db.yaml v2.0.0 with 63 malicious skills, 22 CVEs, 4 campaigns
- Add /security-check, /security-audit, /update-threat-db slash commands
- Add Snyk ToxicSkills evaluation (58th resource evaluation)
- Fix cheatsheet: add Alt+T to keyboard shortcuts table, add /fast and /debug commands
- Update Features Meconnues table with Agent Teams and Auto-Memories
- Clean up cheatsheet.md.bak
- Bump version to 3.26.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-11 16:12:36 +01:00
Florian BRUNIAUX
ef7cdd899e release: v3.24.0 - Agent Evaluation Framework
Major addition: Complete agent evaluation framework with production-ready template.

## Added

- **Resource Evaluation**: nao framework (score 3/5)
  - Identified critical gap: agent evaluation not documented
  - Technical challenge adjusted score 2/5 → 3/5
  - All claims fact-checked (TypeScript 58.9%, Python 38.5%)

- **Guide Section**: Agent Evaluation (guide/agent-evaluation.md, ~3K tokens)
  - Metrics: response quality, tool usage, performance, satisfaction
  - Patterns: logging hooks, unit tests, A/B testing, feedback loops
  - Example: analytics agent with built-in metrics
  - Tools: nao framework reference, Claude Code hooks integration

- **AI Ecosystem**: Section 8.2 Domain-Specific Agent Frameworks
  - nao (Analytics Agents): Database-agnostic, built-in evaluation
  - Transposable patterns: context builder, evaluation hooks, DB integrations

- **Template**: Analytics Agent with Evaluation (5 files, ~1K lines)
  - README: setup, usage, troubleshooting
  - Agent: SQL generator with evaluation criteria, safety rules
  - Hook: automated metrics logging (safety, performance, errors)
  - Script: analysis with stats, safety reports, recommendations
  - Report template: monthly evaluation format

## Changed

- Agent Evaluation Guide: updated template references, verified links
- Landing Site: templates count 110 → 114
- Version: 3.23.5 → 3.24.0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 11:52:13 +01:00
Florian BRUNIAUX
1fb783ebb8 release: v3.23.5 - Aristote claude-mem analysis prompt
Added comprehensive 6-phase analysis framework for claude-mem integration
targeting the Aristote project (EdTech, Next.js/tRPC/Prisma).

Key features:
- Audit current memory stack (Serena, grepai, 9 hooks)
- Cost/ROI estimation ($3.75/month API, 50-100x ROI factor)
- Integration plan with hooks synergy (activity-logger, serena-sync)
- Go/No-Go decision criteria
- Self-contained prompt template in claudedocs/prompts/

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 11:03:52 +01:00
Florian BRUNIAUX
d5c3a82cac docs: add claude-mem plugin documentation (automatic session memory)
Integrate claude-mem (thedotmack/claude-mem) into the guide as Section 8.2.5.
Score: 4/5 (High Value - automatic session capture fills documentation gap).

Added:
- Section 8.2.5: claude-mem plugin (automatic session memory)
  * Automatic capture via lifecycle hooks
  * AI compression + progressive disclosure (10x tokens)
  * Web dashboard at localhost:37777
  * Natural language search
  * Privacy controls (<private> tags)
  * Cost analysis ($0.15/100 obs)
  * AGPL-3.0 licensing considerations

- Memory Tools Decision Matrix (claude-mem vs Serena vs grepai)
  * 4-layer memory stack pattern
  * Integrated workflow examples
  * When to use automatic vs manual memory

- Plugin template: examples/plugins/claude-mem.md
  * Installation, configuration, troubleshooting
  * Advanced features (progressive disclosure, endless mode)
  * Export/import, cost optimization

- Resource evaluation: docs/resource-evaluations/claude-mem-evaluation.md
  * Technical analysis (fact-checked stats)
  * Comparison to existing tools
  * Integration recommendations

- reference.yaml: 14 new claude-mem entries

Changed:
- Updated search tools comparison (5 tools: rg, grepai, Serena, ast-grep, claude-mem)
- Extended feature matrix with "Auto capture" and "Web dashboard" rows

Stats (verified 2026-02-10):
- 26.5k GitHub stars, 1.8k forks
- 181 releases, 46 contributors
- Latest: v9.1.1 (Feb 7, 2026)
- License: AGPL-3.0 + PolyForm Noncommercial

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 08:47:17 +01:00
Florian BRUNIAUX
9031c8fd05 sync: Claude Code v2.1.38 release
- Fixed VS Code terminal scroll-to-top regression
- Fixed Tab key queueing slash commands
- Fixed bash permission matching with env vars
- Security: Blocked .claude/skills writes in sandbox

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 08:29:07 +01:00
Florian BRUNIAUX
89084c89ec docs: integrate Anthropic 2026 Agentic Coding Trends Report
Integration strategy: diffusion transversale (~450 lines across 5 files)
instead of monolithic Section 9.21 (rejected after technical-writer review).

Evaluation: 4/5 score (high value, but lacks concrete code examples)
Source: https://resources.anthropic.com/hubfs/2026%20Agentic%20Coding%20Trends%20Report.pdf

Changes:
1. Created evaluation report (docs/resource-evaluations/)
   - Summary, gap analysis, challenge results, fact-check
   - Justification: validation industrie, benchmarks, anti-patterns

2. Modified guide/ultimate-guide.md (3 insertions, ~270 lines)
   - Section 9 intro: Industry context encadré with adoption data
   - Section 9.17 Multi-Instance: ROI benchmarks ($500-1K/month validation)
   - Section 9.11: Enterprise Anti-Patterns section (5 detailed patterns)

3. Modified guide/workflows/agent-teams.md (~80 lines)
   - Industry adoption data with case studies
   - Timeline: 3-6 months, success rates by phase
   - Real-world performance metrics (Fountain 50%, Rakuten 7h, TELUS 500K hours)

4. Modified machine-readable/reference.yaml (~40 lines)
   - Added agentic_trends_2026_* metadata section
   - Research data, case studies, benchmarks, anti-patterns references

5. Modified README.md (~8 lines)
   - Added "Research & Industry Reports" section
   - Link to Anthropic report with evaluation details

Stats validated: 60% AI usage, 0-20% full delegation, 67% more PRs/day,
27% new work, 7 case studies (Fountain, Rakuten, CRED, TELUS, Legora, Zapier, Augment).

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 17:18:52 +01:00
Florian BRUNIAUX
191ff42741 release: v3.23.4 - Agent Anti-Patterns & Scope-Focused Refactoring
Major conceptual refactoring based on Dex Horty's principle:
"Subagents are not for anthropomorphizing roles, they are for controlling context"

### Added (1 new section)
- Agent Anti-Patterns section (§9.17, line 3662)
  - Wrong vs Right table (anthropomorphizing vs context control)
  - When to use agents (context isolation, parallel processing, scope limitation)
  - When NOT to use agents (fake teams, roleplaying, mimicking org structure)

### Changed (18 files, 200+ lines)
- Section rename: "Split-Role Sub-Agents" → "Scope-Focused Agents"
- Agent definitions: "Specialized role" → "Context isolation tool"
- 8 custom agent examples refactored (guide + examples/agents/)
- 10+ prompt examples with explicit scope boundaries
- 4 workflow files updated (agent-teams, TDD, iterative refinement)
- Terminology replacements:
  * "Specialized agents" → "Scope-focused agents"
  * "Expert personas" → "Context boundaries"
  * "Multi-domain expertise" → "Multi-scope analysis"

### Fixed
- Methodologies: Clarification note for BMAD role-based naming

Breaking change: Conceptual shift from role-based to scope-based agent usage.
All examples now demonstrate context isolation instead of persona simulation.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 10:29:59 +01:00
Florian BRUNIAUX
17846b1179 docs: complete Wasp fullstack essentials integration
Complete all 4 action items from wasp-fullstack-essentials-eval.md resource
evaluation (score 3/5). Framework-agnostic insights only, promotional content
excluded.

Changes (3 sections):

1. Background tasks workflow (Section 9.5)
   - New subsection: "Background Tasks for Fullstack Development"
   - When to background tasks (5 scenarios table)
   - Fullstack workflow pattern with examples
   - Context rot prevention strategies
   - Limitations and workarounds
   - Integration with teleportation
   - /tasks monitoring guide
   - ~100 lines added to Section 9.5 "Tight Feedback Loops"

2. Chrome DevTools MCP (mcp-servers-ecosystem.md)
   - New server entry in "Browser Automation" section
   - Official Anthropic server (not community)
   - Comparison table vs Playwright MCP (debugging vs testing)
   - Setup and configuration
   - Use cases and limitations
   - Updated stats: 3 browser servers (was 2), 6 official servers (was 5)
   - ~60 lines added to Browser Automation section

3. Convention-over-config for AI (Section 9.18.1)
   - New subsection: "Convention-Over-Configuration for AI Agents"
   - Why opinionated frameworks reduce agent cognitive load
   - Comparison table: custom vs opinionated architectures
   - Examples: Next.js, Rails, Phoenix, Django
   - Real-world impact on agent productivity
   - Trade-offs analysis
   - Connection to CLAUDE.md sizing (token reduction)
   - ~60 lines added to Section 9.18.1

Total additions: ~220 lines (workflow patterns + MCP server + AX framework)

Source evaluation: docs/resource-evaluations/wasp-fullstack-essentials-eval.md
Primary sources: llmstxt.org (llms.txt), official docs (background tasks, Chrome
DevTools MCP), existing Section 9.18 (Marmelab/AX framework)

Related commits:
- 783c43b: llms.txt conceptual documentation (completed earlier)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 10:00:53 +01:00
Florian BRUNIAUX
783c43baed docs: add llms.txt conceptual documentation to Section 9.18
Add comprehensive llms.txt documentation based on Wasp fullstack essentials
resource evaluation (score 3/5). Sourced from llmstxt.org spec, not the
promotional article.

Changes:
- New section 9.18.4: Documentation Formats for Agents (llms.txt)
- Explains llms.txt standard, format, and use cases
- Clarifies complementarity with Context7 MCP (not opposition)
- Provides minimal and advanced examples with line numbers
- Integration patterns with CLAUDE.md
- References this repo's own llms.txt implementation
- Updated section numbering (9.18.4-9.18.11)
- Updated Section 9.18 TL;DR with new principle
- Added reference.yaml entries for llms.txt

Resource evaluation:
- File: docs/resource-evaluations/wasp-fullstack-essentials-eval.md
- Source: Wasp DevRel blog (framework-agnostic insights extracted)
- Score: 3/5 (partial integration, promotional content excluded)
- Gap identified: Embarrassing to have llms.txt file without explaining concept
- Primary source: llmstxt.org specification

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-09 09:46:50 +01:00
Florian BRUNIAUX
734a1cbef7 docs: add Zadig&Voltaire case study + awesome-claude-skills taxonomy
- Ecosystem: Zadig&Voltaire migration path (Cursor → Windsurf → Claude Code)
  with team productivity data (7 → 27 MRs/week, 4x acceleration)
  Source: Benjamin Calef, tech.zadig-et-voltaire.com (Feb 2026)
- Ecosystem: awesome-claude-skills (BehiSecc) - 62 skills across 12 categories
  Evaluation: 3/5, complementary to awesome-claude-code

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-08 15:34:46 +01:00
Florian BRUNIAUX
a68a1bd52b docs: add Agent Teams Quick Start Guide (v3.23.2)
Added practical 8-10 min guide for using agent teams in real projects.

Added:
- guide/workflows/agent-teams-quick-start.md (580 lines)
  - 5-minute setup walkthrough
  - 4 copy-paste patterns (Guide + RTK projects)
  - Decision matrix (10+ scenarios)
  - Success metrics framework
  - Red flags section

Updated:
- guide/workflows/agent-teams.md: Link to quick start
- guide/ultimate-guide.md: Section 9.20 with quick start link
- machine-readable/reference.yaml: agent_teams_quick_start entry
- CHANGELOG.md: Release v3.23.2
- VERSION: 3.23.1 → 3.23.2

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-08 11:47:40 +01:00
Florian BRUNIAUX
36880caf29 docs: update Claude Code releases to v2.1.37
- Update latest tracked version: v2.1.33 → v2.1.37
- v2.1.36: Fast mode now available for Opus 4.6
- v2.1.34: Security fix for sandbox-excluded commands bypass
- v2.1.37: Fixed /fast availability after /extra-usage
- Fix reference.yaml line numbers for architecture.md

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-08 11:25:05 +01:00
Florian BRUNIAUX
4c0e4b6ac6 docs: integrate Gur Sannikov ADR workflow + native capabilities audit (4/5)
- Add ADR-Driven Development pattern to methodologies.md (~60 lines)
  - Pattern: ADR → skill → native execution
  - Example ADR template (database migration)
  - Complete bash workflow with benefits
- Add Native Capabilities Audit checklist to architecture.md (~50 lines)
  - 11 native capabilities with internal links
  - Onboarding tip for comprehension audit
- Add Dynamic Model Switching pattern to cheatsheet.md (~40 lines)
  - Pattern: Sonnet → Opus → Sonnet
  - Cost comparison table and best practices
- Add Community Validation to architecture.md (~15 lines)
  - External validation of 'less scaffolding, more model' approach
  - Cursor power user adopting Agent Skills standard
- Track evaluation in docs/resource-evaluations/ (full methodology)
- Update evaluations count: 24 → 55 (README + reference.yaml)
- Update CHANGELOG.md with integration details

Source: https://www.linkedin.com/posts/gursannikov_claudecode-embeddedengineering-aiagents-activity-7423851983331328001-DrFb
Score: 4/5 (HIGH VALUE) - fills ADR workflow gap + onboarding checklist

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-07 16:12:53 +01:00
Florian BRUNIAUX
b48d95c024 feat: add agent/skill quality audit tooling + Grenier evaluation
AUDIT TOOLING (3 templates):
- Command: /audit-agents-skills (quick project audits)
  - 16-criteria framework (Identity 3x, Prompt 2x, Validation 1x, Design 2x)
  - Weighted scoring: 32 pts (agents/skills), 20 pts (commands)
  - Production grading (A-F, 80% threshold)
  - Fix mode with actionable suggestions
- Skill: audit-agents-skills (advanced audits)
  - 3 modes: Quick (top-5), Full (all 16), Comparative (vs templates)
  - JSON + Markdown output for CI/CD
- Scoring grids: criteria.yaml (externalized for reuse)

EVALUATION:
- Grenier agent/skill quality (3/5 - Moderate Value)
  - Gap: 29.5% deploy without evaluation (LangChang 2026)
  - Integration: Created audit command + skill + criteria
  - Industry context: 18% cite agent bugs as top challenge

DOCUMENTATION:
- Guide refs: 2 strategic call-outs (after Agent/Skill validation)
- CHANGELOG: New "Added" section + evaluation details
- README: Templates 106→107, Evaluations 49→24 (count corrections)
- reference.yaml: 10 new audit entries + updated counts

SYNC:
- Landing index.html: Templates 107, Evals 24, Quiz 257
- Landing examples/index.html: Templates 107

FILES: 14 changed, 4148 insertions (+1250 lines new audit content)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-07 15:40:18 +01:00
Florian BRUNIAUX
99cca4f498 docs: update Claude Code releases to v2.1.33 + guide v3.23.1
Claude Code v2.1.33 (2026-02-06):
- Agent teams fixes (tmux sessions, availability warnings)
- TeammateIdle and TaskCompleted hook events for multi-agent workflows
- Agent frontmatter: memory field (user/project/local), Task(agent_type) syntax
- Plugin name in skill descriptions and /skills menu
- VSCode: Remote sessions, branch/message count in session picker
- Multiple stability fixes and improved error messages

Guide v3.23.1:
- Updated Claude Code releases tracking (YAML + Markdown)
- Version synced across all documentation files

Files updated:
- machine-readable/claude-code-releases.yaml (latest: 2.1.33)
- guide/claude-code-releases.md (v2.1.33 section + footer date)
- CHANGELOG.md ([3.23.1] entry)
- VERSION (3.23.0 → 3.23.1)
- Auto-synced: README.md, cheatsheet.md, ultimate-guide.md, reference.yaml

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-06 11:07:52 +01:00
Florian BRUNIAUX
c81180aec7 feat: adaptive onboarding architecture v2.0.0 (v3.23.0)
Major overhaul of onboarding system with adaptive topic selection based on
user context and keywords. Addresses 8 critical gaps identified by technical-
writer agent challenge.

Core Changes:
- Adaptive matrix: core topics (always) + adaptive topics (keyword-triggered)
- Security-first: moved sandbox_native_guide to beginner_5min (before commands)
- Time budget validation: all 18 profiles validated at 6-8 min/topic
- Quiz integration: positioned as exit activity in Phase 4 wrap-up
- New learn_security goal with 2 profiles (beginner_15min, advanced_60min)

Technical Improvements:
- Added onboarding_matrix_meta for version tracking and maintenance triggers
- Created validation script (validate-onboarding.sh) with 6 automated checks
- Created automation script (detect-new-onboarding-topics.sh) for monthly reviews
- Fixed 8 missing deep_dive keys (rules, workflow, fix, architecture, etc.)
- Removed duplicate deep_dive section causing validation failures

Documentation:
- README.md: version 3.23.0, harmonized counts (106 templates, 49 evaluations)
- CHANGELOG.md: comprehensive v3.23.0 entry with all changes
- Onboarding-prompt.md: updated Phase 1.5, 2, 4 with adaptive logic
- Reference.yaml: 180+ lines added for adaptive architecture

Validation:
- All 18 profiles pass time budget constraints (30-50% buffer maintained)
- All deep_dive keys verified (no missing references)
- Version synchronized across 6 files via sync-version.sh

Challenge: technical-writer agent identified 8 gaps in initial analysis
Result: Full adaptive approach implemented, all gaps addressed

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-05 22:19:58 +01:00
Florian BRUNIAUX
de4b438a72 docs: update Claude Code releases to v2.1.32 + guide v3.22.1
Claude Code v2.1.32 (2026-02-05):
-  Opus 4.6 now available
-  Agent teams research preview (experimental, token-intensive)
-  Automatic memory recording and recall
- "Summarize from here" for partial conversation summaries
- Skills auto-load from .claude/skills/ in --add-dir directories
- Fixes: @ file completion paths, bash heredoc JS template literals
- [VSCode] Slash commands execution fix, conversation loading spinner

Guide updates:
- Bumped guide version: 3.22.0 → 3.22.1 (minor)
- Updated CHANGELOG with v2.1.32 highlights
- Synced version across all documentation files
- Updated machine-readable tracking (YAML + Markdown)

Synchronization:
- Landing version badge updated: v2.1.31 → v2.1.32
- Landing footer version: v3.21.1 → v3.22.1
- Landing releases timeline updated with Opus 4.6 features
- All sync checks passing (templates, quiz, lines, stars)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-05 21:04:55 +01:00
Florian BRUNIAUX
f830b1c836 chore: bump version to 3.22.0
Release dual-instance planning pattern (Jon Williams).

## Changes

- VERSION: 3.21.1 → 3.22.0
- CHANGELOG.md: Move [Unreleased] → [3.22.0] - 2026-02-04
  - Dual-instance planning pattern entry (~15 bullet points)
  - Documentation, Added section, full integration details
- Synced across: README.md, guide/cheatsheet.md, guide/ultimate-guide.md, reference.yaml

## Release Content

**Section 9.17.1**: Alternative Pattern: Dual-Instance Planning (~350 lines)
**Workflow**: guide/workflows/dual-instance-planning.md (~750 lines)
**Evaluation**: docs/resource-evaluations/jon-williams-dual-instance-pattern.md (~300 lines)
**References**: 15 new entries in reference.yaml

## Stats

- Total lines added: ~1,400
- Files modified: 10
- Evaluation score: 4/5 (High Value)
- Pattern: Vertical separation (planner ↔ implementer)
- Target audience: Solo devs, spec-heavy work, $100-200/month

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-04 10:41:55 +01:00
Florian BRUNIAUX
9c5d030b11 docs: add dual-instance planning pattern (Jon Williams)
Add vertical separation pattern (planner/implementer) as complement to horizontal scaling (Boris pattern).

## Changes

**Main guide (ultimate-guide.md)**:
- New Section 9.17.1: "Alternative Pattern: Dual-Instance Planning" (~350 lines)
  - When to use (solo devs, spec-heavy, $100-200/month)
  - Setup instructions (2 Claude instances, Plans/ directory)
  - Complete workflow (5 phases: planning, review, implementation, verification, archive)
  - Comparison table (Boris horizontal vs Jon vertical scaling)
  - Cost analysis (2 instances vs correction loops)
  - Agent-ready plan best practices
  - Limitations and tips

**Workflow file (workflows/dual-instance-planning.md)**:
- Full workflow guide (~750 lines)
- Complete example (JWT auth implementation)
- Plan template (ready to copy-paste)
- Cost breakdown and decision matrix
- Troubleshooting and bash aliases

**References updated**:
- machine-readable/reference.yaml: 15 new entries
  - dual_instance_planning, dual_instance_workflow, etc.
  - Line numbers, source attribution, metadata
- guide/workflows/plan-driven.md: Link in See Also section
- README.md: Update evaluation count (46 → 47)

**Evaluation documented**:
- docs/resource-evaluations/jon-williams-dual-instance-pattern.md
  - Full methodology (fetch, analyze, challenge, fact-check)
  - Score progression (2-3/5 → 4/5 after technical-writer challenge)
  - Gap analysis, comparison, integration rationale

## Source
LinkedIn post by Jon Williams (Product Designer, UK)
Date: 2026-02-03
URL: https://www.linkedin.com/posts/thatjonwilliams_ive-been-using-cursor-for-six-months-now-activity-7424481861802033153-k8bu

Context: Transition from Cursor (6 months) to Claude Code with Opus 4.5
Pattern: Vertical separation (Claude Zero: planning/review, Claude One: implementation)
Distinction: Orthogonal to Boris pattern (vertical vs horizontal scaling)

## Stats
- Lines added: ~1,400
- Files modified: 4
- Files created: 2 (workflow + evaluation)
- References added: 15 (reference.yaml)
- Evaluation score: 4/5 (High Value)
- Integration time: ~2.5 hours

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-04 10:38:10 +01:00
Florian BRUNIAUX
bc6b32ed3a docs: update Claude Code releases to v2.1.31
- Update latest tracked version: v2.1.29 → v2.1.31
- Add v2.1.31 highlights: Session resume hint, PDF/bash/LSP fixes, system improvements
- Add v2.1.30 highlights: PDF pages support (pages parameter), OAuth pre-config (Slack), /debug command, git flags, task metrics
- Update dates: 2026-02-02 (v2.1.30), 2026-02-03 (v2.1.31)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-04 09:59:39 +01:00
Florian BRUNIAUX
b15647d57f docs: add Git MCP Server (Official) comprehensive documentation
Integration:
- New section "Version Control (Official Servers)" in mcp-servers-ecosystem.md (~1600 words)
- Decision matrix: Git MCP vs GitHub MCP vs Bash tool (11 operations)
- 12 tools documented with setup, config, use cases, limitations
- Resource evaluation file created (git-mcp-server-evaluation.md)
- Machine-readable index updated (11 new entries)
- Evaluation count corrected: 36 → 46 (actual file count)

Score: 5/5 (CRITICAL) after technical-writer challenge
Gap filled: Official Git server 0% documented → 100% comprehensive

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-03 17:20:12 +01:00
Florian BRUNIAUX
975b8019ac feat: add 4 ClaudeKit-inspired hooks (checkpoint, validation, file-guard)
- Add auto-checkpoint.sh (Stop event, git stash automation)
- Add typecheck-on-save.sh (PostToolUse, TypeScript validation)
- Add test-on-change.sh (PostToolUse, smart test detection)
- Add file-guard.sh (PreToolUse, unified file protection)
- Add ClaudeKit evaluation (3/5, patterns extracted)
- Version bump 3.21.0 → 3.21.1 (sync across all docs)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-02 21:50:48 +01:00
Florian BRUNIAUX
6910c06981 docs: add Native Sandboxing comprehensive documentation (v3.21.1)
Integration of official Anthropic sandboxing docs (5/5 CRITICAL):

Created (5 files):
- guide/sandbox-native.md (~3K words): Complete technical reference
  * OS primitives (Seatbelt/bubblewrap), filesystem/network isolation
  * Sandbox modes, escape hatch, security limitations
  * Decision trees, config examples, troubleshooting
- docs/resource-evaluations/native-sandbox-official-docs.md (5/5 score)
- examples/config/sandbox-native.json (production config)
- examples/commands/sandbox-status.md (sandbox inspection)
- examples/hooks/bash/sandbox-validation.sh (prod validation)

Updated (5 files):
- guide/sandbox-isolation.md: Section 4 "Native Claude Code Sandbox"
  * Comparison Native vs Docker (process-level vs microVM)
  * Updated TL;DR, comparison matrix, decision tree
- guide/architecture.md: Native Sandbox sub-section in Security Model
- machine-readable/reference.yaml: +24 sandbox entries
- VERSION: 3.21.0 → 3.21.1
- README.md: Templates 100→103, Evaluations 44→45
- CHANGELOG.md: v3.21.1 entry

Closes critical security documentation gap (~1800 words missing).
Fact-checked 100%, agent-challenged (technical-writer), production-ready.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-02 20:24:17 +01:00
Florian BRUNIAUX
18cc838dcd chore: bump version to 3.21.0 and update documentation
Version bump: 3.20.9 → 3.21.0

## Changes

- VERSION: Updated to 3.21.0
- CHANGELOG.md: Added [3.21.0] section with configuration management release notes
- README.md: Updated version footer and resource evaluations count (41 → 44)
- guide/cheatsheet.md: Updated version and date (January → February 2026)
- guide/ultimate-guide.md: Synced version references
- machine-readable/reference.yaml: Updated version and date

## Version Sync

All version references synchronized using ./scripts/sync-version.sh

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-02 18:25:09 +01:00
Florian BRUNIAUX
0630fcd883 feat: add configuration management and MCP secrets workflows (closes #16204)
Major additions to address critical gaps in Claude Code configuration:

## New Documentation Sections

1. Section 3.2.1 "Version Control & Backup" (guide/ultimate-guide.md:4085)
   - Configuration hierarchy: global → project → local
   - Git strategy for ~/.claude (symlinks approach)
   - Backup strategies: Git remote, cloud sync, cron
   - Multi-machine sync workflows
   - Disaster recovery procedures
   - Documented .claude/settings.local.json (previously undocumented)

2. Section 8.3.1 "MCP Secrets Management" (guide/ultimate-guide.md:8113)
   - Three practical approaches: OS Keychain, .env, Secret Vaults
   - Secrets rotation workflow
   - Pre-commit secret detection
   - Verification checklist
   - Best practices summary

## New Templates

1. sync-claude-config.sh (examples/scripts/)
   - Commands: setup, sync, backup, restore, validate
   - .env parsing + envsubst for variable substitution
   - Git repo creation with symlinks
   - Validation checks (secrets not in Git)

2. pre-commit-secrets.sh (examples/hooks/bash/)
   - Detects 10+ secret patterns (OpenAI, GitHub, AWS, etc.)
   - Whitelist system for false positives
   - Clear error messages with remediation steps

3. settings.local.json.example (examples/config/)
   - Machine-specific overrides template
   - Example use cases and patterns

## Resource Evaluation

- Added docs/resource-evaluations/ratinaud-config-management-evaluation.md
- Score: 5/5 (CRITICAL)
- Validated via 3 Perplexity searches + technical-writer agent challenge
- Community demand: GitHub #16204 + brianlovin/claude-config

## Updated References

- machine-readable/reference.yaml: 22 new entries
- Configuration management sections
- MCP secrets workflows
- Community resources (Ratinaud, brianlovin, GitHub issue)

## Impact

- Security: Pre-commit hook prevents secret leaks
- Productivity: Multi-machine sync reduces manual reconfig
- Team coordination: Onboarding workflow for ~/.claude setup
- Disaster recovery: Backup/restore strategies documented

Credits:
- Martin Ratinaud (504 sessions, LinkedIn post)
- brianlovin/claude-config (community example)
- GitHub Issue #16204 (community request)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-02 18:17:42 +01:00
Florian BRUNIAUX
5b69db64a9 docs: add Alan Tour Eiffel paradigm evaluation (5/5 CRITICAL)
Integration of Alan Engineering team's paradigm shift framework:
- Tour Eiffel Principle (transformation vs acceleration)
- Ralph Wiggum Programming (agentic loops)
- Verification Paradox (automated guardrails over human review)

Files added:
- docs/resource-evaluations/alan-tour-eiffel-paradigm.md (291 lines)

Files modified:
- guide/production-safety.md: New Rule 7 "Verification Paradox"
- guide/ai-ecosystem.md: Added practitioner insight (line 2133)
- machine-readable/reference.yaml: Added Alan + verification paradox entries
- README.md: Fixed evaluation counters (37/35/38 → 41)

Source: https://www.linkedin.com/pulse/le-principe-de-la-tour-eiffel-et-ralph-wiggum-maxime-le-bras-psmxe/
Authors: Charles Gorintin (CTO Alan), Maxime Le Bras (Talent Lead)
Published: 2026-02-02

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-02 14:21:51 +01:00
Florian BRUNIAUX
4d71fdc31d docs: update Claude Code releases (v2.1.27 → v2.1.29)
Updates:
- Latest version: v2.1.29 (2026-01-31)
- Updated date: 2026-02-02
- Added v2.1.29: Performance fixes for session resumption
- Corrected v2.1.27 date: 2026-01-29 (was incorrectly 2026-01-31)
- Enhanced v2.1.27 highlights: Windows/VSCode fixes

Source: github.com/anthropics/claude-code/CHANGELOG.md

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-02 09:24:19 +01:00
Florian BRUNIAUX
0bb116cd2c docs: add Multi-IDE Configuration Sync pattern (v3.20.9)
Added comprehensive documentation for maintaining consistent AI instructions
across multiple coding tools (Claude Code, Cursor, Copilot, etc.).

Changes:
- guide/ai-ecosystem.md: New "Multi-IDE Configuration Sync" section (L1256-1329)
  - Problem statement with config file comparison table
  - Solution 1: Native @import for Claude Code (recommended)
  - Solution 2: Script-based generation for multi-IDE teams (bash example)
  - AGENTS.md support status clarification + workaround (symlink)
  - Compatibility matrix with external tool references
- machine-readable/reference.yaml: Added 2 index entries
  - ai_ecosystem_multi_ide_sync: guide/ai-ecosystem.md:1256
  - agents_md_support_status: guide/ai-ecosystem.md:1322
- CHANGELOG.md: Documented addition in [Unreleased] section
- VERSION: Bumped from 3.20.8 to 3.20.9 (patch)
- Auto-synced version across README, cheatsheet, ultimate-guide

Total: +91 insertions, -8 deletions (74 lines net content addition)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-02 09:04:00 +01:00
Florian BRUNIAUX
d5375e32a5 docs: add 2 resource evaluations (Osmani LinkedIn + Beyond Vibe Coding)
Added:
- Resource Evaluation: Addy Osmani LinkedIn Post (scored 2/5, Marginal)
  - Post about Anthropic study (17% comprehension gap)
  - 100% overlap with Shen & Tamkin 2026 already documented
  - Decision: Tracking mention only (mainstream diffusion timeline)
  - New criterion: "Influencer Amplification" pattern documented

- Resource Evaluation: "Beyond Vibe Coding" Book (scored 3/5, Pertinent)
  - Comprehensive O'Reilly book by Addy Osmani
  - 90% overlap analysis (10/14 topics covered 100%)
  - Decision: Minimal integration (tracking mention + cross-refs)
  - Cross-validation with 2 Osmani articles already integrated

Updated:
- CHANGELOG.md: [Unreleased] section with detailed entries
- README.md: Resource evaluations count (36 → 38 assessments)

Files created:
- docs/resource-evaluations/addy-osmani-linkedin-anthropic-study.md
- docs/resource-evaluations/beyond-vibe-coding.md
- docs/resource-evaluations/nick-tune-feedback-loops.md

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-01 23:30:03 +01:00
Florian BRUNIAUX
fdee3305c5 docs: RTK documentation update - upstream + fork integration
- Update guide/ultimate-guide.md: RTK section (l.11084-11174)
  - Two repositories: upstream (stable) + fork (extended features)
  - Fork features: vitest, pnpm, prisma, gain, discover
  - Bug fixes documented (grep/ls fixed in fork)
  - Installation options: cargo, fork, binary

- Add guide/third-party-tools.md: RTK card (l.86)
  - Comparison upstream vs fork
  - Token savings: 70-90% depending on stack
  - Cross-reference to ultimate-guide Section 9

- Update machine-readable/reference.yaml:
  - rtk_upstream + rtk_fork_extended (two repos)
  - third_party_tools_rtk entry added
  - Line numbers updated

- Update docs/resource-evaluations/rtk-evaluation.md:
  - UPDATE 2026-02-01 section with fork comparison
  - Fork features table (JS/TS stack support)
  - Installation instructions for fork

Total: 4 files, ~320 lines modified

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-01 22:20:43 +01:00
Florian BRUNIAUX
9f9f735436 release: v3.20.7 - Addy Osmani spec-writing integration
Version Bump:
- VERSION: 3.20.6 → 3.20.7
- CHANGELOG.md: New [3.20.7] section with Osmani integration details
- Synced across: README.md, cheatsheet.md, ultimate-guide.md, reference.yaml

Release Highlights:
- Addy Osmani spec-writing evaluation (4/5, High Value)
- 4 new sections in spec-first.md (Modular Design, Operational Boundaries, Command Templates, Anti-Monolithic)
- 180 lines added to spec-first.md (327 → 507)
- 8 new reference.yaml entries
- 1 new resource evaluation (36 total)

Integration Details (see CHANGELOG.md):
- Modular spec design pattern (split >200 line CLAUDE.md files)
- Always/Ask First/Never operational boundaries framework
- Command spec templates with error handling
- Anti-pattern: monolithic CLAUDE.md context pollution

Landing Sync: Ready for deployment (version updated in separate commit)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-01 21:48:29 +01:00
Florian BRUNIAUX
a5942f1c53 docs: add Addy Osmani spec-writing evaluation (4/5) + spec-first.md sections
Integration of "How to write a good spec for AI agents" by Addy Osmani:

Evaluation (docs/resource-evaluations/addy-osmani-good-spec.md):
- Score: 4/5 (High Value - Integrate within 1 week)
- Fills gaps: modular design, operational boundaries, command specs
- Fact-checked: credentials verified via Perplexity, all claims sourced
- Challenge phase: technical-writer agent corrected initial 3/5 → 4/5

Spec-First Workflow Updates (guide/workflows/spec-first.md):
- NEW: "Modular Spec Design" section (~50 lines, line 322)
  Pattern: Split large specs into focused files (CLAUDE-[domain].md)
- NEW: "Operational Boundaries" section (~60 lines, line 372)
  Three-tier system: Always/Ask First/Never → maps to Claude Code modes
- NEW: "Command Spec Template" section (~40 lines, line 432)
  Executable command specs with expected outputs & error handling
- NEW: "Anti-Pattern: Monolithic CLAUDE.md" section (~30 lines, line 472)
  Explains cognitive load problem (>200 lines = context pollution)

Reference Index (machine-readable/reference.yaml):
- 8 new entries: spec_first_workflow → spec_osmani_score
- Links to new spec-first.md sections with line numbers
- Source attribution: https://addyosmani.com/blog/good-spec/

Public Facing (README.md):
- Incremented resource evaluations count: 35 → 36

File growth: spec-first.md 327 → 507 lines (+180)
Source: Addy Osmani (former Chrome team, 14y), published Jan 13, 2026

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-01 21:30:34 +01:00
Florian BRUNIAUX
bc86c8ed7f release: v3.20.6 - agentskills.io integration + 4 resource evaluations
- agentskills.io open standard: frontmatter table, skills-ref CLI, portability section
- Agent Skills supply chain risks (security-hardening.md §1.2)
- anthropics/skills (60K+★) added to complementary resources
- 16 new reference.yaml entries
- Resource evaluations: agentskills.io (4/5), Skill Doctor (2/5), dclaude (new), paddo (new)
- Sandbox isolation + README updates

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-01 16:49:33 +01:00
Florian BRUNIAUX
59ba1ae174 release: v3.20.5 - 4 new ASCII diagrams (visual-reference.md)
- Add visual-reference.md with 20 ASCII diagrams (16 existing + 4 new)
  - #17 TDD Red-Green-Refactor Cycle
  - #18 UVAL Protocol Flow
  - #19 Security 3-Layer Defense
  - #20 Secret Exposure Timeline
- Add Visual Reference to README Core Documentation table
- Version sync: 3.20.4 → 3.20.5

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-31 23:14:41 +01:00
Florian BRUNIAUX
a7962d5560 release: v3.20.4 - 30 new quiz questions (227 → 257)
Add 30 questions across 11 categories covering recent guide content:
Advanced Patterns (+8), MCP Servers (+3), Architecture (+3),
Reference (+3), Hooks (+2), Learning (+2), Security (+2),
AI Ecosystem (+3), Memory (+2), Agents (+1), Privacy (+1).

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-31 22:17:32 +01:00
Florian BRUNIAUX
2fcc65e222 release: v3.20.3 - Competitive Analysis Gaps (claudelog.com)
9 gaps filled from claudelog.com competitive analysis (313 pages):
- Section 9.19: Permutation Frameworks (CLAUDE.md-driven variation testing)
- Skeleton Projects workflow (guide/workflows/skeleton-projects.md)
- Task Lists as Diagnostic Tool (instruction clarity mirror)
- Rev the Engine + Mechanic Stacking (multi-round planning patterns)
- Split-Role Sub-Agents (multi-perspective analysis)
- "You Are the Main Thread" mental model (CPU scheduler analogy)
- Continuous Context Update (proactive CLAUDE.md enrichment)
- Smart Hook Dispatching (single entry point routing to handlers)

Guide: 15,771 → 16,293 lines (+522)
New workflow: skeleton-projects.md (208 lines)
Reference: +17 new entries in reference.yaml

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-31 21:55:12 +01:00
Florian BRUNIAUX
950370e81b release: v3.20.2 - Sandbox Isolation for Coding Agents
New guide file covering Docker Sandboxes (microVM isolation),
cloud alternatives (Fly.io Sprites, E2B, Vercel, Cloudflare),
safe autonomy workflows, and comparison matrix.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-31 19:08:25 +01:00
Florian BRUNIAUX
3ab6fe3ef5 docs: track Claude Code v2.1.27 release
--from-pr flag for PR-linked sessions, auto PR linking via gh pr create,
Bedrock/Vertex gateway fixes.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-31 16:19:51 +01:00
Florian BRUNIAUX
22f2b91b83 docs: integrate Contribution Metrics blog (4/5) - Anthropic Jan 2026 data
New subsection in ultimate-guide.md with +67% PRs merged and 70-90%
AI-assisted code metrics. Separate from Aug 2025 study (different
methodology: PR-based vs self-reported). ROI cross-reference added.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-30 23:34:15 +01:00
Florian BRUNIAUX
26ee4ef894 release: v3.20.1 - Vercel AGENTS.md vs Skills evaluation
- New resource evaluation (025): Vercel blog on eager context vs lazy
  skill invocation (Gao, Jan 2026). Score 3/5, 13/13 fact-checked.
- Guide: added 8KB compression benchmark to CLAUDE.md sizing (line 3527)
- Guide: added 56% skill invocation warning to Memory Loading (line 4082)
- Guide: added invocation reliability caveat to skills.sh trade-offs
- Version sync 3.20.0 → 3.20.1

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-30 21:45:14 +01:00
Florian BRUNIAUX
fd4550cbd3 release: v3.20.0 - Multi-Agent Code Review Automation
Integration of production-grade PR review patterns from Pat Cullen + Méthode Aristote.

New Features:
- Resource evaluation: Pat Cullen Final Review (5/5 - Critical)
- Enhanced /review-pr: +150 lines with Advanced Multi-Agent Review section
- Enhanced code-reviewer agent: +219 lines with anti-hallucination rules
- New workflow: Review Auto-Correction Loop in iterative-refinement.md
- Production example: Multi-Agent Code Review in ultimate-guide.md
- Reference updates: +3 entries (review_pr_advanced, review_anti_hallucination, review_auto_fix_loop)

Key Patterns:
- 3 specialized agents: Consistency, SOLID, Defensive Code Auditor
- Pre-flight check: git log Co-Authored-By detection
- Anti-hallucination: Grep/Glob verification before suggestions
- Severity classification: 🔴 Must Fix / 🟡 Should Fix / 🟢 Can Skip
- Convergence loop: review → fix → re-review (max 3 iterations)
- Conditional context loading: stack-agnostic decision table

Design Principles:
- Enrich existing files (no fragmentation)
- No breaking changes (review-pr.md template simple preserved)
- Complete attribution (Pat Cullen + Méthode Aristote with links)
- Audience-aware (beginner → advanced progression)

Files Modified:
- CHANGELOG.md, VERSION: bumped to 3.20.0
- docs/resource-evaluations/017-pat-cullen-final-review.md: NEW (120 lines)
- examples/commands/review-pr.md: 80 → 230 lines (+150)
- examples/agents/code-reviewer.md: 72 → 291 lines (+219)
- guide/workflows/iterative-refinement.md: 389 → 522 lines (+133)
- guide/ultimate-guide.md: +28 lines (Production Example section)
- machine-readable/reference.yaml: +3 entries
- README.md, guide/cheatsheet.md: version sync

Total: +537 insertions, 0 deletions (no breaking changes)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-30 16:07:09 +01:00
Florian BRUNIAUX
8b58f014e7 docs: add Addy Osmani 80% problem to Practitioner Insights
Add Addy Osmani (Google Chrome Team) article "The 80% Problem in
Agentic Coding" to AI Ecosystem Practitioner Insights section.

Changes:
- guide/ai-ecosystem.md: Add 32-line entry after Steinberger (~line 2024)
  * "80% problem" framework and comprehension debt concept
  * Three new failure modes (overengineering, assumption propagation, sycophantic)
  * Productivity paradox data (+98% PRs, +91% review time)
  * Alignment table mapping to existing guide sections
  * Transparent note: "secondary synthesis, primary sources documented"

- machine-readable/reference.yaml: Add 4 new references
  * practitioner_addy_osmani, practitioner_osmani_source
  * eighty_percent_problem, comprehension_debt_secondary

- docs/resource-evaluations/024-addy-osmani-80-percent-problem.md: Complete evaluation
  * Score: 3/5 (Pertinent) - downgraded from initial 4/5 after technical-writer challenge
  * Minimal integration (32 lines vs rejected 250 lines)
  * Fact-check: 6 stats verified, 1 Stack Overflow stat incorrect
  * Rationale: 90% overlap with existing content (Vibe Coding Trap, Trust Calibration)

- CHANGELOG.md: Document addition in v3.19.0

Decision: Minimal integration approach chosen to avoid duplication while
recognizing value of synthesis from respected author. Article aggregates
existing research already cited in guide with primary sources.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-30 12:32:38 +01:00
Florian BRUNIAUX
7df11b224f release: v3.18.2 - Steinberger Practitioner Insight
Add Peter Steinberger (PSPDFKit Founder, Moltbot Creator) to Practitioner
Insights with model-agnostic workflow patterns.

Changes:
- Add Steinberger entry in guide/ai-ecosystem.md (stream monitoring,
  multi-project juggling, fresh context validation, iterative exploration)
- Complete evaluation in docs/resource-evaluations/steinberger-inference-speed.md
  (score 3/5, fact-checked GPT-5.2, validated credentials)
- Update docs/resource-evaluations/README.md (15→16 evaluations)
- Add practitioner_steinberger references in machine-readable/reference.yaml
- Version bump 3.18.1 → 3.18.2 (VERSION + sync all docs)
- Update CHANGELOG.md with detailed v3.18.2 entry
- Update README.md evaluations count (22→25)

Scope: Model-agnostic patterns only, zero model comparisons.
Source: https://steipete.me/posts/2025/shipping-at-inference-speed

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-30 09:49:55 +01:00
Florian BRUNIAUX
c844f74c8f docs: update Claude Code releases to v2.1.25
- Add v2.1.25: Fixed beta header validation for Bedrock/Vertex
- Add v2.1.23: Customizable spinner verbs, mTLS/proxy fixes, terminal performance

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-30 08:48:25 +01:00
Florian BRUNIAUX
97c4c7e5ae release: v3.18.0 - Known Issues Tracker & RTK Integration
MAJOR ADDITIONS:

1. Known Issues Tracker (guide/known-issues.md, 285 lines)
   - GitHub issue auto-creation bug (Issue #13797, v2.0.65+, ACTIVE)
   - Excessive token consumption (Issue #16856, v2.1.1+, Jan 2026)
   - Model quality degradation (Aug-Sep 2025, RESOLVED)
   - Verified: 17+ accidental public disclosures, 20+ token reports
   - Fact-checked: Perplexity Pro + GitHub API direct queries
   - Impact: Critical security awareness, actionable workarounds

2. RTK (Rust Token Killer) Integration
   - 72.6% average token reduction for git operations
   - Benchmarks: git log (92.3%), git status (76.0%), find (76.3%)
   - Templates: CLAUDE.md, skill, PreToolUse hook
   - Limitations documented: grep broken, ls worse, early-stage

FIXES:
- Corrected "mgrep" misattribution in Everything Claude Code evaluation

VERSION UPDATES:
- VERSION: 3.17.1 → 3.18.0
- Synced across README.md, cheatsheet.md, ultimate-guide.md, reference.yaml

FILES MODIFIED: 6 (CHANGELOG, VERSION, README, cheatsheet, guide, reference)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-28 18:20:05 +01:00
Florian BRUNIAUX
940caf3f1e docs: add verified critical bugs tracker (known-issues.md)
NEW: guide/known-issues.md (285 lines)
- GitHub issue auto-creation bug (Issue #13797, v2.0.65+, ACTIVE)
  * 17+ confirmed accidental public disclosures
  * Security/privacy risk documented
  * Workarounds: explicit repo, manual approval, pre-execution verification
- Excessive token consumption (Issue #16856, v2.1.1+, Jan 2026)
  * 20+ reports of 4x+ faster consumption
  * Anthropic: "Not officially confirmed as bug" (investigating)
  * Workarounds: /context monitoring, shorter sessions, disable auto-compact
- Model quality degradation (Aug-Sep 2025, RESOLVED)
  * Anthropic official postmortem: 3 infrastructure bugs
  * Community theories (quantization) debunked

FACT-CHECKED: Perplexity Pro + GitHub API direct queries
- Verified: 5,702 open issues (not 4,697), 527 invalid labels
- Corrected: v2.1.1 token bug (not non-existent v2.0.61)
- Sources: GitHub Issues, Anthropic postmortem, The Register

UPDATED:
- guide/README.md: Added known-issues.md to docs table
- machine-readable/reference.yaml: 4 new entries for issue tracking
- CHANGELOG.md: Documented integration process

NEW: docs/resource-evaluations/023-community-discussions-report-jan2026.md
- Full evaluation process documented
- Fact-check methodology: Perplexity + GitHub API
- Score: 2/5 (Marginal - partial integration only)
- Lesson: Always verify community reports with primary sources

Impact: Critical security awareness for users, actionable workarounds

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-28 17:59:16 +01:00
Florian BRUNIAUX
a75c66a890 docs: update Claude Code releases to v2.1.22
Added tracking for latest releases:
- v2.1.22: Task UI improvements, Vim fixes, LSP enhancements
- v2.1.21: Version requirements, TaskCreate enhancements, auto-update checks

Updated in both YAML (machine-readable) and Markdown (human-readable) formats.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-28 15:00:18 +01:00
Florian BRUNIAUX
4dc380efd2 docs: integrate RTK (Rust Token Killer) for command output optimization
- Added Section 9.13 'Command Output Optimization with RTK' (guide/ultimate-guide.md:10478)
- Benchmarks: 72.6% avg token reduction (git log 92.3%, git status 76.0%, find 76.3%)
- Integration templates: CLAUDE.md, Skill, PreToolUse hook
- Evaluation: docs/resource-evaluations/rtk-evaluation.md (score 4/5)
- Limitations: grep broken (v0.2.0), ls worse (-274%), low adoption (8 stars)
- Updated CHANGELOG.md and machine-readable/reference.yaml
- Source: https://github.com/pszymkowiak/rtk v0.2.0
2026-01-28 12:20:48 +01:00
Florian BRUNIAUX
34b7376408 fix: correct mgrep misattribution in Everything Claude Code evaluation
Issue:
- Incorrectly claimed Everything Claude Code contained "mgrep (50% token reduction)" tool
- No such tool exists in affaan-m/everything-claude-code (verified via WebFetch + repo search)
- Confused mgrep (mixedbread-ai semantic search) with non-existent token reduction tool

Files corrected:
- docs/resource-evaluations/015-everything-claude-code-github-repo.md (14 occurrences removed)
- machine-readable/reference.yaml:724 (unique patterns list updated)
- guide/ultimate-guide.md:14821 (replaced with verified patterns)
- CHANGELOG.md (v3.17.0 and v3.15.0 entries updated)

Verified patterns now documented:
- hookify (conversational hooks)
- pass@k metrics (formal verification)
- sandboxed subagents (tool restrictions)
- strategic compaction skills (context management)

Impact: Maintains guide accuracy, prevents user confusion

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-28 09:50:07 +01:00