claude-code-ultimate-guide/docs/resource-evaluations
Florian BRUNIAUX a5942f1c53 docs: add Addy Osmani spec-writing evaluation (4/5) + spec-first.md sections
Integration of "How to write a good spec for AI agents" by Addy Osmani:

Evaluation (docs/resource-evaluations/addy-osmani-good-spec.md):
- Score: 4/5 (High Value - Integrate within 1 week)
- Fills gaps: modular design, operational boundaries, command specs
- Fact-checked: credentials verified via Perplexity, all claims sourced
- Challenge phase: technical-writer agent corrected initial 3/5 → 4/5

Spec-First Workflow Updates (guide/workflows/spec-first.md):
- NEW: "Modular Spec Design" section (~50 lines, line 322)
  Pattern: Split large specs into focused files (CLAUDE-[domain].md)
- NEW: "Operational Boundaries" section (~60 lines, line 372)
  Three-tier system: Always/Ask First/Never → maps to Claude Code modes
- NEW: "Command Spec Template" section (~40 lines, line 432)
  Executable command specs with expected outputs & error handling
- NEW: "Anti-Pattern: Monolithic CLAUDE.md" section (~30 lines, line 472)
  Explains cognitive load problem (>200 lines = context pollution)

Reference Index (machine-readable/reference.yaml):
- 8 new entries: spec_first_workflow → spec_osmani_score
- Links to new spec-first.md sections with line numbers
- Source attribution: https://addyosmani.com/blog/good-spec/

Public Facing (README.md):
- Incremented resource evaluations count: 35 → 36

File growth: spec-first.md 327 → 507 lines (+180)
Source: Addy Osmani (former Chrome team, 14y), published Jan 13, 2026

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-01 21:30:34 +01:00
..
015-everything-claude-code-github-repo.md fix: correct mgrep misattribution in Everything Claude Code evaluation 2026-01-28 09:50:07 +01:00
016-gang-rui-tasks-api-limitations.md docs: document Tasks API field visibility limitations (Gang Rui analysis) 2026-01-27 16:16:49 +01:00
017-pat-cullen-final-review.md release: v3.20.0 - Multi-Agent Code Review Automation 2026-01-30 16:07:09 +01:00
023-community-discussions-report-jan2026.md docs: add verified critical bugs tracker (known-issues.md) 2026-01-28 17:59:16 +01:00
024-addy-osmani-80-percent-problem.md docs: add Addy Osmani 80% problem to Practitioner Insights 2026-01-30 12:32:38 +01:00
025-vercel-agents-md-vs-skills-eval.md release: v3.20.1 - Vercel AGENTS.md vs Skills evaluation 2026-01-30 21:45:14 +01:00
026-contribution-metrics-blog.md docs: integrate Contribution Metrics blog (4/5) - Anthropic Jan 2026 data 2026-01-30 23:34:15 +01:00
2026-01-27-claude-code-hidden-feature-social-post.md docs: add Myths vs Reality appendix + TeammateTool documentation 2026-01-27 09:45:06 +01:00
addy-osmani-good-spec.md docs: add Addy Osmani spec-writing evaluation (4/5) + spec-first.md sections 2026-02-01 21:30:34 +01:00
agentskills-io-specification.md release: v3.20.6 - agentskills.io integration + 4 resource evaluations 2026-02-01 16:49:33 +01:00
anaconda-croce-evaluation.md docs: add Anaconda Croce evaluation (minimal integration) 2026-01-26 16:53:48 +01:00
anthropic-releases-jan16-23-2026.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
astgrep-flavien-metivier.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
boris-cowork-video-eval.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
clawdbot-twitter-analysis.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
dclaude-docker-wrapper.md release: v3.20.6 - agentskills.io integration + 4 resource evaluations 2026-02-01 16:49:33 +01:00
docker-sandboxes-isolation.md release: v3.20.2 - Sandbox Isolation for Coding Agents 2026-01-31 19:08:25 +01:00
eval-claude-code-releases-jan2026.md docs: update $ARGUMENTS syntax for v2.1.19 breaking change + evaluation 2026-01-26 17:37:49 +01:00
gsd-evaluation.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
mcp-apps-announcement.md docs: add MCP Apps (SEP-1865) documentation 2026-01-27 08:14:49 +01:00
melvyn-malherbe-async-hooks-linkedin.md release: v3.19.0 - Hook Execution Model documentation 2026-01-30 12:37:23 +01:00
mikul-gohil-hooks-evaluation.md docs: add missing hook events from official CHANGELOG (v2.1.9-v2.1.10) 2026-01-27 12:45:47 +01:00
nick-jensen-plugins.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
paddo-team-tips-eval.md release: v3.20.6 - agentskills.io integration + 4 resource evaluations 2026-02-01 16:49:33 +01:00
prompt-repetition-paper.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
README.md release: v3.20.6 - agentskills.io integration + 4 resource evaluations 2026-02-01 16:49:33 +01:00
remotion-claude-code-video.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
rtk-evaluation.md docs: enrich RTK evaluation with T3 Stack production testing 2026-01-28 14:01:37 +01:00
se-cove-plugin.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
self-improve-skill.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
skill-doctor-github-action.md release: v3.20.6 - agentskills.io integration + 4 resource evaluations 2026-02-01 16:49:33 +01:00
steinberger-inference-speed.md release: v3.18.2 - Steinberger Practitioner Insight 2026-01-30 09:49:55 +01:00
system-prompts-official-vs-community.md docs: add system prompts official sources (Anthropic + community analyses) 2026-01-26 17:03:42 +01:00
uml-oop-diagrams.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
vibe-coding-rusitschka.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
wooldridge-productivity-stack.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00
worktrunk-evaluation.md docs: add resource-evaluations to tracked docs 2026-01-26 14:02:05 +01:00

Resource Evaluations

Ce dossier contient les évaluations de ressources externes (articles, vidéos, discussions) pour déterminer leur pertinence pour le Claude Code Ultimate Guide.

Méthodologie

Chaque ressource est évaluée selon un système de scoring standardisé et challengée par un agent technique pour garantir l'objectivité.

Grille de score (sur 5)

Score Signification Action
5 Critical - Breakthrough, must integrate immediately Intégrer sous 24h
4 High Value - New capability or major improvement Intégrer sous 1 semaine
3 Moderate - Useful addition but not urgent Intégrer si temps disponible
2 Marginal - Secondary info or niche use case Ne pas intégrer (ou mention minimale)
1 Low - Redundant, incorrect, or off-topic Rejeter

Process

  1. Analyse initiale: Extraction des faits, vérification des sources
  2. Scoring: Attribution d'un score avec justification
  3. Challenge: Agent technical-writer remet en question le score
  4. Décision finale: Intégration ou rejet avec traçabilité

Nomenclature des fichiers

Format: [topic-slug].md (date supprimée pour stabilité des liens)

Exemple: remotion-claude-code-video.md

Working Documents

Les documents de travail bruts (prompts Perplexity, audits clients) restent dans claudedocs/resource-evaluations/ (gitignored).

Index des Évaluations

Ressource Score Initial Score Final Décision Fichier
Anthropic Releases (Jan 16-23, 2026) - - Suivi régulier anthropic-releases-jan16-23-2026.md
AST-grep (Flavien Métivier) 3/5 4/5 Intégrer workflow astgrep-flavien-metivier.md
MCP Apps (SEP-1865) 3/5 4/5 Intégré (architecture + guide) mcp-apps-announcement.md
Boris Cherny (Cowork Video) 4/5 4/5 Intégré (mental models) boris-cowork-video-eval.md
Clawdbot (Twitter Analysis) 2/5 2/5 ⚠️ Watch only clawdbot-twitter-analysis.md
GSD (Getting Shit Done) 4/5 4/5 Intégré (workflow) gsd-evaluation.md
Nick Jensen Plugins 3/5 3/5 Mention nick-jensen-plugins.md
Prompt Repetition Paper 3/5 4/5 Intégrer best practices prompt-repetition-paper.md
Remotion + Claude Code (Video Production) 2/5 3/5 Mention minimale remotion-claude-code-video.md
SE-Cove Plugin 2/5 2/5 ⚠️ Watch only se-cove-plugin.md
Self-Improve Skill 3/5 3/5 Template ajouté self-improve-skill.md
Steinberger (Inference Speed) 3/5 3/5 Intégré (minimal) steinberger-inference-speed.md
UML & OOP Diagrams 3/5 3/5 Mention uml-oop-diagrams.md
Vibe Coding Level 2 (Rusitschka) 4/5 4/5 Intégré (workflows) vibe-coding-rusitschka.md
Peter Wooldridge (Productivity Stack) 2/5 3/5 Practitioner Insights wooldridge-productivity-stack.md
System Prompts (Official vs Community) 4/5 2/5 ⚠️ Watch only (official sources exist) system-prompts-official-vs-community.md
Worktrunk 4/5 4/5 Intégré (workflow) worktrunk-evaluation.md
Pat Cullen (Multi-Agent PR Review) 5/5 5/5 Intégré (review-pr, code-reviewer, guide) 017-pat-cullen-final-review.md
Docker Sandboxes (Isolation Landscape) 4/5 4/5 Intégré (guide + notice) docker-sandboxes-isolation.md
dclaude (Dockerized Claude Code) 2/5 2/5 ⚠️ Footnote (sandbox-isolation.md) dclaude-docker-wrapper.md
10 Tips from Inside the Claude Code Team (paddo.dev) 4/5 4/5 Intégré (4 sections) paddo-team-tips-eval.md

Dernier update: 2026-02-01 (20 évaluations)