marketing-shibata50/claude-code-ultimate-guide

Florian BRUNIAUX b48d95c024 feat: add agent/skill quality audit tooling + Grenier evaluation

AUDIT TOOLING (3 templates):
- Command: /audit-agents-skills (quick project audits)
  - 16-criteria framework (Identity 3x, Prompt 2x, Validation 1x, Design 2x)
  - Weighted scoring: 32 pts (agents/skills), 20 pts (commands)
  - Production grading (A-F, 80% threshold)
  - Fix mode with actionable suggestions
- Skill: audit-agents-skills (advanced audits)
  - 3 modes: Quick (top-5), Full (all 16), Comparative (vs templates)
  - JSON + Markdown output for CI/CD
- Scoring grids: criteria.yaml (externalized for reuse)

EVALUATION:
- Grenier agent/skill quality (3/5 - Moderate Value)
  - Gap: 29.5% deploy without evaluation (LangChang 2026)
  - Integration: Created audit command + skill + criteria
  - Industry context: 18% cite agent bugs as top challenge

DOCUMENTATION:
- Guide refs: 2 strategic call-outs (after Agent/Skill validation)
- CHANGELOG: New "Added" section + evaluation details
- README: Templates 106→107, Evaluations 49→24 (count corrections)
- reference.yaml: 10 new audit entries + updated counts

SYNC:
- Landing index.html: Templates 107, Evals 24, Quiz 257
- Landing examples/index.html: Templates 107

FILES: 14 changed, 4148 insertions (+1250 lines new audit content)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

2026-02-07 15:40:18 +01:00

5.2 KiB

Raw Blame History

Resource Evaluations

Ce dossier contient les évaluations de ressources externes (articles, vidéos, discussions) pour déterminer leur pertinence pour le Claude Code Ultimate Guide.

Méthodologie

Chaque ressource est évaluée selon un système de scoring standardisé et challengée par un agent technique pour garantir l'objectivité.

Grille de score (sur 5)

Score	Signification	Action
5	Critical - Breakthrough, must integrate immediately	Intégrer sous 24h
4	High Value - New capability or major improvement	Intégrer sous 1 semaine
3	Moderate - Useful addition but not urgent	Intégrer si temps disponible
2	Marginal - Secondary info or niche use case	Ne pas intégrer (ou mention minimale)
1	Low - Redundant, incorrect, or off-topic	Rejeter

Process

Analyse initiale: Extraction des faits, vérification des sources
Scoring: Attribution d'un score avec justification
Challenge: Agent technical-writer remet en question le score
Décision finale: Intégration ou rejet avec traçabilité

Nomenclature des fichiers

Format: [topic-slug].md (date supprimée pour stabilité des liens)

Exemple: remotion-claude-code-video.md

Working Documents

Les documents de travail bruts (prompts Perplexity, audits clients) restent dans claudedocs/resource-evaluations/ (gitignored).

Index des Évaluations

Ressource	Score Initial	Score Final	Décision	Fichier
Anthropic Releases (Jan 16-23, 2026)	-	-	✅ Suivi régulier	anthropic-releases-jan16-23-2026.md
AST-grep (Flavien Métivier)	3/5	4/5	✅ Intégrer workflow	astgrep-flavien-metivier.md
MCP Apps (SEP-1865)	3/5	4/5	✅ Intégré (architecture + guide)	mcp-apps-announcement.md
Boris Cherny (Cowork Video)	4/5	4/5	✅ Intégré (mental models)	boris-cowork-video-eval.md
Clawdbot (Twitter Analysis)	2/5	2/5	⚠️ Watch only	clawdbot-twitter-analysis.md
GSD (Getting Shit Done)	4/5	4/5	✅ Intégré (workflow)	gsd-evaluation.md
Nick Jensen Plugins	3/5	3/5	✅ Mention	nick-jensen-plugins.md
Prompt Repetition Paper	3/5	4/5	✅ Intégrer best practices	prompt-repetition-paper.md
Remotion + Claude Code (Video Production)	2/5	3/5	✅ Mention minimale	remotion-claude-code-video.md
SE-Cove Plugin	2/5	2/5	⚠️ Watch only	se-cove-plugin.md
Self-Improve Skill	3/5	3/5	✅ Template ajouté	self-improve-skill.md
Steinberger (Inference Speed)	3/5	3/5	✅ Intégré (minimal)	steinberger-inference-speed.md
UML & OOP Diagrams	3/5	3/5	✅ Mention	uml-oop-diagrams.md
Vibe Coding Level 2 (Rusitschka)	4/5	4/5	✅ Intégré (workflows)	vibe-coding-rusitschka.md
Peter Wooldridge (Productivity Stack)	2/5	3/5	✅ Practitioner Insights	wooldridge-productivity-stack.md
System Prompts (Official vs Community)	4/5	2/5	⚠️ Watch only (official sources exist)	system-prompts-official-vs-community.md
Worktrunk	4/5	4/5	✅ Intégré (workflow)	worktrunk-evaluation.md
Pat Cullen (Multi-Agent PR Review)	5/5	5/5	✅ Intégré (review-pr, code-reviewer, guide)	017-pat-cullen-final-review.md
Docker Sandboxes (Isolation Landscape)	4/5	4/5	✅ Intégré (guide + notice)	docker-sandboxes-isolation.md
dclaude (Dockerized Claude Code)	2/5	2/5	⚠️ Footnote (sandbox-isolation.md)	dclaude-docker-wrapper.md
10 Tips from Inside the Claude Code Team (paddo.dev)	4/5	4/5	✅ Intégré (4 sections)	paddo-team-tips-eval.md
Sankalp's Claude Code 2.0 Experience	2/5	2/5	⚠️ Watch only (85% overlap, probable errors)	sankalp-claude-code-experience.md
Kajan Siva (/insights command)	2/5	2/5	❌ Do not integrate (no technical content)	kajan-siva-insights-command.md
Zolkos (/insights deep dive)	4/5	4/5	✅ Integrate (architecture + facets)	zolkos-insights-deep-dive.md
Grenier (Agent/Skill Quality)	3/5	3/5	✅ Intégrer partiellement	grenier-agent-skill-quality.md

Dernier update: 2026-02-07 (24 évaluations)

5.2 KiB Raw Blame History