marketing-shibata50/claude-code-ultimate-guide

History

Florian BRUNIAUX ef7cdd899e release: v3.24.0 - Agent Evaluation Framework Major addition: Complete agent evaluation framework with production-ready template. ## Added - Resource Evaluation: nao framework (score 3/5) - Identified critical gap: agent evaluation not documented - Technical challenge adjusted score 2/5 → 3/5 - All claims fact-checked (TypeScript 58.9%, Python 38.5%) - Guide Section: Agent Evaluation (guide/agent-evaluation.md, ~3K tokens) - Metrics: response quality, tool usage, performance, satisfaction - Patterns: logging hooks, unit tests, A/B testing, feedback loops - Example: analytics agent with built-in metrics - Tools: nao framework reference, Claude Code hooks integration - AI Ecosystem: Section 8.2 Domain-Specific Agent Frameworks - nao (Analytics Agents): Database-agnostic, built-in evaluation - Transposable patterns: context builder, evaluation hooks, DB integrations - Template: Analytics Agent with Evaluation (5 files, ~1K lines) - README: setup, usage, troubleshooting - Agent: SQL generator with evaluation criteria, safety rules - Hook: automated metrics logging (safety, performance, errors) - Script: analysis with stats, safety reports, recommendations - Report template: monthly evaluation format ## Changed - Agent Evaluation Guide: updated template references, verified links - Landing Site: templates count 110 → 114 - Version: 3.23.5 → 3.24.0 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>		2026-02-10 11:52:13 +01:00
..
images	docs: update template count badge (82 → 83)	2026-01-25 13:30:40 +01:00
workflows	docs: integrate Anthropic 2026 Agentic Coding Trends Report	2026-02-09 17:18:52 +01:00
adoption-approaches.md	docs: add Context Engineering (Thoughtworks) + corporate marketplaces footnotes	2026-02-06 16:09:02 +01:00
agent-evaluation.md	release: v3.24.0 - Agent Evaluation Framework	2026-02-10 11:52:13 +01:00
ai-ecosystem.md	release: v3.24.0 - Agent Evaluation Framework	2026-02-10 11:52:13 +01:00
ai-traceability.md	docs: add AI Traceability & Attribution guide	2026-01-24 20:11:53 +01:00
architecture.md	docs: complete Wasp fullstack essentials integration	2026-02-09 10:00:53 +01:00
cheatsheet.md	release: v3.24.0 - Agent Evaluation Framework	2026-02-10 11:52:13 +01:00
claude-code-releases.md	docs: update Claude Code releases to v2.1.37	2026-02-08 11:25:05 +01:00
cowork.md	chore: remove cowork folder after migration to dedicated repo	2026-01-20 12:16:32 +01:00
data-privacy.md	docs: add Anthropic governance resources	2026-01-26 09:31:19 +01:00
devops-sre.md	feat: add DevOps & SRE Guide with FIRE Framework (v3.9.9)	2026-01-20 22:09:31 +01:00
known-issues.md	docs: add verified critical bugs tracker (known-issues.md)	2026-01-28 17:59:16 +01:00
learning-with-ai.md	release: v3.20.1 - Vercel AGENTS.md vs Skills evaluation	2026-01-30 21:45:14 +01:00
mcp-servers-ecosystem.md	docs: complete Wasp fullstack essentials integration	2026-02-09 10:00:53 +01:00
methodologies.md	release: v3.23.4 - Agent Anti-Patterns & Scope-Focused Refactoring	2026-02-09 10:29:59 +01:00
observability.md	docs: complete Wasp fullstack essentials integration	2026-02-09 10:00:53 +01:00
production-safety.md	docs: add Alan Tour Eiffel paradigm evaluation (5/5 CRITICAL)	2026-02-02 14:21:51 +01:00
README.md	release: v3.24.0 - Agent Evaluation Framework	2026-02-10 11:52:13 +01:00
sandbox-isolation.md	docs: add Native Sandboxing comprehensive documentation (v3.21.1)	2026-02-02 20:24:17 +01:00
sandbox-native.md	docs: add Native Sandboxing comprehensive documentation (v3.21.1)	2026-02-02 20:24:17 +01:00
search-tools-cheatsheet.md	docs: add search tools guides and ast-grep patterns	2026-01-25 18:47:29 +01:00
security-hardening.md	docs: complete Wasp fullstack essentials integration	2026-02-09 10:00:53 +01:00
third-party-tools.md	docs: RTK documentation update - upstream + fork integration	2026-02-01 22:20:43 +01:00
ultimate-guide.md	release: v3.24.0 - Agent Evaluation Framework	2026-02-10 11:52:13 +01:00
visual-reference.md	release: v3.20.5 - 4 new ASCII diagrams (visual-reference.md)	2026-01-31 23:14:41 +01:00

README.md

Guide Documentation

Core documentation for mastering Claude Code.

File	Description	Time
ultimate-guide.md	Complete reference covering all Claude Code features	~3 hours
mcp-servers-ecosystem.md	Community MCP servers: 8 validated servers (Playwright, Semgrep, Kubernetes, etc.) with production configs	25 min
third-party-tools.md	Community tools: GUIs, TUIs, config managers, token trackers, alternative UIs	15 min
claude-code-releases.md	Official release history (condensed)	10 min
known-issues.md	Critical bugs tracker: security issues, token consumption, verified community reports	15 min
cheatsheet.md	1-page printable quick reference	5 min
visual-reference.md	Visual cheatsheet — ASCII diagrams for key concepts	5 min
architecture.md	How Claude Code works internally (master loop, tools, context)	25 min
learning-with-ai.md	Guide for juniors on using AI without losing skills	15 min
adoption-approaches.md	Implementation strategies for teams	15 min
agent-evaluation.md	Agent quality metrics: Measuring custom agent effectiveness with hooks, tests, and feedback loops	20 min
data-privacy.md	Data retention and privacy guide	10 min
observability.md	Session monitoring and cost tracking	15 min
methodologies.md	15 development methodologies reference (TDD, SDD, BDD, etc.)	20 min
security-hardening.md	Security threats, MCP vetting, injection defense	25 min
ai-traceability.md	AI attribution, disclosure policies, git-ai, compliance	20 min
devops-sre.md	FIRE framework for infrastructure diagnosis and incident response	30 min
sandbox-isolation.md	Docker Sandboxes, cloud alternatives, safe autonomy workflows	10 min
ai-ecosystem.md	Complementary AI tools (Perplexity, Gemini, Kimi, NotebookLM, TTS)	30 min
cowork.md	Claude Cowork: Summary (see dedicated repo for full docs)	5 min
workflows/	Practical workflow guides for Claude Code	30 min

Cowork Documentation

For knowledge workers using Claude Cowork (agentic desktop):

Resource	Description
Cowork Hub	Complete Cowork documentation
Getting Started	Setup and first workflow
Capabilities	What Cowork can/cannot do
Security Guide	Safe usage practices
Prompt Library	50+ ready-to-use prompts
Cheatsheet	1-page quick reference

Workflows

Hands-on guides for effective development patterns:

File	Description
workflows/tdd-with-claude.md	Test-Driven Development with Claude
workflows/spec-first.md	Spec-First Development (SDD)
workflows/plan-driven.md	Using /plan mode effectively
workflows/iterative-refinement.md	Iterative improvement loops
workflows/tts-setup.md	Add text-to-speech narration to Claude Code (18 min)
workflows/task-management.md	Multi-session task tracking, TodoWrite migration

README.md

Guide Documentation

Contents

Cowork Documentation

Workflows

Recommended Reading Order