docs: add everything-claude-code to ecosystem + verification loops pattern

- Add affaan-m/everything-claude-code to ecosystem (16k+ stars)
  - Note: author hackathon win was for Zenith project, not this repo
  - Caveat: Node.js hooks not officially recommended by Anthropic
- Document "Verification Loops" pattern in methodologies.md
  - Official Anthropic guidance: iterate until tests pass
- Document "Eval Harness" concept with source link
  - Reference: anthropic.com/engineering/demystifying-evals-for-ai-agents
- Add deep_dive index entries for quick lookup

Co-Authored-By: Claude <noreply@anthropic.com>
Florian BRUNIAUX 2026-01-23 08:55:36 +01:00
parent 2a63230c95
commit 73e371e237
2 changed files with 29 additions and 0 deletions

@@ -142,11 +142,26 @@ Strict iteration: 2 weeks max per feature.
With Claude: Be explicit. "Write FAILING tests that don't exist yet."
> **Verification Loops** — A formalized pattern for autonomous iteration:
>
> Use tests as the termination condition:
> 1. Claude writes tests for the feature
> 2. Claude iterates on the code until the tests pass
> 3. Continue until the explicit completion criteria are met
>
> **Official guidance**: *"Tell Claude to keep going until all tests pass. It will usually take a few iterations."* — [Anthropic Best Practices](https://www.anthropic.com/engineering/claude-code-best-practices)
>
> Implementation: Can be enforced via Stop hooks, multi-Claude verification, or explicit "DONE" markers in prompts.
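A minimal sketch of the loop described above, assuming two hypothetical callables that are not part of any Anthropic API: `run_tests` reports whether the suite passes, and `revise` asks the agent for another attempt. The `max_iterations` budget is also an assumption, added so the loop cannot run forever.

```python
from typing import Callable


def verification_loop(
    run_tests: Callable[[], bool],
    revise: Callable[[int], None],
    max_iterations: int = 5,
) -> int:
    """Run tests, revise on failure, and stop as soon as the tests pass.

    Returns the 1-based iteration on which the tests passed.
    Raises RuntimeError if the iteration budget is exhausted first.
    """
    for iteration in range(1, max_iterations + 1):
        if run_tests():  # termination condition: all tests pass
            return iteration
        if iteration < max_iterations:
            revise(iteration)  # hypothetical hook: request another attempt
    raise RuntimeError(f"tests still failing after {max_iterations} iterations")


# Stand-in for a real agent: each revision fixes one of two failing checks.
state = {"failing": 2}


def fake_run_tests() -> bool:
    return state["failing"] == 0


def fake_revise(iteration: int) -> None:
    state["failing"] -= 1


passed_on = verification_loop(fake_run_tests, fake_revise)
print(f"tests passed on iteration {passed_on}")
```

In a real setup `run_tests` would typically shell out to the project's test runner and `revise` would send the failure output back to Claude. Raising on an exhausted budget keeps a stuck loop visible instead of reporting silent success.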
**Eval-Driven Development** — TDD for LLMs. Test agent behaviors via evals:
- Code-based: `output == golden_answer`
- LLM-based: Another Claude instance grades the output
- Human grading: The reference standard, but slow
> **Eval Harness** — The infrastructure that runs evaluations end-to-end: providing instructions and tools, running tasks concurrently, recording steps, grading outputs, and aggregating results.
>
> See Anthropic's comprehensive guide: [Demystifying Evals for AI Agents](https://www.anthropic.com/engineering/demystifying-evals-for-ai-agents)
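To make the harness responsibilities concrete, here is a toy sketch that uses only the Python standard library. It covers running tasks concurrently, recording steps, code-based grading (`output == golden_answer`), and aggregating a pass rate. It leaves out tool provisioning, and every name in it (`Task`, `Result`, `run_task`, `run_evals`, `toy_agent`) is hypothetical and not taken from the Anthropic guide.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable


@dataclass
class Task:
    instructions: str
    golden_answer: str


@dataclass
class Result:
    task: Task
    output: str
    steps: list[str]  # recorded transcript of what happened
    passed: bool


def run_task(task: Task, agent: Callable[[str], str]) -> Result:
    """Run one task, record its steps, and grade the output."""
    steps = [f"instructions: {task.instructions}"]
    output = agent(task.instructions)
    steps.append(f"output: {output}")
    passed = output == task.golden_answer  # code-based grader
    return Result(task, output, steps, passed)


def run_evals(
    tasks: list[Task], agent: Callable[[str], str], workers: int = 4
) -> float:
    """Run all tasks concurrently and return the aggregate pass rate."""
    if not tasks:
        return 0.0
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(lambda t: run_task(t, agent), tasks))
    return sum(r.passed for r in results) / len(results)


# Stand-in agent (an assumption for the demo): upper-cases its instructions.
def toy_agent(instructions: str) -> str:
    return instructions.upper()


tasks = [Task("abc", "ABC"), Task("hello", "HELLO"), Task("x", "y")]
pass_rate = run_evals(tasks, toy_agent)
print(f"pass rate: {pass_rate:.2f}")
```

Swapping the equality check in `run_task` for a call to a grader model would turn this into the LLM-based variant. Threads fit here on the assumption that real agent calls are I/O-bound.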
**Multi-Agent Orchestration** — From single assistant to orchestrated team:
```
Meta-Agent (Orchestrator)