release: v3.37.1 - threat-db v2.8.0, CC releases v2.1.78, 19 skills update, doc fixes

- threat-db v2.8.0: GhostClaw campaign, Fake OpenClaw Installer, CVE-2026-24910 (Bun),
  T017 Shadow MCP, T018 AI Search Poisoning, Jozu Agent Guard, MCP Sentinel
- Claude Code releases tracked to v2.1.78 (StopFailure hook, plugin state, security fixes)
- 19 skill descriptions improved (PR #9 selective merge, @popey/Tessl)
- MCP vs CLI token overhead corrected (lazy loading, 85% reduction benchmark)
- Agent Adoption Curve self-assessment (7-level maturity scale, Martignole framework)
- ctx7 CLI section §5.5 + resource evals #079 #080 #081

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
Florian BRUNIAUX 2026-03-18 15:49:01 +01:00
parent eea5662a65
commit 44818a3f04
19 changed files with 785 additions and 87 deletions

View file

@ -0,0 +1,141 @@
r# Resource Evaluation #079 — Fabro: Graph-Based Workflow Orchestrator for AI Coding Agents
**Source:** [fabro.sh](https://fabro.sh) / [github.com/fabro-sh/fabro](https://github.com/fabro-sh/fabro)
**Type:** Open source tool (MIT) — standalone workflow orchestrator for AI coding agents
**Author:** Bryan from qlty.sh (bryan@qlty.sh)
**Evaluated:** 2026-03-17
**Maturity at evaluation:** Created 2026-03-13 — 4 days old, 28 stars, 1 fork
---
## Summary
- **Workflow orchestrator for AI coding agents**: define pipelines as Graphviz DOT graphs with branching, loops, parallelism, and human approval gates — diffable and version-controlled
- **Multi-model routing**: CSS-like stylesheets assign different LLM models (Claude, OpenAI, Gemini) to specific workflow nodes, with automatic fallback chains
- **Git checkpointing per stage**: every stage commits code changes and execution metadata to a dedicated Git branch — unique feature with no direct equivalent found in the landscape
- **Cloud sandboxes**: isolated Daytona VMs with snapshot-based setup, network controls, SSH access, and automatic cleanup
- **Automatic retrospectives**: each run generates a cost/duration/narrative retrospective for continuous improvement
- **Direct Claude Code integration**: `curl -fsSL https://fabro.sh/install.md | claude` (security note: pipes directly into Claude, no intermediate review step)
- **Single Rust binary, zero runtime dependencies**: no Python, no Node.js, no Docker required
- **REST API + SSE streaming + React web UI**: run workflows programmatically or as a service
---
## Relevance Score
| Score | Meaning |
|-------|---------|
| 5 | Essential — Major gap in the guide |
| 4 | Very relevant — Significant improvement |
| **3** | **Pertinent — Useful complement, Watch status** |
| 2 | Marginal — Secondary information |
| 1 | Out of scope — Not relevant |
**Final score: 3/5 (Watch)**
**Justification:** Fabro falls directly in the "External Orchestration Frameworks" category already documented in the guide (`third-party-tools.md`). Its DOT graph approach is architecturally distinct from all three existing entries (Ruflo = swarms, Athena Flow = hooks layer, Pipelex = DSL). Git checkpointing per stage is a genuine differentiator — no equivalent found in the landscape. Direct Claude Code integration via `claude` pipeline is legitimate. However: 28 stars at 4 days old is the same immaturity profile as Athena Flow (#073, score 2/5). Raised to 3/5 vs Athena Flow because of stronger architectural clarity, a wider feature set with more evidence, and a genuinely unique Git checkpointing angle.
---
## Comparison
| Aspect | Fabro | Guide (current state) |
|--------|-------|-----------------------|
| DOT graph pipeline definition | Unique approach | Not covered |
| Multi-model routing per node | CSS-like stylesheets | Not covered |
| Git checkpointing per stage | Concrete differentiator | Not covered anywhere |
| Cloud sandboxes (Daytona) | Declared, unverified in prod | Not covered |
| Human-in-the-loop approval gates | Hexagon nodes in DOT | Partially covered via Ruflo |
| External orchestration frameworks | External layer over Claude Code | Ruflo + Athena Flow + Pipelex |
| Maturity / community traction | 28 stars, 4 days | Ruflo at 18.9k stars |
| `curl \| claude` install security | Risk: no review step | Guide warns against `curl \| bash` |
---
## Competitive Landscape (Perplexity research, 2026-03-17)
Full competitive analysis conducted. Key findings:
| Tool | Category | Stars | Key difference from Fabro |
|------|----------|-------|--------------------------|
| **LangGraph** (LangChain) | Graph-based pipelines | ~34k | Python library (not standalone binary), general purpose (not coding-agent specific), no Git checkpointing |
| **Goose** (Block) | Coding agent orchestration | ~15k | Recipe-based (not graph), conversational architecture, no Git checkpointing — but much more mature |
| **OpenHands** | Coding agent platform | ~48k | Event-stream architecture, Docker sandboxes, research-oriented — no DOT graph, no Git checkpointing per stage |
| **Ruflo** | External orchestration (guide) | 18.9k | Swarm-based (queen + workers), npm, SQLite memory — no DOT graph, no Git checkpointing |
| **Athena Flow** | Hook-layer runtime (guide) | Watch | Hook → UDS → Node.js — entirely different architecture layer |
| **Pipelex** | MTHDS DSL (guide) | Watch | Declarative DSL for multi-LLM pipelines — different abstraction |
| **AutoGen** (Microsoft) | Multi-agent conversations | ~47.9k | General purpose, conversation-loop model, no coding-agent specifics |
| **CrewAI** | Role-based agent crews | ~34.7k | Role assignment model, no graph definition, no Git checkpointing |
**Fabro's unique combination** (no competitor does all three):
1. DOT graph workflow definition as a standalone binary
2. Git checkpointing per stage (code + metadata committed to branches)
3. Native Claude Code integration via `claude`
**Most relevant alternative for guide readers today**: LangGraph for graph-based workflows (much more mature, Python), Goose (Block) for coding agent orchestration (better traction, MIT).
---
## Recommendations
**When to integrate:** Add as Watch entry now. Promote to full entry in `guide/ecosystem/third-party-tools.md` under "External Orchestration Frameworks" when trigger is reached.
**Where:** After Athena Flow in `third-party-tools.md` External Orchestration Frameworks section.
**How:** Short entry (same format as Athena Flow) with:
- Architectural distinction (DOT graph — distinct from all three existing entries)
- Git checkpointing differentiator
- Security note on `curl | claude` install (same pattern as Ruflo's `curl | bash` warning)
- Status: "Published March 2026, not yet audited"
**Do NOT do:**
- Recommend the `curl | claude` install without a security note
- Present any feature as production-validated (no community evidence yet)
- Add before the traction trigger is reached
**Secondary discovery:** Goose (Block, github.com/block/goose) warrants its own evaluation (#080). 15k stars, MIT, recipes + subagents + 20 LLM providers — potentially more immediately relevant to the guide's audience.
---
## Challenge (technical-writer agent)
**Initial proposed score:** 4/5
**Score after challenge:** 3/5 (lowered)
Points raised:
- **Immaturity flag**: 28 stars / 4 days = same pattern as Athena Flow (score 2/5). Applying this inconsistently undermines the scoring framework. Compromise: 3/5 because Fabro shows more architectural evidence than Athena Flow at equivalent age.
- **`curl | claude` security**: more dangerous than typical `curl | bash` — pipes directly into the codebase with no review step. Guide's own security section would flag this. Must be noted in any future integration.
- **Ambitious unverified feature set**: cloud sandboxes, DOT routing, retrospectives — none validated by community use at evaluation time.
- **Risk of not integrating at this stage**: near zero. Category already covered by 3 entries.
- **What is genuinely novel**: DOT graph definition as diffable text (distinct from Ruflo/Pipelex/Athena) + Git checkpointing per stage = angles worth tracking.
---
## Fact-Check
| Claim | Verified | Source |
|-------|----------|--------|
| 28 GitHub stars | Verified | GitHub API direct |
| Created 2026-03-13 | Verified | GitHub API `created_at` |
| Rust, single binary, zero deps | Verified | README |
| MIT license | Verified | GitHub API + README badge |
| `curl \| claude` install | Verified | README + fabro.sh landing |
| DOT graph workflows | Verified | README + example code |
| Daytona cloud sandboxes | Declared | README feature table (unaudited) |
| Supports Claude/OpenAI/Gemini | Declared | WebFetch landing (unaudited) |
| Git checkpointing per stage | Declared | README feature table (unaudited) |
| Automatic retrospectives | Declared | README feature table (unaudited) |
| Bryan from qlty.sh | Verified | `bryan@qlty.sh` in README |
**No corrections needed:** all claims traced to primary sources. Features marked "Declared" are present in README but not community-validated.
---
## Final Decision
- **Final score**: 3/5
- **Action**: Watch — add to `watch-list.md`, revisit when trigger reached
- **Re-eval trigger**: >200 GitHub stars OR practitioner write-up from production use
- **Confidence**: High on score, medium on features (project too recent for full audit)
- **Next action**: Evaluate Goose (Block) as #080 — more immediately relevant to guide's audience

View file

@ -0,0 +1,120 @@
# Resource Evaluation #080 — Goose (Block): Open-Source AI Coding Agent
**Source:** [block.github.io/goose](https://block.github.io/goose) / [github.com/block/goose](https://github.com/block/goose)
**Type:** Open source tool (Apache 2.0) — on-machine AI coding agent
**Author:** Block (formerly Square) — maintained by Block's engineering team
**Evaluated:** 2026-03-17
**Maturity at evaluation:** Launched officially January 2025, 33,166 stars, 3,058 forks
---
## Summary
- **On-machine AI coding agent**: local-first CLI + desktop app, not cloud. Automates complex engineering tasks end-to-end
- **Model-agnostic**: works with Claude (recommended for tool calling), GPT-4o, Gemini, Groq, local models (Ollama) — 20+ providers
- **Recipes**: versionable, shareable, parameterized multi-step workflows. Distinct from "rules files" — recipes define what agents do, not how they behave
- **Subagent orchestration**: spawn specialized agents autonomously or via sub-recipes, with dynamic model switching per task/cost
- **1,700+ MCP servers** supported (first open source agent to support MCP, January 2025)
- **Goose Grant Program**: Block funds developers building Goose extensions (launched July 2025)
- **Custom Distributions**: teams can build branded Goose distros with preconfigured providers, extensions, and branding
- **Backed by Block** (Square, Cash App) — institutional engineering resources, not a solo project
---
## Status in the Guide
**Already documented**: `guide/ecosystem/ai-ecosystem.md` §11.1 "Goose: Open-Source Alternative (Block)"
**The entry exists and is structurally sound.** The issue is outdated data and two missing feature callouts.
---
## Relevance Score
| Score | Meaning |
|-------|---------|
| 5 | Essential — Major gap in the guide |
| **4** | **Very relevant — Significant improvement needed** |
| 3 | Pertinent — Useful complement |
| 2 | Marginal — Secondary information |
| 1 | Out of scope — Not relevant |
**Final score: 4/5 (Update existing entry)**
**Justification:** Goose is already documented. Score reflects the importance of keeping the entry current — at 33k stars (2x what the guide says), Goose is clearly not a niche alternative. The missing Recipes and subagent orchestration paragraphs are also directly relevant to a guide that extensively documents Claude Code's equivalent patterns (skills, slash commands, multi-agent).
---
## What Needs Updating in §11.1
### 1. Stats (outdated)
| Field | Current guide (Jan 2026) | Actual (Mar 2026) |
|-------|--------------------------|-------------------|
| GitHub Stars | 15,400+ | 33,166 |
| MCP servers | 3,000+ (table) vs 1,700+ (inconsistency) | 1,700+ (per Goose docs) |
| Releases | 100+ | ~175+ (estimated, fast release cadence) |
**Fix**: Update the metrics table and resolve the 3,000 vs 1,700 MCP inconsistency.
### 2. Recipes — missing
Recipes are Goose's equivalent of Claude Code slash commands + skills combined. They are:
- Versionable, shareable as standalone workflows
- Importable via deeplinks
- Parameterized (reusable across contexts)
- Can be shared across teams
This is directly relevant to a guide section that extensively documents commands and skills. One paragraph with a cross-reference to §3 (commands) and §4 (skills) is warranted.
### 3. Subagent orchestration — missing
Goose's July 2025 roadmap introduced subagent orchestration: spawn specialized sub-agents (Planner, Architect, Frontend Dev, Backend Dev) with dynamic model switching per agent. Example from Berkeley Agentic AI Summit: 7 agents collaboratively built a full-stack app in under an hour.
This overlaps with Claude Code's own multi-agent patterns (§9). A one-paragraph callout with a comparison to Claude Code's Agent tool would help readers understand the architectural difference (Claude Code: single agent + Tool spawning vs Goose: recipe-defined multi-agent subflows).
### 4. agentskills.io — verify live status
The "Skill Portability" paragraph references agentskills.io. **Verified live** (2026-03-17). No change needed.
---
## Challenge (technical-writer agent)
**Score confirmed: 4/5 (update pass)**
Key points:
- 15.4k → 33k stars: 2x undercount signals the section hasn't been maintained. Trust erosion, not just a metric miss.
- MCP discrepancy (3,000 vs 1,700): one of these is wrong. Fix before any update goes live.
- Recipes and subagents absent: closest Goose analogy to Claude Code's skills + multi-agent. Should be documented.
- Risk of not updating: low urgency for readers, moderate for guide credibility as a current reference.
- Scope: 30-minute update pass, not a restructure.
---
## Fact-Check
| Claim | Verified | Source |
|-------|----------|--------|
| 33,166 GitHub stars | Verified | GitHub API (2026-03-17) |
| 3,058 forks | Verified | GitHub API |
| Apache 2.0 license | Verified | GitHub API + README |
| Rust (primary language) | Verified | GitHub API (`language: "Rust"`) |
| Created August 2024, launched Jan 2025 | Verified | GitHub API + "1 Year of goose" discussion (Jan 2026) |
| First open source agent to support MCP | Claimed | "1 Year of goose" GitHub discussion |
| 1,700+ MCP servers | Per Perplexity (sourced from Goose docs) | Cross-check recommended |
| Claude 3.5 Sonnet recommended for tool calling | Claimed | Perplexity search citing Goose docs |
| Goose Grant Program (July 2025) | Verified | block.xyz/inside/introducing-the-goose-grant-program |
| agentskills.io live | Verified | HTTP fetch (2026-03-17) |
| Dynamic model switching per subagent | Claimed | GitHub roadmap discussion #3319 |
**MCP server count discrepancy**: Guide says 3,000+ (comparison table), Perplexity reports 1,700+ from Goose docs. Need to check Goose documentation directly before updating. Use the more conservative figure if unsure.
---
## Final Decision
- **Final score**: 4/5
- **Action**: Update existing `guide/ecosystem/ai-ecosystem.md` §11.1 — stats refresh + Recipes paragraph + subagent orchestration paragraph
- **Confidence**: High on stats, medium on MCP server count (needs direct doc check)
- **Priority**: Medium — not urgent, but a 2x star count delta is worth fixing promptly

View file

@ -0,0 +1,43 @@
# Resource Evaluation: Martignole Agent Adoption Maturity Levels
**Date**: 2026-03-17
**Evaluator**: Claude Code Ultimate Guide team
**Score**: 3/5 — Integrated
---
## Resource
- **URL**: https://www.touilleur-express.fr/2026/03/17/decouvrir-les-niveaux-de-maturite-de-ladoption-des-coding-agents
- **Author**: Nicolas Martignole, Principal Engineer at Back Market, Le Touilleur Express blog
- **Type**: Practitioner framework / blog post
- **Language**: French
## Summary
6-level maturity framework (0-5 in the original, extended to 6 here) for individual developers adopting coding agents. The real contribution is the Level 3-5 arc: basic user → stage delegator → context engineer → orchestrator. Maps cleanly onto Claude Code concepts (Plan mode, sub-agents, MCP servers, context engineering).
## Score Justification
**3/5 — Pertinent complement**
- Fills a real gap: no individual self-placement scale existed in the guide
- Practitioner source (production engineering context), not a blogger opinion piece
- Zero empirical data behind the levels — one engineer's taxonomy
- Levels 0-2 are noise for the guide's audience (already using Claude Code)
- ThoughtWorks occupies the "maturity model" reference slot; Martignole's upper levels are more Claude Code-specific
## Integration
Adapted into `guide/roles/learning-with-ai.md` as a "Where Are You on the Agent Adoption Curve?" section (inserted before the 30-Day Progression Plan). The level descriptions were extended (6 levels vs 5 in the original) and diagnostic questions added. Attribution and source link included.
## Fact-Check Notes
- Author identity and role: confirmed via LinkedIn
- "~5% manual coding at Level 3+": practitioner estimate, not empirical — used as illustrative signal
- "Free Claude Code Architect certification by Anthropic": not verifiable, not reproduced in the guide
- Article date 2026-03-17: confirmed from URL
## Decision
**Integrated** — adapted content added to `learning-with-ai.md`. Not a verbatim reproduction; the framework was restructured and extended for the guide's English-speaking, already-technical audience.