release: v3.37.1 - threat-db v2.8.0, CC releases v2.1.78, 19 skills update, doc fixes

- threat-db v2.8.0: GhostClaw campaign, Fake OpenClaw Installer, CVE-2026-24910 (Bun), T017 Shadow MCP, T018 AI Search Poisoning, Jozu Agent Guard, MCP Sentinel - Claude Code releases tracked to v2.1.78 (StopFailure hook, plugin state, security fixes) - 19 skill descriptions improved (PR #9 selective merge, @popey/Tessl) - MCP vs CLI token overhead corrected (lazy loading, 85% reduction benchmark) - Agent Adoption Curve self-assessment (7-level maturity scale, Martignole framework) - ctx7 CLI section §5.5 + resource evals #079 #080 #081 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-18 15:49:01 +01:00 · 2026-03-18 15:49:01 +01:00 · 44818a3f04
commit 44818a3f04
parent eea5662a65
19 changed files with 785 additions and 87 deletions
--- a/docs/resource-evaluations/079-fabro-workflow-orchestration.md
+++ b/docs/resource-evaluations/079-fabro-workflow-orchestration.md
@ -0,0 +1,141 @@
+r# Resource Evaluation #079 — Fabro: Graph-Based Workflow Orchestrator for AI Coding Agents
+
+**Source:** [fabro.sh](https://fabro.sh) / [github.com/fabro-sh/fabro](https://github.com/fabro-sh/fabro)
+**Type:** Open source tool (MIT) — standalone workflow orchestrator for AI coding agents
+**Author:** Bryan from qlty.sh (bryan@qlty.sh)
+**Evaluated:** 2026-03-17
+**Maturity at evaluation:** Created 2026-03-13 — 4 days old, 28 stars, 1 fork
+
+---
+
+## Summary
+
+- **Workflow orchestrator for AI coding agents**: define pipelines as Graphviz DOT graphs with branching, loops, parallelism, and human approval gates — diffable and version-controlled
+- **Multi-model routing**: CSS-like stylesheets assign different LLM models (Claude, OpenAI, Gemini) to specific workflow nodes, with automatic fallback chains
+- **Git checkpointing per stage**: every stage commits code changes and execution metadata to a dedicated Git branch — unique feature with no direct equivalent found in the landscape
+- **Cloud sandboxes**: isolated Daytona VMs with snapshot-based setup, network controls, SSH access, and automatic cleanup
+- **Automatic retrospectives**: each run generates a cost/duration/narrative retrospective for continuous improvement
+- **Direct Claude Code integration**: `curl -fsSL https://fabro.sh/install.md | claude` (security note: pipes directly into Claude, no intermediate review step)
+- **Single Rust binary, zero runtime dependencies**: no Python, no Node.js, no Docker required
+- **REST API + SSE streaming + React web UI**: run workflows programmatically or as a service
+
+---
+
+## Relevance Score
+
+| Score | Meaning |
+|-------|---------|
+| 5 | Essential — Major gap in the guide |
+| 4 | Very relevant — Significant improvement |
+| **3** | **Pertinent — Useful complement, Watch status** |
+| 2 | Marginal — Secondary information |
+| 1 | Out of scope — Not relevant |
+
+**Final score: 3/5 (Watch)**
+
+**Justification:** Fabro falls directly in the "External Orchestration Frameworks" category already documented in the guide (`third-party-tools.md`). Its DOT graph approach is architecturally distinct from all three existing entries (Ruflo = swarms, Athena Flow = hooks layer, Pipelex = DSL). Git checkpointing per stage is a genuine differentiator — no equivalent found in the landscape. Direct Claude Code integration via `claude` pipeline is legitimate. However: 28 stars at 4 days old is the same immaturity profile as Athena Flow (#073, score 2/5). Raised to 3/5 vs Athena Flow because of stronger architectural clarity, a wider feature set with more evidence, and a genuinely unique Git checkpointing angle.
+
+---
+
+## Comparison
+
+| Aspect | Fabro | Guide (current state) |
+|--------|-------|-----------------------|
+| DOT graph pipeline definition | Unique approach | Not covered |
+| Multi-model routing per node | CSS-like stylesheets | Not covered |
+| Git checkpointing per stage | Concrete differentiator | Not covered anywhere |
+| Cloud sandboxes (Daytona) | Declared, unverified in prod | Not covered |
+| Human-in-the-loop approval gates | Hexagon nodes in DOT | Partially covered via Ruflo |
+| External orchestration frameworks | External layer over Claude Code | Ruflo + Athena Flow + Pipelex |
+| Maturity / community traction | 28 stars, 4 days | Ruflo at 18.9k stars |
+| `curl \| claude` install security | Risk: no review step | Guide warns against `curl \| bash` |
+
+---
+
+## Competitive Landscape (Perplexity research, 2026-03-17)
+
+Full competitive analysis conducted. Key findings:
+
+| Tool | Category | Stars | Key difference from Fabro |
+|------|----------|-------|--------------------------|
+| **LangGraph** (LangChain) | Graph-based pipelines | ~34k | Python library (not standalone binary), general purpose (not coding-agent specific), no Git checkpointing |
+| **Goose** (Block) | Coding agent orchestration | ~15k | Recipe-based (not graph), conversational architecture, no Git checkpointing — but much more mature |
+| **OpenHands** | Coding agent platform | ~48k | Event-stream architecture, Docker sandboxes, research-oriented — no DOT graph, no Git checkpointing per stage |
+| **Ruflo** | External orchestration (guide) | 18.9k | Swarm-based (queen + workers), npm, SQLite memory — no DOT graph, no Git checkpointing |
+| **Athena Flow** | Hook-layer runtime (guide) | Watch | Hook → UDS → Node.js — entirely different architecture layer |
+| **Pipelex** | MTHDS DSL (guide) | Watch | Declarative DSL for multi-LLM pipelines — different abstraction |
+| **AutoGen** (Microsoft) | Multi-agent conversations | ~47.9k | General purpose, conversation-loop model, no coding-agent specifics |
+| **CrewAI** | Role-based agent crews | ~34.7k | Role assignment model, no graph definition, no Git checkpointing |
+
+**Fabro's unique combination** (no competitor does all three):
+1. DOT graph workflow definition as a standalone binary
+2. Git checkpointing per stage (code + metadata committed to branches)
+3. Native Claude Code integration via `claude`
+
+**Most relevant alternative for guide readers today**: LangGraph for graph-based workflows (much more mature, Python), Goose (Block) for coding agent orchestration (better traction, MIT).
+
+---
+
+## Recommendations
+
+**When to integrate:** Add as Watch entry now. Promote to full entry in `guide/ecosystem/third-party-tools.md` under "External Orchestration Frameworks" when trigger is reached.
+
+**Where:** After Athena Flow in `third-party-tools.md` External Orchestration Frameworks section.
+
+**How:** Short entry (same format as Athena Flow) with:
+- Architectural distinction (DOT graph — distinct from all three existing entries)
+- Git checkpointing differentiator
+- Security note on `curl | claude` install (same pattern as Ruflo's `curl | bash` warning)
+- Status: "Published March 2026, not yet audited"
+
+**Do NOT do:**
+- Recommend the `curl | claude` install without a security note
+- Present any feature as production-validated (no community evidence yet)
+- Add before the traction trigger is reached
+
+**Secondary discovery:** Goose (Block, github.com/block/goose) warrants its own evaluation (#080). 15k stars, MIT, recipes + subagents + 20 LLM providers — potentially more immediately relevant to the guide's audience.
+
+---
+
+## Challenge (technical-writer agent)
+
+**Initial proposed score:** 4/5
+**Score after challenge:** 3/5 (lowered)
+
+Points raised:
+
+- **Immaturity flag**: 28 stars / 4 days = same pattern as Athena Flow (score 2/5). Applying this inconsistently undermines the scoring framework. Compromise: 3/5 because Fabro shows more architectural evidence than Athena Flow at equivalent age.
+- **`curl | claude` security**: more dangerous than typical `curl | bash` — pipes directly into the codebase with no review step. Guide's own security section would flag this. Must be noted in any future integration.
+- **Ambitious unverified feature set**: cloud sandboxes, DOT routing, retrospectives — none validated by community use at evaluation time.
+- **Risk of not integrating at this stage**: near zero. Category already covered by 3 entries.
+- **What is genuinely novel**: DOT graph definition as diffable text (distinct from Ruflo/Pipelex/Athena) + Git checkpointing per stage = angles worth tracking.
+
+---
+
+## Fact-Check
+
+| Claim | Verified | Source |
+|-------|----------|--------|
+| 28 GitHub stars | Verified | GitHub API direct |
+| Created 2026-03-13 | Verified | GitHub API `created_at` |
+| Rust, single binary, zero deps | Verified | README |
+| MIT license | Verified | GitHub API + README badge |
+| `curl \| claude` install | Verified | README + fabro.sh landing |
+| DOT graph workflows | Verified | README + example code |
+| Daytona cloud sandboxes | Declared | README feature table (unaudited) |
+| Supports Claude/OpenAI/Gemini | Declared | WebFetch landing (unaudited) |
+| Git checkpointing per stage | Declared | README feature table (unaudited) |
+| Automatic retrospectives | Declared | README feature table (unaudited) |
+| Bryan from qlty.sh | Verified | `bryan@qlty.sh` in README |
+
+**No corrections needed:** all claims traced to primary sources. Features marked "Declared" are present in README but not community-validated.
+
+---
+
+## Final Decision
+
+- **Final score**: 3/5
+- **Action**: Watch — add to `watch-list.md`, revisit when trigger reached
+- **Re-eval trigger**: >200 GitHub stars OR practitioner write-up from production use
+- **Confidence**: High on score, medium on features (project too recent for full audit)
+- **Next action**: Evaluate Goose (Block) as #080 — more immediately relevant to guide's audience
--- a/docs/resource-evaluations/080-goose-block-coding-agent.md
+++ b/docs/resource-evaluations/080-goose-block-coding-agent.md
@ -0,0 +1,120 @@
+# Resource Evaluation #080 — Goose (Block): Open-Source AI Coding Agent
+
+**Source:** [block.github.io/goose](https://block.github.io/goose) / [github.com/block/goose](https://github.com/block/goose)
+**Type:** Open source tool (Apache 2.0) — on-machine AI coding agent
+**Author:** Block (formerly Square) — maintained by Block's engineering team
+**Evaluated:** 2026-03-17
+**Maturity at evaluation:** Launched officially January 2025, 33,166 stars, 3,058 forks
+
+---
+
+## Summary
+
+- **On-machine AI coding agent**: local-first CLI + desktop app, not cloud. Automates complex engineering tasks end-to-end
+- **Model-agnostic**: works with Claude (recommended for tool calling), GPT-4o, Gemini, Groq, local models (Ollama) — 20+ providers
+- **Recipes**: versionable, shareable, parameterized multi-step workflows. Distinct from "rules files" — recipes define what agents do, not how they behave
+- **Subagent orchestration**: spawn specialized agents autonomously or via sub-recipes, with dynamic model switching per task/cost
+- **1,700+ MCP servers** supported (first open source agent to support MCP, January 2025)
+- **Goose Grant Program**: Block funds developers building Goose extensions (launched July 2025)
+- **Custom Distributions**: teams can build branded Goose distros with preconfigured providers, extensions, and branding
+- **Backed by Block** (Square, Cash App) — institutional engineering resources, not a solo project
+
+---
+
+## Status in the Guide
+
+**Already documented**: `guide/ecosystem/ai-ecosystem.md` §11.1 "Goose: Open-Source Alternative (Block)"
+
+**The entry exists and is structurally sound.** The issue is outdated data and two missing feature callouts.
+
+---
+
+## Relevance Score
+
+| Score | Meaning |
+|-------|---------|
+| 5 | Essential — Major gap in the guide |
+| **4** | **Very relevant — Significant improvement needed** |
+| 3 | Pertinent — Useful complement |
+| 2 | Marginal — Secondary information |
+| 1 | Out of scope — Not relevant |
+
+**Final score: 4/5 (Update existing entry)**
+
+**Justification:** Goose is already documented. Score reflects the importance of keeping the entry current — at 33k stars (2x what the guide says), Goose is clearly not a niche alternative. The missing Recipes and subagent orchestration paragraphs are also directly relevant to a guide that extensively documents Claude Code's equivalent patterns (skills, slash commands, multi-agent).
+
+---
+
+## What Needs Updating in §11.1
+
+### 1. Stats (outdated)
+
+| Field | Current guide (Jan 2026) | Actual (Mar 2026) |
+|-------|--------------------------|-------------------|
+| GitHub Stars | 15,400+ | 33,166 |
+| MCP servers | 3,000+ (table) vs 1,700+ (inconsistency) | 1,700+ (per Goose docs) |
+| Releases | 100+ | ~175+ (estimated, fast release cadence) |
+
+**Fix**: Update the metrics table and resolve the 3,000 vs 1,700 MCP inconsistency.
+
+### 2. Recipes — missing
+
+Recipes are Goose's equivalent of Claude Code slash commands + skills combined. They are:
+- Versionable, shareable as standalone workflows
+- Importable via deeplinks
+- Parameterized (reusable across contexts)
+- Can be shared across teams
+
+This is directly relevant to a guide section that extensively documents commands and skills. One paragraph with a cross-reference to §3 (commands) and §4 (skills) is warranted.
+
+### 3. Subagent orchestration — missing
+
+Goose's July 2025 roadmap introduced subagent orchestration: spawn specialized sub-agents (Planner, Architect, Frontend Dev, Backend Dev) with dynamic model switching per agent. Example from Berkeley Agentic AI Summit: 7 agents collaboratively built a full-stack app in under an hour.
+
+This overlaps with Claude Code's own multi-agent patterns (§9). A one-paragraph callout with a comparison to Claude Code's Agent tool would help readers understand the architectural difference (Claude Code: single agent + Tool spawning vs Goose: recipe-defined multi-agent subflows).
+
+### 4. agentskills.io — verify live status
+
+The "Skill Portability" paragraph references agentskills.io. **Verified live** (2026-03-17). No change needed.
+
+---
+
+## Challenge (technical-writer agent)
+
+**Score confirmed: 4/5 (update pass)**
+
+Key points:
+- 15.4k → 33k stars: 2x undercount signals the section hasn't been maintained. Trust erosion, not just a metric miss.
+- MCP discrepancy (3,000 vs 1,700): one of these is wrong. Fix before any update goes live.
+- Recipes and subagents absent: closest Goose analogy to Claude Code's skills + multi-agent. Should be documented.
+- Risk of not updating: low urgency for readers, moderate for guide credibility as a current reference.
+- Scope: 30-minute update pass, not a restructure.
+
+---
+
+## Fact-Check
+
+| Claim | Verified | Source |
+|-------|----------|--------|
+| 33,166 GitHub stars | Verified | GitHub API (2026-03-17) |
+| 3,058 forks | Verified | GitHub API |
+| Apache 2.0 license | Verified | GitHub API + README |
+| Rust (primary language) | Verified | GitHub API (`language: "Rust"`) |
+| Created August 2024, launched Jan 2025 | Verified | GitHub API + "1 Year of goose" discussion (Jan 2026) |
+| First open source agent to support MCP | Claimed | "1 Year of goose" GitHub discussion |
+| 1,700+ MCP servers | Per Perplexity (sourced from Goose docs) | Cross-check recommended |
+| Claude 3.5 Sonnet recommended for tool calling | Claimed | Perplexity search citing Goose docs |
+| Goose Grant Program (July 2025) | Verified | block.xyz/inside/introducing-the-goose-grant-program |
+| agentskills.io live | Verified | HTTP fetch (2026-03-17) |
+| Dynamic model switching per subagent | Claimed | GitHub roadmap discussion #3319 |
+
+**MCP server count discrepancy**: Guide says 3,000+ (comparison table), Perplexity reports 1,700+ from Goose docs. Need to check Goose documentation directly before updating. Use the more conservative figure if unsure.
+
+---
+
+## Final Decision
+
+- **Final score**: 4/5
+- **Action**: Update existing `guide/ecosystem/ai-ecosystem.md` §11.1 — stats refresh + Recipes paragraph + subagent orchestration paragraph
+- **Confidence**: High on stats, medium on MCP server count (needs direct doc check)
+- **Priority**: Medium — not urgent, but a 2x star count delta is worth fixing promptly
--- a/docs/resource-evaluations/2026-03-17-martignole-agent-maturity-levels.md
+++ b/docs/resource-evaluations/2026-03-17-martignole-agent-maturity-levels.md
@ -0,0 +1,43 @@
+# Resource Evaluation: Martignole Agent Adoption Maturity Levels
+
+**Date**: 2026-03-17
+**Evaluator**: Claude Code Ultimate Guide team
+**Score**: 3/5 — Integrated
+
+---
+
+## Resource
+
+- **URL**: https://www.touilleur-express.fr/2026/03/17/decouvrir-les-niveaux-de-maturite-de-ladoption-des-coding-agents
+- **Author**: Nicolas Martignole, Principal Engineer at Back Market, Le Touilleur Express blog
+- **Type**: Practitioner framework / blog post
+- **Language**: French
+
+## Summary
+
+6-level maturity framework (0-5 in the original, extended to 6 here) for individual developers adopting coding agents. The real contribution is the Level 3-5 arc: basic user → stage delegator → context engineer → orchestrator. Maps cleanly onto Claude Code concepts (Plan mode, sub-agents, MCP servers, context engineering).
+
+## Score Justification
+
+**3/5 — Pertinent complement**
+
+- Fills a real gap: no individual self-placement scale existed in the guide
+- Practitioner source (production engineering context), not a blogger opinion piece
+- Zero empirical data behind the levels — one engineer's taxonomy
+- Levels 0-2 are noise for the guide's audience (already using Claude Code)
+- ThoughtWorks occupies the "maturity model" reference slot; Martignole's upper levels are more Claude Code-specific
+
+## Integration
+
+Adapted into `guide/roles/learning-with-ai.md` as a "Where Are You on the Agent Adoption Curve?" section (inserted before the 30-Day Progression Plan). The level descriptions were extended (6 levels vs 5 in the original) and diagnostic questions added. Attribution and source link included.
+
+## Fact-Check Notes
+
+- Author identity and role: confirmed via LinkedIn
+- "~5% manual coding at Level 3+": practitioner estimate, not empirical — used as illustrative signal
+- "Free Claude Code Architect certification by Anthropic": not verifiable, not reproduced in the guide
+- Article date 2026-03-17: confirmed from URL
+
+## Decision
+
+**Integrated** — adapted content added to `learning-with-ai.md`. Not a verbatim reproduction; the framework was restructured and extended for the guide's English-speaking, already-technical audience.