release: v3.37.1 - threat-db v2.8.0, CC releases v2.1.78, 19 skills update, doc fixes

- threat-db v2.8.0: GhostClaw campaign, Fake OpenClaw Installer, CVE-2026-24910 (Bun), T017 Shadow MCP, T018 AI Search Poisoning, Jozu Agent Guard, MCP Sentinel - Claude Code releases tracked to v2.1.78 (StopFailure hook, plugin state, security fixes) - 19 skill descriptions improved (PR #9 selective merge, @popey/Tessl) - MCP vs CLI token overhead corrected (lazy loading, 85% reduction benchmark) - Agent Adoption Curve self-assessment (7-level maturity scale, Martignole framework) - ctx7 CLI section §5.5 + resource evals #079 #080 #081 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-18 15:49:01 +01:00 · 2026-03-18 15:49:01 +01:00 · 44818a3f04
commit 44818a3f04
parent eea5662a65
19 changed files with 785 additions and 87 deletions
--- a/guide/cheatsheet.md
+++ b/guide/cheatsheet.md
@ -12,7 +12,7 @@ tags: [cheatsheet, reference]

 **Written with**: Claude (Anthropic)

-**Version**: 3.37.0 | **Last Updated**: March 2026
+**Version**: 3.37.1 | **Last Updated**: March 2026

 ---

@ -639,4 +639,4 @@ Speed: `rg` (~20ms) → Serena (~100ms) → ast-grep (~200ms) → grepai (~500ms

 **Author**: Florian BRUNIAUX | [@Méthode Aristote](https://methode-aristote.fr) | Written with Claude

-*Last updated: March 2026 | Version 3.37.0*
+*Last updated: March 2026 | Version 3.37.1*
--- a/guide/ecosystem/ai-ecosystem.md
+++ b/guide/ecosystem/ai-ecosystem.md
@ -2107,11 +2107,11 @@ For developers hitting Claude Code's subscription limits or needing model flexib

 An **on-machine AI coding agent** developed by Block (formerly Square), released under Apache 2.0 license. Unlike Claude Code, Goose runs entirely locally and is **model-agnostic**—it can use Claude, GPT, Gemini, Groq, or any LLM provider.

-| Metric | Value (Jan 2026) |
+| Metric | Value (Mar 2026) |
 |--------|------------------|
-| **GitHub Stars** | 15,400+ |
-| **Contributors** | 350+ |
-| **Releases** | 100+ since Jan 2025 |
+| **GitHub Stars** | 33,000+ |
+| **Contributors** | 400+ |
+| **Releases** | 175+ since Jan 2025 |
 | **License** | Apache 2.0 (permissive) |
 | **Primary Language** | Rust (64%) + TypeScript (26%) |

@ -2124,7 +2124,7 @@ An **on-machine AI coding agent** developed by Block (formerly Square), released
 | **Cost Model** | Subscription ($20-$200/mo) | Free + your LLM API costs |
 | **Rate Limits** | Anthropic's weekly/5-hour caps | Your LLM provider's limits |
 | **Token Visibility** | Opaque (no per-prompt tracking) | Full transparency |
-| **MCP Support** | Native (growing ecosystem) | 3,000+ MCP servers available |
+| **MCP Support** | Native (growing ecosystem) | Thousands of MCP servers available |
 | **Setup Complexity** | Simple (npm install) | Moderate (Rust toolchain, API keys) |

 ### When to Consider Goose
@ -2142,6 +2142,14 @@ An **on-machine AI coding agent** developed by Block (formerly Square), released
 - You value Claude's specific reasoning capabilities and can't substitute
 - You don't want to manage LLM API credentials

+### Recipes: Goose's Equivalent of Skills + Commands
+
+Goose has a workflow primitive called **Recipes** — versionable, shareable, parameterized multi-step workflows. Unlike Claude Code's skills (which define agent capabilities) or slash commands (which trigger one-shot actions), Recipes define complete execution sequences: what to do, in what order, with which model at each step. They can be shared as deeplinks, imported by teammates, and checked into source control. Closest Claude Code analogy: a skill that chains multiple commands in a defined sequence, with state carried between steps.
+
+### Subagent Orchestration
+
+Since mid-2025, Goose supports spawning specialized subagents within a workflow. A parent agent can delegate subtasks to subagents with different roles (Planner, Architect, Frontend Dev, Backend Dev), each potentially running a different LLM optimized for its task. This differs from Claude Code's Agent tool (which spawns a subagent with the same model) — Goose enables heterogeneous agent teams where model choice is per-role, not per-session. See §9 for Claude Code's native multi-agent patterns.
+
 ### Skill Portability

 Both Claude Code and Goose support the [Agent Skills open standard](https://agentskills.io) (agentskills.io). Skills you create with SKILL.md are portable across 26+ platforms including Cursor, VS Code, GitHub, OpenAI Codex, and Gemini CLI. Claude Code-specific fields (`context`, `agent`) are ignored by other platforms but don't break compatibility.
--- a/guide/ecosystem/mcp-vs-cli.md
+++ b/guide/ecosystem/mcp-vs-cli.md
@ -40,7 +40,7 @@ This page compares two integration patterns for giving Claude Code access to ext

 | Advantage | Detail |
 |-----------|--------|
-| **Zero context overhead** | No schema injected at startup — relevant when context budget is tight |
+| **Zero context overhead** | No schema injected at startup. Since v2.1.7 lazy loading closes most of the gap, but CLI is still the absolute minimum. |
 | **Deterministic actions** | Explicit commands with predictable output are easier to audit and test |
 | **Human + AI use** | The same CLI wrapper works for a developer running it manually and for Claude |
 | **Frontier models** | Claude Opus/Sonnet 4.6 can drive complex CLIs (aws-cli, glab, gh) without a structured schema |
@ -52,7 +52,7 @@ This page compares two integration patterns for giving Claude Code access to ext

 | Weakness | Detail |
 |----------|--------|
-| **Schema token cost** | Every MCP server injects its full tool list into the context window at session start, whether or not those tools are used that session |
+| **Schema token cost** | Since v2.1.7, lazy loading (MCP Tool Search) means unused tools inject only their name, not their full schema. Cost is still non-zero: tool names load at startup, full schemas load on first use. The pre-v2.1.7 worst case (~55K tokens for a 5-server setup) now averages ~8.7K — an 85% reduction, but not zero. |
 | **Connection overhead** | Session startup takes longer with many MCP servers connected |
 | **Debugging difficulty** | Failures inside an MCP server are harder to trace than a failed shell command |
 | **Maintenance complexity** | Running, updating, and securing remote MCP servers adds infrastructure |
@ -112,7 +112,7 @@ Quick reference — not rules, but directional defaults.
 | Individual dev, local machine | **CLI or skill** | Simpler, faster, no infrastructure |
 | Deterministic actions (git, CI, deploy) | **CLI** | Explicit commands, predictable output, auditable |
 | Complex auth (OAuth, token refresh) | **MCP** | Server handles auth; CLI would require credential plumbing |
-| Tight context budget / many tools loaded | **CLI** | No schema injection at startup |
+| Tight context budget / many tools loaded | **CLI** | Still the minimum-overhead option. Lazy loading (v2.1.7+) reduces MCP cost significantly, but CLI has zero schema cost by design. |
 | Agent-to-agent structured output | **MCP** | JSON responses are more reliable than parsed CLI text |
 | Debugging / prototyping a new integration | **CLI** | Easier to inspect, faster to iterate |
 | Browser automation (non-frontier model) | **MCP** | Playwright MCP structures interaction reliably |
@ -138,18 +138,35 @@ The mistake is applying one answer to both layers. A solo developer building a C

 ## Token cost of MCP schemas — what the numbers look like

-MCP servers inject their full tool list into the context at session start. This is not free.
+Since v2.1.7 (January 2026), Claude Code uses **MCP Tool Search** (lazy loading) by default. This changes the token math significantly, but does not eliminate schema cost entirely.

-A typical MCP server with 10-15 tools injects 500-2,000 tokens per session before any task starts. With 5 MCP servers connected, that is 2,500-10,000 tokens of overhead on every session, whether or not those tools are used.
+**How lazy loading works:** instead of injecting all tool schemas at session start, Claude receives only tool names in an `<available-deferred-tools>` block. Full schemas are fetched via `ToolSearch` only when Claude decides to call a specific tool. Unused tools in a session cost only their name in context (~0 schema tokens), not the full definition.

-The practical consequence: if you load 10 MCP servers but only use 2 in a given session, you are paying for 8 servers worth of schema every time. This compounds with long sessions and high-frequency workflows.
+**Measured impact** (Anthropic benchmarks, 5-server setup):

-**Mitigation strategies:**
+| Scenario | Token overhead | Note |
+|----------|---------------|------|
+| Before v2.1.7 (eager loading) | ~55,000 tokens | All schemas preloaded |
+| After v2.1.7 (lazy loading) | ~8,700 tokens | 85% reduction |
+| CLI (no MCP) | ~0 tokens | Baseline |
+
+The old worst-case claim of "500-2,000 tokens per server" described eager loading, which is no longer the default. With lazy loading, the cost per unused server is near zero. The cost per *used* server (~600 tokens per tool schema loaded on demand) remains real, but is now pay-per-use rather than always-on.
+
+**What still adds overhead even with lazy loading:**
+
+- Tool names are still injected at startup (one line per tool per server)
+- Schemas load at first invocation — long sessions using many tools accumulate cost
+- Connection setup per server is unchanged (latency, not tokens)
+- Many connected MCP servers still means more names in context, even if schemas stay deferred
+
+**Configuration** (v2.1.9+): the `ENABLE_TOOL_SEARCH` environment variable controls the threshold. `auto:N` triggers lazy loading when MCP tools exceed N% of context (default 10%).
+
+**Mitigation strategies** (still relevant, lower urgency):

 - Load MCP servers selectively per project (project-level config vs global config)
- Use CLI tools for high-frequency operations where schema overhead accumulates
- Monitor token usage per session to identify which MCP schemas are loaded but unused
- Consider a CLI wrapper for tools you use frequently in tight loops (compile → test → fix cycles)
+- Use CLI tools for high-frequency tight loops where any overhead compounds (compile → test → fix)
+- Monitor token usage per session to identify which schemas are being loaded at invocation time
+- Consider a CLI wrapper for tools you use constantly but don't need structured output from

 ---

--- a/guide/roles/learning-with-ai.md
+++ b/guide/roles/learning-with-ai.md
@ -28,11 +28,12 @@ tags: [guide, workflows]
 8. [Embracing AI Tools (Pattern: Avoidant)](#embracing-ai-tools)
 9. [Optimizing Your Flow (Pattern: Augmented)](#optimizing-your-flow)
 10. [Case Study: Hybrid Learning Principles](#case-study-hybrid-learning-principles)
-11. [30-Day Progression Plan](#30-day-progression-plan)
-12. [For Tech Leads & Engineering Managers](#for-tech-leads--engineering-managers)
-13. [Red Flags Checklist](#red-flags-checklist)
-14. [Sources & Research](#sources--research)
-15. [See Also](#see-also)
+11. [Where Are You on the Agent Adoption Curve?](#where-are-you-on-the-agent-adoption-curve)
+12. [30-Day Progression Plan](#30-day-progression-plan)
+13. [For Tech Leads & Engineering Managers](#for-tech-leads--engineering-managers)
+14. [Red Flags Checklist](#red-flags-checklist)
+15. [Sources & Research](#sources--research)
+16. [See Also](#see-also)

 ---

@ -819,6 +820,35 @@ The combination of **human accountability + AI practice** beats either alone. Th

 ---

+## Where Are You on the Agent Adoption Curve?
+
+> **Audience**: Developers already using Claude Code who want to gauge their current sophistication — not beginners starting from scratch (use the 30-Day Plan below for that).
+
+Before picking a learning path, locate yourself. Nicolas Martignole (Principal Engineer at Back Market) proposed a 6-level maturity scale in March 2026 that maps well onto practical Claude Code usage. The levels below are adapted from his framework, with the upper half (3-5) being where most of this guide's content lives.
+
+| Level | Profile | Signal |
+|-------|---------|--------|
+| **0** | Never used AI dev tools | Using chatbots at most, nothing integrated in workflow |
+| **1** | Editor autocomplete | Cursor, Copilot, Windsurf — but no agent-level usage |
+| **2** | External LLM, copy-paste | ChatGPT or Claude in browser, pasting code manually into editor |
+| **3** | Claude Code basic user | Running Plan mode, simple prompts, reviewing everything manually |
+| **4** | Stage delegator | Handing off full development stages (research, architecture, implementation, tests) — writing less than 10% of code manually |
+| **5** | Context engineer | Designing CLAUDE.md, sub-agents, custom skills, MCP servers — building the environment for agents to operate in |
+| **6** | Orchestrator | Coordinating agent graphs, reinforcement loops, distributed agent systems |
+
+**Quick self-placement questions:**
+
+- Can you leave Claude Code running on a feature branch for 20+ minutes without checking in? → Level 4+
+- Do you write CLAUDE.md before starting a project, not after? → Level 5
+- Have you built a custom agent or hook in the last month? → Level 5-6
+- Is your primary output prompts and system design, not code? → Level 6
+
+If you landed at Level 3 or below: the 30-Day Plan below is the right path. If you're at Level 4-6: skip to [Context Engineering](../core/context-engineering.md), [Agent Patterns](../../examples/agents/), or [MCP Ecosystem](../ecosystem/mcp-servers-ecosystem.md).
+
+> Source: Nicolas Martignole, ["Découvrir les niveaux de maturité de l'adoption des coding agents"](https://www.touilleur-express.fr/2026/03/17/decouvrir-les-niveaux-de-maturite-de-ladoption-des-coding-agents), Le Touilleur Express, March 2026. Adapted and extended.
+
+---
+
 ## 30-Day Progression Plan

 A concrete path from wherever you are to augmented developer.
--- a/guide/ultimate-guide.md
+++ b/guide/ultimate-guide.md
@ -16,7 +16,7 @@ tags: [guide, reference, workflows, agents, hooks, mcp, security]

 **Last updated**: January 2026

-**Version**: 3.37.0
+**Version**: 3.37.1

 ---

@ -5166,7 +5166,7 @@ The `.claude/` folder is your project's Claude Code directory for memory, settin
 | Personal preferences | `CLAUDE.md` | ❌ Gitignore |
 | Personal permissions | `settings.local.json` | ❌ Gitignore |

-### 3.37.0 Version Control & Backup
+### 3.37.1 Version Control & Backup

 **Problem**: Without version control, losing your Claude Code configuration means hours of manual reconfiguration across agents, skills, hooks, and MCP servers.

@ -23482,4 +23482,4 @@ We'll evaluate and add it to this section if it meets quality criteria.

 **Contributions**: Issues and PRs welcome.

-**Last updated**: January 2026 | **Version**: 3.37.0
+**Last updated**: January 2026 | **Version**: 3.37.1