release: v3.37.1 - threat-db v2.8.0, CC releases v2.1.78, 19 skills update, doc fixes

- threat-db v2.8.0: GhostClaw campaign, Fake OpenClaw Installer, CVE-2026-24910 (Bun),
  T017 Shadow MCP, T018 AI Search Poisoning, Jozu Agent Guard, MCP Sentinel
- Claude Code releases tracked to v2.1.78 (StopFailure hook, plugin state, security fixes)
- 19 skill descriptions improved (PR #9 selective merge, @popey/Tessl)
- MCP vs CLI token overhead corrected (lazy loading, 85% reduction benchmark)
- Agent Adoption Curve self-assessment (7-level maturity scale, Martignole framework)
- ctx7 CLI section §5.5 + resource evals #079 #080 #081

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
Florian BRUNIAUX 2026-03-18 15:49:01 +01:00
parent eea5662a65
commit 44818a3f04
19 changed files with 785 additions and 87 deletions

View file

@ -12,7 +12,7 @@ tags: [cheatsheet, reference]
**Written with**: Claude (Anthropic)
**Version**: 3.37.0 | **Last Updated**: March 2026
**Version**: 3.37.1 | **Last Updated**: March 2026
---
@ -639,4 +639,4 @@ Speed: `rg` (~20ms) → Serena (~100ms) → ast-grep (~200ms) → grepai (~500ms
**Author**: Florian BRUNIAUX | [@Méthode Aristote](https://methode-aristote.fr) | Written with Claude
*Last updated: March 2026 | Version 3.37.0*
*Last updated: March 2026 | Version 3.37.1*

View file

@ -2107,11 +2107,11 @@ For developers hitting Claude Code's subscription limits or needing model flexib
An **on-machine AI coding agent** developed by Block (formerly Square), released under Apache 2.0 license. Unlike Claude Code, Goose runs entirely locally and is **model-agnostic**—it can use Claude, GPT, Gemini, Groq, or any LLM provider.
| Metric | Value (Jan 2026) |
| Metric | Value (Mar 2026) |
|--------|------------------|
| **GitHub Stars** | 15,400+ |
| **Contributors** | 350+ |
| **Releases** | 100+ since Jan 2025 |
| **GitHub Stars** | 33,000+ |
| **Contributors** | 400+ |
| **Releases** | 175+ since Jan 2025 |
| **License** | Apache 2.0 (permissive) |
| **Primary Language** | Rust (64%) + TypeScript (26%) |
@ -2124,7 +2124,7 @@ An **on-machine AI coding agent** developed by Block (formerly Square), released
| **Cost Model** | Subscription ($20-$200/mo) | Free + your LLM API costs |
| **Rate Limits** | Anthropic's weekly/5-hour caps | Your LLM provider's limits |
| **Token Visibility** | Opaque (no per-prompt tracking) | Full transparency |
| **MCP Support** | Native (growing ecosystem) | 3,000+ MCP servers available |
| **MCP Support** | Native (growing ecosystem) | Thousands of MCP servers available |
| **Setup Complexity** | Simple (npm install) | Moderate (Rust toolchain, API keys) |
### When to Consider Goose
@ -2142,6 +2142,14 @@ An **on-machine AI coding agent** developed by Block (formerly Square), released
- You value Claude's specific reasoning capabilities and can't substitute
- You don't want to manage LLM API credentials
### Recipes: Goose's Equivalent of Skills + Commands
Goose has a workflow primitive called **Recipes** — versionable, shareable, parameterized multi-step workflows. Unlike Claude Code's skills (which define agent capabilities) or slash commands (which trigger one-shot actions), Recipes define complete execution sequences: what to do, in what order, with which model at each step. They can be shared as deeplinks, imported by teammates, and checked into source control. Closest Claude Code analogy: a skill that chains multiple commands in a defined sequence, with state carried between steps.
### Subagent Orchestration
Since mid-2025, Goose supports spawning specialized subagents within a workflow. A parent agent can delegate subtasks to subagents with different roles (Planner, Architect, Frontend Dev, Backend Dev), each potentially running a different LLM optimized for its task. This differs from Claude Code's Agent tool (which spawns a subagent with the same model) — Goose enables heterogeneous agent teams where model choice is per-role, not per-session. See §9 for Claude Code's native multi-agent patterns.
### Skill Portability
Both Claude Code and Goose support the [Agent Skills open standard](https://agentskills.io) (agentskills.io). Skills you create with SKILL.md are portable across 26+ platforms including Cursor, VS Code, GitHub, OpenAI Codex, and Gemini CLI. Claude Code-specific fields (`context`, `agent`) are ignored by other platforms but don't break compatibility.

View file

@ -40,7 +40,7 @@ This page compares two integration patterns for giving Claude Code access to ext
| Advantage | Detail |
|-----------|--------|
| **Zero context overhead** | No schema injected at startup — relevant when context budget is tight |
| **Zero context overhead** | No schema injected at startup. Since v2.1.7 lazy loading closes most of the gap, but CLI is still the absolute minimum. |
| **Deterministic actions** | Explicit commands with predictable output are easier to audit and test |
| **Human + AI use** | The same CLI wrapper works for a developer running it manually and for Claude |
| **Frontier models** | Claude Opus/Sonnet 4.6 can drive complex CLIs (aws-cli, glab, gh) without a structured schema |
@ -52,7 +52,7 @@ This page compares two integration patterns for giving Claude Code access to ext
| Weakness | Detail |
|----------|--------|
| **Schema token cost** | Every MCP server injects its full tool list into the context window at session start, whether or not those tools are used that session |
| **Schema token cost** | Since v2.1.7, lazy loading (MCP Tool Search) means unused tools inject only their name, not their full schema. Cost is still non-zero: tool names load at startup, full schemas load on first use. The pre-v2.1.7 worst case (~55K tokens for a 5-server setup) now averages ~8.7K — an 85% reduction, but not zero. |
| **Connection overhead** | Session startup takes longer with many MCP servers connected |
| **Debugging difficulty** | Failures inside an MCP server are harder to trace than a failed shell command |
| **Maintenance complexity** | Running, updating, and securing remote MCP servers adds infrastructure |
@ -112,7 +112,7 @@ Quick reference — not rules, but directional defaults.
| Individual dev, local machine | **CLI or skill** | Simpler, faster, no infrastructure |
| Deterministic actions (git, CI, deploy) | **CLI** | Explicit commands, predictable output, auditable |
| Complex auth (OAuth, token refresh) | **MCP** | Server handles auth; CLI would require credential plumbing |
| Tight context budget / many tools loaded | **CLI** | No schema injection at startup |
| Tight context budget / many tools loaded | **CLI** | Still the minimum-overhead option. Lazy loading (v2.1.7+) reduces MCP cost significantly, but CLI has zero schema cost by design. |
| Agent-to-agent structured output | **MCP** | JSON responses are more reliable than parsed CLI text |
| Debugging / prototyping a new integration | **CLI** | Easier to inspect, faster to iterate |
| Browser automation (non-frontier model) | **MCP** | Playwright MCP structures interaction reliably |
@ -138,18 +138,35 @@ The mistake is applying one answer to both layers. A solo developer building a C
## Token cost of MCP schemas — what the numbers look like
MCP servers inject their full tool list into the context at session start. This is not free.
Since v2.1.7 (January 2026), Claude Code uses **MCP Tool Search** (lazy loading) by default. This changes the token math significantly, but does not eliminate schema cost entirely.
A typical MCP server with 10-15 tools injects 500-2,000 tokens per session before any task starts. With 5 MCP servers connected, that is 2,500-10,000 tokens of overhead on every session, whether or not those tools are used.
**How lazy loading works:** instead of injecting all tool schemas at session start, Claude receives only tool names in an `<available-deferred-tools>` block. Full schemas are fetched via `ToolSearch` only when Claude decides to call a specific tool. Unused tools in a session cost only their name in context (~0 schema tokens), not the full definition.
The practical consequence: if you load 10 MCP servers but only use 2 in a given session, you are paying for 8 servers worth of schema every time. This compounds with long sessions and high-frequency workflows.
**Measured impact** (Anthropic benchmarks, 5-server setup):
**Mitigation strategies:**
| Scenario | Token overhead | Note |
|----------|---------------|------|
| Before v2.1.7 (eager loading) | ~55,000 tokens | All schemas preloaded |
| After v2.1.7 (lazy loading) | ~8,700 tokens | 85% reduction |
| CLI (no MCP) | ~0 tokens | Baseline |
The old worst-case claim of "500-2,000 tokens per server" described eager loading, which is no longer the default. With lazy loading, the cost per unused server is near zero. The cost per *used* server (~600 tokens per tool schema loaded on demand) remains real, but is now pay-per-use rather than always-on.
**What still adds overhead even with lazy loading:**
- Tool names are still injected at startup (one line per tool per server)
- Schemas load at first invocation — long sessions using many tools accumulate cost
- Connection setup per server is unchanged (latency, not tokens)
- Many connected MCP servers still means more names in context, even if schemas stay deferred
**Configuration** (v2.1.9+): the `ENABLE_TOOL_SEARCH` environment variable controls the threshold. `auto:N` triggers lazy loading when MCP tools exceed N% of context (default 10%).
**Mitigation strategies** (still relevant, lower urgency):
- Load MCP servers selectively per project (project-level config vs global config)
- Use CLI tools for high-frequency operations where schema overhead accumulates
- Monitor token usage per session to identify which MCP schemas are loaded but unused
- Consider a CLI wrapper for tools you use frequently in tight loops (compile → test → fix cycles)
- Use CLI tools for high-frequency tight loops where any overhead compounds (compile → test → fix)
- Monitor token usage per session to identify which schemas are being loaded at invocation time
- Consider a CLI wrapper for tools you use constantly but don't need structured output from
---

View file

@ -28,11 +28,12 @@ tags: [guide, workflows]
8. [Embracing AI Tools (Pattern: Avoidant)](#embracing-ai-tools)
9. [Optimizing Your Flow (Pattern: Augmented)](#optimizing-your-flow)
10. [Case Study: Hybrid Learning Principles](#case-study-hybrid-learning-principles)
11. [30-Day Progression Plan](#30-day-progression-plan)
12. [For Tech Leads & Engineering Managers](#for-tech-leads--engineering-managers)
13. [Red Flags Checklist](#red-flags-checklist)
14. [Sources & Research](#sources--research)
15. [See Also](#see-also)
11. [Where Are You on the Agent Adoption Curve?](#where-are-you-on-the-agent-adoption-curve)
12. [30-Day Progression Plan](#30-day-progression-plan)
13. [For Tech Leads & Engineering Managers](#for-tech-leads--engineering-managers)
14. [Red Flags Checklist](#red-flags-checklist)
15. [Sources & Research](#sources--research)
16. [See Also](#see-also)
---
@ -819,6 +820,35 @@ The combination of **human accountability + AI practice** beats either alone. Th
---
## Where Are You on the Agent Adoption Curve?
> **Audience**: Developers already using Claude Code who want to gauge their current sophistication — not beginners starting from scratch (use the 30-Day Plan below for that).
Before picking a learning path, locate yourself. Nicolas Martignole (Principal Engineer at Back Market) proposed a 6-level maturity scale in March 2026 that maps well onto practical Claude Code usage. The levels below are adapted from his framework, with the upper half (3-5) being where most of this guide's content lives.
| Level | Profile | Signal |
|-------|---------|--------|
| **0** | Never used AI dev tools | Using chatbots at most, nothing integrated in workflow |
| **1** | Editor autocomplete | Cursor, Copilot, Windsurf — but no agent-level usage |
| **2** | External LLM, copy-paste | ChatGPT or Claude in browser, pasting code manually into editor |
| **3** | Claude Code basic user | Running Plan mode, simple prompts, reviewing everything manually |
| **4** | Stage delegator | Handing off full development stages (research, architecture, implementation, tests) — writing less than 10% of code manually |
| **5** | Context engineer | Designing CLAUDE.md, sub-agents, custom skills, MCP servers — building the environment for agents to operate in |
| **6** | Orchestrator | Coordinating agent graphs, reinforcement loops, distributed agent systems |
**Quick self-placement questions:**
- Can you leave Claude Code running on a feature branch for 20+ minutes without checking in? → Level 4+
- Do you write CLAUDE.md before starting a project, not after? → Level 5
- Have you built a custom agent or hook in the last month? → Level 5-6
- Is your primary output prompts and system design, not code? → Level 6
If you landed at Level 3 or below: the 30-Day Plan below is the right path. If you're at Level 4-6: skip to [Context Engineering](../core/context-engineering.md), [Agent Patterns](../../examples/agents/), or [MCP Ecosystem](../ecosystem/mcp-servers-ecosystem.md).
> Source: Nicolas Martignole, ["Découvrir les niveaux de maturité de l'adoption des coding agents"](https://www.touilleur-express.fr/2026/03/17/decouvrir-les-niveaux-de-maturite-de-ladoption-des-coding-agents), Le Touilleur Express, March 2026. Adapted and extended.
---
## 30-Day Progression Plan
A concrete path from wherever you are to augmented developer.

View file

@ -16,7 +16,7 @@ tags: [guide, reference, workflows, agents, hooks, mcp, security]
**Last updated**: January 2026
**Version**: 3.37.0
**Version**: 3.37.1
---
@ -5166,7 +5166,7 @@ The `.claude/` folder is your project's Claude Code directory for memory, settin
| Personal preferences | `CLAUDE.md` | ❌ Gitignore |
| Personal permissions | `settings.local.json` | ❌ Gitignore |
### 3.37.0 Version Control & Backup
### 3.37.1 Version Control & Backup
**Problem**: Without version control, losing your Claude Code configuration means hours of manual reconfiguration across agents, skills, hooks, and MCP servers.
@ -23482,4 +23482,4 @@ We'll evaluate and add it to this section if it meets quality criteria.
**Contributions**: Issues and PRs welcome.
**Last updated**: January 2026 | **Version**: 3.37.0
**Last updated**: January 2026 | **Version**: 3.37.1