docs: integrate Gur Sannikov ADR workflow + native capabilities audit (4/5)

- Add ADR-Driven Development pattern to methodologies.md (~60 lines)
  - Pattern: ADR → skill → native execution
  - Example ADR template (database migration)
  - Complete bash workflow with benefits
- Add Native Capabilities Audit checklist to architecture.md (~50 lines)
  - 11 native capabilities with internal links
  - Onboarding tip for comprehension audit
- Add Dynamic Model Switching pattern to cheatsheet.md (~40 lines)
  - Pattern: Sonnet → Opus → Sonnet
  - Cost comparison table and best practices
- Add Community Validation to architecture.md (~15 lines)
  - External validation of 'less scaffolding, more model' approach
  - Cursor power user adopting Agent Skills standard
- Track evaluation in docs/resource-evaluations/ (full methodology)
- Update evaluations count: 24 → 55 (README + reference.yaml)
- Update CHANGELOG.md with integration details

Source: https://www.linkedin.com/posts/gursannikov_claudecode-embeddedengineering-aiagents-activity-7423851983331328001-DrFb
Score: 4/5 (HIGH VALUE) - fills ADR workflow gap + onboarding checklist

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
This commit is contained in:
Florian BRUNIAUX 2026-02-07 16:12:53 +01:00
parent b48d95c024
commit 4c0e4b6ac6
7 changed files with 389 additions and 6 deletions

View file

@ -0,0 +1,228 @@
# Evaluation: Gur Sannikov - Claude Code as Embedded OS
**Resource Type**: LinkedIn Post
**Author**: Gur Sannikov
**Date Published**: 2026-02-01
**Date Evaluated**: 2026-02-07
**Evaluator**: Claude (via /eval-resource skill)
**Source**: [LinkedIn Post](https://www.linkedin.com/posts/gursannikov_claudecode-embeddedengineering-aiagents-activity-7423851983331328001-DrFb)
---
## Executive Summary
LinkedIn post proposing "embedded OS" metaphor for Claude Code architecture, listing 11 native capabilities, and demonstrating ADR-driven development workflow for embedded engineering.
**Key Value**: ADR workflow pattern + native capabilities checklist + community validation of "less scaffolding, more model" philosophy.
---
## Scoring
| Criterion | Score | Notes |
|-----------|-------|-------|
| **Technical Accuracy** | 5/5 | All 11 capabilities verified in guide |
| **Novelty** | 4/5 | ADR workflow gap + checklist format new |
| **Actionability** | 5/5 | Complete ADR → skill → execute pattern |
| **Alignment** | 5/5 | 100% aligned with guide philosophy |
| **Pedagogical Value** | 4/5 | Embedded systems metaphor + checklist |
| **Community Validation** | 4/5 | Cursor user adopting Agent Skills standard |
**Overall Score**: **4/5 (HIGH VALUE)**
**Reason for Score**: ADR workflow addresses real methodological gap, capabilities checklist provides onboarding value, and community convergence validates guide's architectural approach.
---
## Content Summary
### Main Claims
1. **Architecture Metaphor**: Claude Code as "embedded OS" with hardware parallels (DMA transfers, boot sequences, interrupt handlers)
2. **11 Native Capabilities**: Hooks, skill-scoped hooks, background agents, /explore, /plan, Task Tool, agent swarm, per-task model selection, MCP protocol, permission modes, session memory
3. **ADR Workflow**: Write plain English ADRs → feed to `implement-adr` skill → native execution (no external frameworks)
4. **Community Convergence**: Claude Code, GitHub Copilot CLI, OpenCode converging on Agent Skills standard
5. **Embedded Engineering Use Case**: Firmware dev, simulation, code reviews, bus data capture
### Notable Context
- **Jacob Beningo** (embedded engineering) challenges "OS" term, prefers "embedded engineering assistant"
- **Gist reference**: https://gist.github.com/gsannikov/92cf8ca50407458b605756508a20fe18
- **Engagement**: 110 likes, 43 comments
---
## Gap Analysis
### Gaps Addressed by This Resource
| Topic | Status in Guide (Pre-Integration) | Gap Severity |
|-------|-----------------------------------|--------------|
| **ADR Workflow** | ⚠️ ADRs mentioned, workflow incomplete | **HIGH** |
| **Native Capabilities Checklist** | ❌ Not present (features documented individually) | **MEDIUM** |
| **Dynamic Model Switching** | ⚠️ `/model` documented, pattern under-developed | **MEDIUM** |
| **Community Validation** | ⚠️ Agent Skills documented, convergence not cited | **LOW** |
| **Embedded Engineering** | ❌ Not covered | **LOW** (niche) |
### What the Guide Already Covered
- ✅ All 11 capabilities documented individually (hooks, subagents, Task Tool, etc.)
- ✅ Architecture philosophy ("less scaffolding, more model")
- ✅ Agent teams (v2.1.32+)
- ✅ MCP protocol integration
---
## Integration Actions Taken
### 1. ADR-Driven Development → `guide/methodologies.md` ⭐ **HIGH PRIORITY**
**Location**: After "Multi-Agent Orchestration" (Tier 5)
**Content Added**:
- Pattern: ADR → skill → native execution
- Example ADR template (database migration)
- Complete bash workflow
- 4 benefits (documentation-driven, native, traceable, team alignment)
**Lines Added**: ~60 lines
**Impact**: Fills methodological gap, provides actionable workflow
### 2. Native Capabilities Audit → `guide/architecture.md` ⭐ **HIGH PRIORITY**
**Location**: After "Why This Design?" (Section 1)
**Content Added**:
- Checklist of 11 native capabilities
- Internal links to detailed sections
- Onboarding tip for beginners
**Lines Added**: ~50 lines
**Impact**: Onboarding tool, comprehension audit
### 3. Dynamic Model Switching → `guide/cheatsheet.md` 🟡 **MEDIUM PRIORITY**
**Location**: Under "Plan Mode & Thinking" section
**Content Added**:
- Pattern: Sonnet → Opus → Sonnet
- Bash workflow example (OAuth2 PKCE)
- Best practices (5 ✅ + 1 ❌)
- Cost comparison table
**Lines Added**: ~40 lines
**Impact**: Cost optimization pattern, workflow clarity
### 4. Community Validation → `guide/architecture.md` 🟢 **LOW PRIORITY**
**Location**: After "The Trade-offs" (Section 9 Philosophy)
**Content Added**:
- Note on community convergence (Agent Skills standard)
- Example: Gur Sannikov (embedded engineering, ex-Cursor user)
- Validation of "native capabilities first" approach
**Lines Added**: ~15 lines
**Impact**: External credibility, demonstrates adoption
---
## Fact-Check Results
### Verified Claims ✅
| Claim | Verification Method | Result |
|-------|---------------------|--------|
| **11 Native Capabilities** | Cross-reference with guide | ✅ All confirmed |
| **Per-task Model Selection** | `claude-code-releases.md:375` | ✅ "dynamic model selection" |
| **Agent Swarm** | `guide/workflows/agent-teams.md` | ✅ Experimental v2.1.32+ |
| **Background Agents** | Multiple guide files | ✅ Async task execution documented |
| **Hooks, /explore, /plan, Task Tool** | Guide sections 4.2, 5.10, etc. | ✅ All documented |
### Partially Verified ⚠️
| Claim | Status | Notes |
|-------|--------|-------|
| **Jacob Beningo Quote** | ⚠️ Unverified | Primary source (LinkedIn comment), plausible but not cross-checked |
| **GitHub Copilot "4 agents"** | ⚠️ Out of scope | External product, not verified |
| **OpenCode "70+ providers"** | ⚠️ Out of scope | External product, not verified |
### Unverifiable (Qualitative)
- **"80% coverage with 20% setup"** — Qualitative claim, not fact-checkable
**Conclusion**: Core technical claims verified. External product stats not critical for evaluation.
---
## Challenge Results (Technical-Writer Agent)
### Initial Score: 3/5 → **Adjusted: 4/5**
**Biases Detected**:
1. **Familiarity Bias** — "Already documented → not relevant" (missed checklist format value)
2. **Niche Bias** — "Embedded engineering → low priority" (ADR pattern is universal)
**Aspects Initially Missed**:
- Friction-free model switching (dynamic mid-session workflow)
- Background agents as "interrupt handlers" (pedagogical angle)
- Community validation of native-first approach (credibility)
**Justification for 4/5**:
- ADR workflow = **real methodological gap** (not just "another angle")
- Checklist = **independent pedagogical value** (onboarding tool)
- Community convergence = **external validation** (shows adoption)
**Risks of Non-Integration**:
- ADR workflow gap → users reinvent pattern
- Checklist missing → newcomers don't grasp full capability surface
- Validation lost → missed opportunity to cite external adoption
---
## Recommendation
**Action**: ✅ **INTEGRATED** (4/5 HIGH VALUE)
**Integration Effort**: 30-45 minutes, ~165 lines across 3 files
**Confidence**: **High** (technical claims verified, gaps confirmed, community validation credible)
**Maintenance**: Low (static content, links to existing guide sections)
---
## Sources
**Primary**:
- [LinkedIn Post](https://www.linkedin.com/posts/gursannikov_claudecode-embeddedengineering-aiagents-activity-7423851983331328001-DrFb) (2026-02-01)
- [Gist](https://gist.github.com/gsannikov/92cf8ca50407458b605756508a20fe18)
**Internal Verification**:
- `guide/architecture.md` (lines 40-800)
- `guide/workflows/agent-teams.md` (line 15992+)
- `guide/claude-code-releases.md` (line 375)
- `machine-readable/reference.yaml` (complete index)
**Challenge Agent**: technical-writer (aee871f)
---
## Files Modified
1. `guide/architecture.md` — 2 additions (Native Capabilities Audit + Community Validation)
2. `guide/methodologies.md` — 1 addition (ADR-Driven Development)
3. `guide/cheatsheet.md` — 1 addition (Dynamic Model Switching)
**Total**: 4 sections, ~165 lines, 3 files
---
## Lessons Learned
1. **Format Matters**: Even when content exists, presentation format (checklist, workflow) adds pedagogical value
2. **Universal Patterns**: Domain-specific resources (embedded) can contain universal patterns (ADR workflow)
3. **Community Validation**: External adoption (Cursor → Claude Code) validates guide's architectural choices
4. **Bias Detection**: Conservative scoring can miss strategic value (familiarity + niche bias)
---
**Evaluation Methodology**: [docs/resource-evaluations/README.md](./README.md)
**Guide Version**: 3.23.1
**Integration Status**: ✅ Complete (2026-02-07)