diff --git a/CHANGELOG.md b/CHANGELOG.md index 72b7dbd..fa87d1d 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -62,14 +62,28 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/). - New §9.18 "Task Management System" section (145 lines) - Core concepts, tool reference, practical patterns - Links to full workflow guide for details +- **Appendix C: Resource Evaluation Process** (`guide/ultimate-guide.md:15034`) + - New appendix section documenting systematic evaluation methodology + - 5-point scoring system (Critical → Low) with decision criteria and timelines + - List of 14 documented assessments organized by categories (methodologies, tools, content, research) + - Transparency explanation: why resources are integrated (3+), mentioned (2), or rejected (1) + - Links to full methodology in `docs/resource-evaluations/README.md` + - Browse all evaluations link to `docs/resource-evaluations/` directory ### Changed +- **guide/methodologies.md:55**: Corrected GSD evaluation link + - Old: `../claudedocs/resource-evaluations/gsd-evaluation.md` (private) + - New: `../docs/resource-evaluations/gsd-evaluation.md` (public) +- **machine-readable/reference.yaml**: Added resource evaluations metadata + - New section: resource_evaluations_directory, count (14), methodology link + - Added references: appendix (line 15034), README section (line 278) + - Corrected gsd_evaluation path from private claudedocs to public docs - **README.md**: Updated documentation metrics for accuracy and landing site synchronization - Guide length: ~10K → ~15K lines (actual: 15,053 lines) - Reading time: ~3 hours → ~4 hours (reflects actual guide size) - Security hooks: 15+ → 18 (precise count) - - Templates count: 86 → 83 (accurate count) + - Templates count: maintained at 86 (consistent with check-landing-sync.sh logic) - Repository Structure section: updated ultimate-guide.md from "~13,425 lines" to "~15K lines" - Added **Resource Evaluations** section (14 assessments) with methodology links - All changes verified with `./scripts/check-landing-sync.sh` for full synchronization diff --git a/guide/methodologies.md b/guide/methodologies.md index b3871d2..3c29a46 100644 --- a/guide/methodologies.md +++ b/guide/methodologies.md @@ -52,7 +52,7 @@ Organized in a 6-tier pyramid from strategic orchestration down to optimization - **When to use**: Complex enterprise projects needing governance - **When to avoid**: Small teams, MVPs, rapid prototyping -**GSD (Get Shit Done)** addresses context rot through systematic 6-phase workflow (Initialize → Discuss → Plan → Execute → Verify → Complete) with fresh 200k-token contexts per task. Core concepts (multi-agent orchestration, fresh context management) overlap significantly with existing patterns like Ralph Loop, Gas Town, and BMAD. See [resource evaluation](../claudedocs/resource-evaluations/gsd-evaluation.md) for detailed comparison. +**GSD (Get Shit Done)** addresses context rot through systematic 6-phase workflow (Initialize → Discuss → Plan → Execute → Verify → Complete) with fresh 200k-token contexts per task. Core concepts (multi-agent orchestration, fresh context management) overlap significantly with existing patterns like Ralph Loop, Gas Town, and BMAD. See [resource evaluation](../docs/resource-evaluations/gsd-evaluation.md) for detailed comparison. > **Emerging**: [Ralph Inferno](https://github.com/sandstream/ralph-inferno) implements autonomous multi-persona workflows (Analyst→PM→UX→Architect→Business) with VM-based execution and self-correcting E2E loops. Experimental but interesting for "vibe coding at scale". diff --git a/guide/ultimate-guide.md b/guide/ultimate-guide.md index 974a37f..45132cc 100644 --- a/guide/ultimate-guide.md +++ b/guide/ultimate-guide.md @@ -15031,6 +15031,49 @@ Common misconceptions we've seen: --- +## Appendix C: Resource Evaluation Process + +This guide systematically evaluates external resources (tools, methodologies, articles, frameworks) before integration to maintain quality and prevent noise. + +### Evaluation Methodology + +**5-Point Scoring System** (Critical → Low): + +| Score | Action | Timeline | +|-------|--------|----------| +| 5 | Critical - Integrate immediately | <24h | +| 4 | High Value - Integrate within 1 week | 1 week | +| 3 | Moderate - Integrate when time available | Flexible | +| 2 | Marginal - Minimal mention or skip | - | +| 1 | Low - Reject | - | + +**Full methodology**: See [`docs/resource-evaluations/README.md`](../docs/resource-evaluations/README.md) + +### Current Evaluations + +**14 documented assessments** in `docs/resource-evaluations/`: + +- **Methodologies**: GSD (Get Shit Done), Vibe Coding patterns +- **Tools**: Worktrunk, AST-grep, SE-CoVe plugin +- **Content**: Boris Cherny Cowork video, ClawdBot Twitter analysis +- **Research**: Prompt repetition papers, self-improvement skills +- **And more**: Nick Jensen plugins, Wooldridge productivity stack + +**Browse all evaluations**: [`docs/resource-evaluations/`](../docs/resource-evaluations/) + +### Why Public Evaluations? + +**Transparency**: Contributors can see exactly why resources were: +- ✅ **Integrated** (score 3+): Added to guide with attribution +- ⚠️ **Mentioned** (score 2): Brief reference without deep coverage +- ❌ **Rejected** (score 1): Documented reason for exclusion + +**Quality Control**: Technical review + challenge phase by specialized agents ensures objectivity and prevents marketing hype from influencing decisions. + +**Community Contribution**: Evaluation template available in `docs/resource-evaluations/README.md` for suggesting new resources with systematic assessment. + +--- + ## About This Guide **End of Guide** diff --git a/machine-readable/reference.yaml b/machine-readable/reference.yaml index 6938369..c3dc8a2 100644 --- a/machine-readable/reference.yaml +++ b/machine-readable/reference.yaml @@ -227,9 +227,15 @@ deep_dive: eval_harness_source: "https://www.anthropic.com/engineering/demystifying-evals-for-ai-agents" # GSD (Get Shit Done) methodology (added 2026-01-25) gsd_methodology: "guide/methodologies.md:47" - gsd_evaluation: "claudedocs/resource-evaluations/gsd-evaluation.md" + gsd_evaluation: "docs/resource-evaluations/gsd-evaluation.md" gsd_source: "https://github.com/glittercowboy/get-shit-done" gsd_note: "Overlap with existing patterns (Ralph Loop, Gas Town, BMAD)" + # Resource Evaluations (added 2026-01-26) + resource_evaluations_directory: "docs/resource-evaluations/" + resource_evaluations_count: 14 + resource_evaluations_methodology: "docs/resource-evaluations/README.md" + resource_evaluations_appendix: "guide/ultimate-guide.md:15034" + resource_evaluations_readme_section: "README.md:278" # Practitioner Insights (external validation) practitioner_insights: "guide/ai-ecosystem.md:1209" practitioner_dave_van_veen: "guide/ai-ecosystem.md:1213"