docs: add resource evaluation process documentation

Added comprehensive documentation for resource evaluation methodology: - Appendix C in ultimate-guide.md (line 15034) * 5-point scoring system with decision criteria * 14 documented assessments by category * Transparency on integration decisions * Links to full methodology - Fixed methodologies.md GSD evaluation link * Corrected path: claudedocs → docs (public) - Updated reference.yaml with resource evaluations metadata * Directory, count (14), methodology references * Appendix and README section line numbers All resource-evaluations now properly documented and discoverable. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-26 15:10:24 +01:00 · 2026-01-26 15:10:24 +01:00 · 444ce5aa6a
commit 444ce5aa6a
parent 893c125fa7
4 changed files with 66 additions and 3 deletions
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@ -62,14 +62,28 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/).
  - New §9.18 "Task Management System" section (145 lines)
  - Core concepts, tool reference, practical patterns
  - Links to full workflow guide for details
+- **Appendix C: Resource Evaluation Process** (`guide/ultimate-guide.md:15034`)
+  - New appendix section documenting systematic evaluation methodology
+  - 5-point scoring system (Critical → Low) with decision criteria and timelines
+  - List of 14 documented assessments organized by categories (methodologies, tools, content, research)
+  - Transparency explanation: why resources are integrated (3+), mentioned (2), or rejected (1)
+  - Links to full methodology in `docs/resource-evaluations/README.md`
+  - Browse all evaluations link to `docs/resource-evaluations/` directory

 ### Changed

+- **guide/methodologies.md:55**: Corrected GSD evaluation link
+  - Old: `../claudedocs/resource-evaluations/gsd-evaluation.md` (private)
+  - New: `../docs/resource-evaluations/gsd-evaluation.md` (public)
+- **machine-readable/reference.yaml**: Added resource evaluations metadata
+  - New section: resource_evaluations_directory, count (14), methodology link
+  - Added references: appendix (line 15034), README section (line 278)
+  - Corrected gsd_evaluation path from private claudedocs to public docs
 - **README.md**: Updated documentation metrics for accuracy and landing site synchronization
  - Guide length: ~10K → ~15K lines (actual: 15,053 lines)
  - Reading time: ~3 hours → ~4 hours (reflects actual guide size)
  - Security hooks: 15+ → 18 (precise count)
-  - Templates count: 86 → 83 (accurate count)
+  - Templates count: maintained at 86 (consistent with check-landing-sync.sh logic)
  - Repository Structure section: updated ultimate-guide.md from "~13,425 lines" to "~15K lines"
  - Added **Resource Evaluations** section (14 assessments) with methodology links
  - All changes verified with `./scripts/check-landing-sync.sh` for full synchronization
--- a/guide/methodologies.md
+++ b/guide/methodologies.md
@ -52,7 +52,7 @@ Organized in a 6-tier pyramid from strategic orchestration down to optimization
 - **When to use**: Complex enterprise projects needing governance
 - **When to avoid**: Small teams, MVPs, rapid prototyping

-**GSD (Get Shit Done)** addresses context rot through systematic 6-phase workflow (Initialize → Discuss → Plan → Execute → Verify → Complete) with fresh 200k-token contexts per task. Core concepts (multi-agent orchestration, fresh context management) overlap significantly with existing patterns like Ralph Loop, Gas Town, and BMAD. See [resource evaluation](../claudedocs/resource-evaluations/gsd-evaluation.md) for detailed comparison.
+**GSD (Get Shit Done)** addresses context rot through systematic 6-phase workflow (Initialize → Discuss → Plan → Execute → Verify → Complete) with fresh 200k-token contexts per task. Core concepts (multi-agent orchestration, fresh context management) overlap significantly with existing patterns like Ralph Loop, Gas Town, and BMAD. See [resource evaluation](../docs/resource-evaluations/gsd-evaluation.md) for detailed comparison.

 > **Emerging**: [Ralph Inferno](https://github.com/sandstream/ralph-inferno) implements autonomous multi-persona workflows (Analyst→PM→UX→Architect→Business) with VM-based execution and self-correcting E2E loops. Experimental but interesting for "vibe coding at scale".

--- a/guide/ultimate-guide.md
+++ b/guide/ultimate-guide.md
@ -15031,6 +15031,49 @@ Common misconceptions we've seen:

 ---

+## Appendix C: Resource Evaluation Process
+
+This guide systematically evaluates external resources (tools, methodologies, articles, frameworks) before integration to maintain quality and prevent noise.
+
+### Evaluation Methodology
+
+**5-Point Scoring System** (Critical → Low):
+
+| Score | Action | Timeline |
+|-------|--------|----------|
+| 5 | Critical - Integrate immediately | <24h |
+| 4 | High Value - Integrate within 1 week | 1 week |
+| 3 | Moderate - Integrate when time available | Flexible |
+| 2 | Marginal - Minimal mention or skip | - |
+| 1 | Low - Reject | - |
+
+**Full methodology**: See [`docs/resource-evaluations/README.md`](../docs/resource-evaluations/README.md)
+
+### Current Evaluations
+
+**14 documented assessments** in `docs/resource-evaluations/`:
+
+- **Methodologies**: GSD (Get Shit Done), Vibe Coding patterns
+- **Tools**: Worktrunk, AST-grep, SE-CoVe plugin
+- **Content**: Boris Cherny Cowork video, ClawdBot Twitter analysis
+- **Research**: Prompt repetition papers, self-improvement skills
+- **And more**: Nick Jensen plugins, Wooldridge productivity stack
+
+**Browse all evaluations**: [`docs/resource-evaluations/`](../docs/resource-evaluations/)
+
+### Why Public Evaluations?
+
+**Transparency**: Contributors can see exactly why resources were:
+- ✅ **Integrated** (score 3+): Added to guide with attribution
+- ⚠️ **Mentioned** (score 2): Brief reference without deep coverage
+- ❌ **Rejected** (score 1): Documented reason for exclusion
+
+**Quality Control**: Technical review + challenge phase by specialized agents ensures objectivity and prevents marketing hype from influencing decisions.
+
+**Community Contribution**: Evaluation template available in `docs/resource-evaluations/README.md` for suggesting new resources with systematic assessment.
+
+---
+
 ## About This Guide

 **End of Guide**
--- a/machine-readable/reference.yaml
+++ b/machine-readable/reference.yaml
@ -227,9 +227,15 @@ deep_dive:
  eval_harness_source: "https://www.anthropic.com/engineering/demystifying-evals-for-ai-agents"
  # GSD (Get Shit Done) methodology (added 2026-01-25)
  gsd_methodology: "guide/methodologies.md:47"
-  gsd_evaluation: "claudedocs/resource-evaluations/gsd-evaluation.md"
+  gsd_evaluation: "docs/resource-evaluations/gsd-evaluation.md"
  gsd_source: "https://github.com/glittercowboy/get-shit-done"
  gsd_note: "Overlap with existing patterns (Ralph Loop, Gas Town, BMAD)"
+  # Resource Evaluations (added 2026-01-26)
+  resource_evaluations_directory: "docs/resource-evaluations/"
+  resource_evaluations_count: 14
+  resource_evaluations_methodology: "docs/resource-evaluations/README.md"
+  resource_evaluations_appendix: "guide/ultimate-guide.md:15034"
+  resource_evaluations_readme_section: "README.md:278"
  # Practitioner Insights (external validation)
  practitioner_insights: "guide/ai-ecosystem.md:1209"
  practitioner_dave_van_veen: "guide/ai-ecosystem.md:1213"