docs: add resource evaluation process documentation

Added comprehensive documentation for resource evaluation methodology:

- Appendix C in ultimate-guide.md (line 15034)
  * 5-point scoring system with decision criteria
  * 14 documented assessments by category
  * Transparency on integration decisions
  * Links to full methodology

- Fixed methodologies.md GSD evaluation link
  * Corrected path: claudedocs → docs (public)

- Updated reference.yaml with resource evaluations metadata
  * Directory, count (14), methodology references
  * Appendix and README section line numbers

All resource-evaluations now properly documented and discoverable.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
This commit is contained in:
Florian BRUNIAUX 2026-01-26 15:10:24 +01:00
parent 893c125fa7
commit 444ce5aa6a
4 changed files with 66 additions and 3 deletions

View file

@ -62,14 +62,28 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/).
- New §9.18 "Task Management System" section (145 lines)
- Core concepts, tool reference, practical patterns
- Links to full workflow guide for details
- **Appendix C: Resource Evaluation Process** (`guide/ultimate-guide.md:15034`)
- New appendix section documenting systematic evaluation methodology
- 5-point scoring system (Critical → Low) with decision criteria and timelines
- List of 14 documented assessments organized by categories (methodologies, tools, content, research)
- Transparency explanation: why resources are integrated (3+), mentioned (2), or rejected (1)
- Links to full methodology in `docs/resource-evaluations/README.md`
- Browse all evaluations link to `docs/resource-evaluations/` directory
### Changed
- **guide/methodologies.md:55**: Corrected GSD evaluation link
- Old: `../claudedocs/resource-evaluations/gsd-evaluation.md` (private)
- New: `../docs/resource-evaluations/gsd-evaluation.md` (public)
- **machine-readable/reference.yaml**: Added resource evaluations metadata
- New section: resource_evaluations_directory, count (14), methodology link
- Added references: appendix (line 15034), README section (line 278)
- Corrected gsd_evaluation path from private claudedocs to public docs
- **README.md**: Updated documentation metrics for accuracy and landing site synchronization
- Guide length: ~10K → ~15K lines (actual: 15,053 lines)
- Reading time: ~3 hours → ~4 hours (reflects actual guide size)
- Security hooks: 15+ → 18 (precise count)
- Templates count: 86 → 83 (accurate count)
- Templates count: maintained at 86 (consistent with check-landing-sync.sh logic)
- Repository Structure section: updated ultimate-guide.md from "~13,425 lines" to "~15K lines"
- Added **Resource Evaluations** section (14 assessments) with methodology links
- All changes verified with `./scripts/check-landing-sync.sh` for full synchronization

View file

@ -52,7 +52,7 @@ Organized in a 6-tier pyramid from strategic orchestration down to optimization
- **When to use**: Complex enterprise projects needing governance
- **When to avoid**: Small teams, MVPs, rapid prototyping
**GSD (Get Shit Done)** addresses context rot through systematic 6-phase workflow (Initialize → Discuss → Plan → Execute → Verify → Complete) with fresh 200k-token contexts per task. Core concepts (multi-agent orchestration, fresh context management) overlap significantly with existing patterns like Ralph Loop, Gas Town, and BMAD. See [resource evaluation](../claudedocs/resource-evaluations/gsd-evaluation.md) for detailed comparison.
**GSD (Get Shit Done)** addresses context rot through systematic 6-phase workflow (Initialize → Discuss → Plan → Execute → Verify → Complete) with fresh 200k-token contexts per task. Core concepts (multi-agent orchestration, fresh context management) overlap significantly with existing patterns like Ralph Loop, Gas Town, and BMAD. See [resource evaluation](../docs/resource-evaluations/gsd-evaluation.md) for detailed comparison.
> **Emerging**: [Ralph Inferno](https://github.com/sandstream/ralph-inferno) implements autonomous multi-persona workflows (Analyst→PM→UX→Architect→Business) with VM-based execution and self-correcting E2E loops. Experimental but interesting for "vibe coding at scale".

View file

@ -15031,6 +15031,49 @@ Common misconceptions we've seen:
---
## Appendix C: Resource Evaluation Process
This guide systematically evaluates external resources (tools, methodologies, articles, frameworks) before integration to maintain quality and prevent noise.
### Evaluation Methodology
**5-Point Scoring System** (Critical → Low):
| Score | Action | Timeline |
|-------|--------|----------|
| 5 | Critical - Integrate immediately | <24h |
| 4 | High Value - Integrate within 1 week | 1 week |
| 3 | Moderate - Integrate when time available | Flexible |
| 2 | Marginal - Minimal mention or skip | - |
| 1 | Low - Reject | - |
**Full methodology**: See [`docs/resource-evaluations/README.md`](../docs/resource-evaluations/README.md)
### Current Evaluations
**14 documented assessments** in `docs/resource-evaluations/`:
- **Methodologies**: GSD (Get Shit Done), Vibe Coding patterns
- **Tools**: Worktrunk, AST-grep, SE-CoVe plugin
- **Content**: Boris Cherny Cowork video, ClawdBot Twitter analysis
- **Research**: Prompt repetition papers, self-improvement skills
- **And more**: Nick Jensen plugins, Wooldridge productivity stack
**Browse all evaluations**: [`docs/resource-evaluations/`](../docs/resource-evaluations/)
### Why Public Evaluations?
**Transparency**: Contributors can see exactly why resources were:
- ✅ **Integrated** (score 3+): Added to guide with attribution
- ⚠️ **Mentioned** (score 2): Brief reference without deep coverage
- ❌ **Rejected** (score 1): Documented reason for exclusion
**Quality Control**: Technical review + challenge phase by specialized agents ensures objectivity and prevents marketing hype from influencing decisions.
**Community Contribution**: Evaluation template available in `docs/resource-evaluations/README.md` for suggesting new resources with systematic assessment.
---
## About This Guide
**End of Guide**

View file

@ -227,9 +227,15 @@ deep_dive:
eval_harness_source: "https://www.anthropic.com/engineering/demystifying-evals-for-ai-agents"
# GSD (Get Shit Done) methodology (added 2026-01-25)
gsd_methodology: "guide/methodologies.md:47"
gsd_evaluation: "claudedocs/resource-evaluations/gsd-evaluation.md"
gsd_evaluation: "docs/resource-evaluations/gsd-evaluation.md"
gsd_source: "https://github.com/glittercowboy/get-shit-done"
gsd_note: "Overlap with existing patterns (Ralph Loop, Gas Town, BMAD)"
# Resource Evaluations (added 2026-01-26)
resource_evaluations_directory: "docs/resource-evaluations/"
resource_evaluations_count: 14
resource_evaluations_methodology: "docs/resource-evaluations/README.md"
resource_evaluations_appendix: "guide/ultimate-guide.md:15034"
resource_evaluations_readme_section: "README.md:278"
# Practitioner Insights (external validation)
practitioner_insights: "guide/ai-ecosystem.md:1209"
practitioner_dave_van_veen: "guide/ai-ecosystem.md:1213"