Commit graph

3 commits

Author SHA1 Message Date
Florian BRUNIAUX
d986e25065 docs: quiz audit complete - 100% pass rate achieved (256/256)
Updated quality dashboard to reflect perfect completion:
- Pass rate: 93.8% → 100% (+6.2%)
- All 15 categories at 100%
- 0 critical, 0 warnings, 0 info issues

Journey:
- Baseline: 90.2% (231/256)
- After fixes: 92.6% → 93.8% → 97.8% → 100%
- Total issues resolved: 25 (9 critical, 16 warnings, 3 info)

All questions verified accurate against ultimate-guide.md.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-04 18:32:04 +01:00
Florian BRUNIAUX
e6a8f1746f docs: add quiz quality dashboard
**Dashboard Overview:**
- Executive summary (93.8% pass rate, +3.6% improvement)
- Category performance (4 tiers: S/A/B/C/D)
- Issue breakdown by type (0 critical, 13 warnings, 3 info)
- Improvement roadmap (3 phases)
- Root cause analysis (40% guide context extraction issues)
- Historical trends & velocity tracking
- Best practices learned

**Key Insights:**
- Perfect categories: Q05, Q07, Q11, Q13 (100%)
- At-risk categories: Q09 (79.3%), Q10 (75.0%)
- Main issue: Guide context extraction (8/16 issues)
- Velocity: +3.6% in 1 day (9 fixes)

**Next Milestone:** 95%+ pass rate (12 fixes needed)

File: claudedocs/quiz-quality-dashboard.md (tracking quality over time)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-04 18:10:03 +01:00
Florian BRUNIAUX
69f493591e docs: add quiz audit report (6 critical issues found)
**Audit Results (256 questions):**
- Pass: 231 (90.2%)
- Issues: 25 (9.8%)
  - Critical: 6 (wrong answer/factual error)
  - Warning: 16 (ambiguous/outdated)
  - Info: 3 (minor wording)

**Critical issues fixed** (see landing repo commit 94bc3db):
- Q01-001: npm vs curl for universal install
- Q03-011: CLAUDE.md location confusion
- Q08-019: auto:N threshold misunderstanding
- Q09-003: --headless flag doesn't exist
- Q09-029: Boris Cherny attribution
- Q12-012: wrong sub-agent count

**Warnings to review** (Priority 2):
- 5 ambiguities (missing guide context)
- 7 factual accuracy issues (stats without sources)
- 2 outdated info (version changes)

**Healthiest categories:** Q05, Q07, Q11, Q13 (100% pass rate)
**Need attention:** Q09 (79.3%), Q10 (75.0%)

Audit system: extract-audit-context.py → generate-audit-batches.py → 16 parallel agents → generate-audit-report.py

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-04 17:20:35 +01:00