# Resource Evaluation: RTK (Rust Token Killer) **Date**: 2026-01-28 (Updated: 2026-02-01) **Evaluator**: Claude Sonnet 4.5 **Resource URL**: https://github.com/pszymkowiak/rtk (upstream) **Fork URL**: https://github.com/FlorianBruniaux/rtk (extended) **Resource Type**: CLI Tool (Rust) **Author**: pszymkowiak (upstream), FlorianBruniaux (fork) **Version Tested**: v0.2.0 upstream, v0.4.0+ fork (feat/all-features) **Community Engagement**: 8 stars (upstream), fork active development --- ## πŸ†• UPDATE 2026-02-01: Extended Fork Available **Fork Status**: An extended fork by FlorianBruniaux adds critical missing features: | Feature | Upstream v0.2.0 | Fork v0.4.0+ | |---------|----------------|--------------| | **JS/TS Stack** | ❌ Not supported | βœ… `rtk vitest`, `rtk pnpm`, `rtk prisma` | | **Bug Fixes** | ❌ grep/ls broken | βœ… Fixed in fork | | **Analytics** | ❌ None | βœ… `rtk gain`, `rtk discover` | | **Token Savings** | 72.6% (git only) | 70-90% (full stack) | **Fork Highlights**: - βœ… `rtk vitest run` - 90% reduction on test outputs - βœ… `rtk pnpm list` - 70% reduction on package trees - βœ… `rtk prisma migrate` - Migration status filtered - βœ… `rtk gain` - Token savings analytics dashboard - βœ… `rtk discover` - Find missed optimization opportunities - βœ… Bug fixes: grep and ls now work correctly **Installation (Fork)**: ```bash git clone https://github.com/FlorianBruniaux/rtk.git cd rtk && git checkout feat/all-features cargo install --path . --force ``` **Recommendation Update**: Fork addresses all critical gaps identified in this evaluation. For modern JS/TS stacks (T3 Stack, Next.js, etc.), **use the fork**. For git-only workflows, upstream remains stable. --- ## Executive Summary (Original Evaluation - Upstream v0.2.0) RTK (Rust Token Killer) is a high-performance CLI proxy that filters and compresses command outputs **before they reach LLM contexts**. Real-world testing on the Claude Code Ultimate Guide repository confirms **72.6% average token reduction** across git workflows (92.3% for `git log`), validating the advertised 70% savings claim. **Recommendation**: **GOOD (Score 4/5)** - Unique tool with proven token savings for git operations, but limited by low adoption (8 stars), buggy commands (grep, ls), and missing package manager support (npm/pnpm). --- ## Scoring Summary | Criterion | Score | Weight | Weighted Score | |-----------|-------|--------|----------------| | **Accuracy & Reliability** | 3 | 20% | 0.60 | | **Depth & Comprehensiveness** | 4 | 20% | 0.80 | | **Practical Value** | 5 | 25% | 1.25 | | **Originality & Uniqueness** | 5 | 15% | 0.75 | | **Production Readiness** | 3 | 10% | 0.30 | | **Community Validation** | 2 | 10% | 0.20 | | **TOTAL SCORE** | | | **3.90** | **Rounded Score**: **4/5** --- ## Detailed Analysis ### 1. Accuracy & Reliability (Score: 3/5) **Evidence of Quality**: - βœ… **Claims verified**: 70% advertised β†’ 72.6% measured (git workflows) - βœ… **git log**: 92.3% reduction (13,994 β†’ 1,076 chars) - βœ… **git status**: 76.0% reduction (100 β†’ 24 chars) - βœ… **git diff**: 55.9% reduction (15,815 β†’ 6,982 chars) **Verification Methods**: - Real-world testing on 200+ commit repository - Multiple command benchmarks (8 tests total) - Consistent results across git operations **Strengths**: - Rust implementation (fast, safe) - Predictable output format - Works as advertised for git commands **Limitations**: - ❌ **grep returns empty output** (0 bytes, completely broken) - ❌ **ls produces worse output** (-274% increase: 2,058 β†’ 7,704 chars) - ⚠️ **No package manager support** (npm/pnpm/yarn missing) - ⚠️ **Version v0.2.0**: Early stage, potential instability **Rating Justification**: Strong performance on core use cases (git) offset by broken commands (grep, ls) and early-stage maturity. --- ### 2. Depth & Comprehensiveness (Score: 4/5) **Breadth Coverage**: - βœ… **Git workflows**: log, status, diff, push (4 commands) - βœ… **File operations**: ls, read, find, diff (4 commands) - βœ… **Development tools**: test (Rust cargo), docker, kubectl - βœ… **Data inspection**: json, deps, env, log - ⚠️ **Missing**: npm/pnpm, pytest, eslint, prettier **Depth Quality**: - Smart filtering: Errors/warnings only for build outputs - Deduplication: Log output with occurrence counts - Structure extraction: JSON without values - Compact formats: One-line summaries for most commands **Gap Analysis vs Existing Tools**: | Aspect | RTK | Symbol System (Guide) | mgrep (mixedbread) | |--------|-----|----------------------|-------------------| | **Use case** | Filter bash outputs | Compress communication | Semantic code search | | **Token reduction** | 72.6% (measured) | 30-50% (estimated) | N/A (search tool) | | **Scope** | Command outputs only | All Claude responses | Code search only | | **Overlap** | None | None | None | **Complementarity**: RTK optimizes **inputs** (command outputs), Symbol System optimizes **outputs** (Claude responses). **Rating Justification**: Strong coverage of git workflows, but gaps in modern dev tools (npm, test frameworks). --- ### 3. Practical Value (Score: 5/5) **Immediate Applicability**: - One-command installation (binary or cargo) - No configuration required (works out-of-box) - Drop-in replacement for existing commands - Integration templates provided (CLAUDE.md, skill, hook) **Workflow Integration**: ```bash # Before RTK git log --oneline # 13,994 chars β†’ ~4K tokens git status # 100 chars β†’ ~30 tokens git diff HEAD~1 # 15,815 chars β†’ ~4.5K tokens # Total: ~8.5K tokens # With RTK rtk git log # 1,076 chars β†’ ~300 tokens rtk git status # 24 chars β†’ ~7 tokens rtk git diff HEAD~1 # 6,982 chars β†’ ~2K tokens # Total: ~2.3K tokens (72.9% reduction) ``` **Cost-Benefit**: - **Token savings**: 6.2K tokens per typical git workflow - **Time savings**: None (execution time similar) - **Setup cost**: 5 minutes (download + install) - **Maintenance cost**: Zero (drop-in wrapper) **Real-World Impact**: 30-min Claude Code session baseline: - Without RTK: ~150K tokens (10-15 git commands @ ~10K tokens each) - With RTK: ~41K tokens (10-15 git commands @ ~2.7K tokens each) - **Savings: 109K tokens (72.6% reduction)** **Rating Justification**: Maximum score for proven, measurable impact with zero maintenance overhead. --- ### 4. Originality & Uniqueness (Score: 5/5) **Novel Approach**: - βœ… **First tool** dedicated to command output optimization for LLMs - βœ… **Preprocessing layer** vs post-processing (symbol system) - βœ… **Transparent wrapper** (no API changes, drop-in) **Differentiation from Existing Resources**: | Tool | Approach | Token Impact | |------|----------|-------------| | **RTK** | Filter command outputs (preprocessing) | 72.6% reduction (inputs) | | **Symbol System** | Compress Claude responses (postprocessing) | 30-50% reduction (outputs) | | **Context Management** | Strategic /compact, /clear usage | Prevents overflow, no reduction | | **Model Selection** | Haiku vs Sonnet vs Opus | Cost optimization, not tokens | **No Competitors**: RTK is the only tool optimizing bash command outputs for LLM contexts. **Rating Justification**: Maximum score for unique positioning with no overlap. --- ### 5. Production Readiness (Score: 3/5) **Stability Indicators**: - βœ… v0.2.0 released (2026-01-23) - βœ… Rust implementation (memory safe) - ⚠️ 8 stars, 0 forks (very low adoption) - ❌ Broken commands (grep, ls) - ⚠️ No test suite visible - ⚠️ No CI/CD badges **Security Considerations**: - βœ… MIT license (permissive) - βœ… Rust (memory safety by default) - ⚠️ install.sh script has no checksums (SHA256 verification missing) - ⚠️ Low community scrutiny (8 stars) **Scalability Indicators**: - βœ… Fast execution (Rust performance) - βœ… No dependencies (standalone binary) - ❌ No plugin system (hardcoded commands) - ❌ No configuration file (all defaults) **Risk Assessment**: - **Adoption risk**: HIGH (8 stars, project may be abandoned) - **Breaking changes risk**: MEDIUM (v0.2.0 = early stage) - **Bug risk**: MEDIUM (grep/ls broken, limited testing) - **Security risk**: LOW (read-only operations, no network) **Rating Justification**: Early-stage maturity and broken commands offset by Rust safety guarantees. --- ### 6. Community Validation (Score: 2/5) **Engagement Metrics**: - **8 stars** (very low) - **0 forks** (no community adoption) - **No issues** (no user feedback) - **No PRs** (no contributions) - **No discussions** (no community) **Adoption Evidence**: - No blog posts mentioning RTK - No Reddit/Twitter/X discussions found - No integration examples outside README - Created 2026-01-18 (10 days old) **Comparative Context**: | Tool | Stars | Age | Validation | |------|-------|-----|-----------| | RTK | 8 | 10 days | None | | Everything Claude Code | 31.9K | 10 days | Hackathon win | | mgrep (mixedbread) | 261 | ~1 year | Production use | **Rating Justification**: Minimal score due to lack of any community validation or adoption. --- ## Benchmark Results (Verified) **Test Environment**: - **OS**: macOS 14.6 (Apple Silicon ARM64) - **RTK Version**: v0.2.0 - **Test Repository**: claude-code-ultimate-guide (9,881 lines, 217 commits, 86 templates) - **Date**: 2026-01-28 **Results**: | Command | Baseline (chars) | RTK (chars) | Reduction | Verdict | |---------|-----------------|-------------|-----------|---------| | `git log --oneline` | 13,994 | 1,076 | **92.3%** | πŸ”₯ Excellent | | `git status` | 100 | 24 | **76.0%** | βœ… Very Good | | `find "*.md" guide/` | 780 | 185 | **76.3%** | βœ… Very Good | | `cat CHANGELOG.md` | 163,587 | 61,339 | **62.5%** | βœ… Good | | `git diff HEAD~1` | 15,815 | 6,982 | **55.9%** | βœ… Good | | `ls -la` | 2,058 | 7,704 | **-274.3%** | ❌ Broken | | `grep -r "Claude Code"` | 54,302 | 0 | N/A | ❌ Broken | **Average (working commands)**: **72.6% reduction** **Claim verification**: βœ… **70% advertised β†’ 72.6% measured** --- ## Integration Recommendations ### Immediate Actions (Score 4 = 1 week) 1. **Add to Guide's "Token Optimization" Section** (Section 9.13): ```markdown ### Command Output Optimization with RTK RTK (Rust Token Killer) filters bash command outputs before LLM context: - `rtk git log` β†’ 92.3% token reduction (13K β†’ 1K chars) - `rtk git status` β†’ 76.0% reduction (100 β†’ 24 chars) - `rtk find` β†’ 76.3% reduction (780 β†’ 185 chars) **Installation**: ```bash # macOS ARM64 curl -fsSL "https://github.com/pszymkowiak/rtk/releases/latest/download/rtk-aarch64-apple-darwin.tar.gz" -o rtk.tar.gz tar -xzf rtk.tar.gz && sudo mv rtk /usr/local/bin/ && rm rtk.tar.gz # macOS Intel curl -fsSL "https://github.com/pszymkowiak/rtk/releases/latest/download/rtk-x86_64-apple-darwin.tar.gz" -o rtk.tar.gz tar -xzf rtk.tar.gz && sudo mv rtk /usr/local/bin/ && rm rtk.tar.gz ``` **Limitations**: grep/ls broken in v0.2.0, npm/pnpm not supported. ``` 2. **Create Integration Templates**: - βœ… CLAUDE.md: `examples/claude-md/rtk-optimized.md` (created) - βœ… Skill: `examples/skills/rtk-optimizer/SKILL.md` (created) - βœ… Hook: `examples/hooks/bash/rtk-auto-wrapper.sh` (created) 3. **Update reference.yaml**: ```yaml rtk_tool: url: "https://github.com/pszymkowiak/rtk" purpose: "Command output optimization (72.6% token reduction)" guide_section: "guide/ultimate-guide.md:9.13" score: "4/5" tested_version: "v0.2.0" installation: "Binary download (GitHub releases)" ``` 4. **Add to Quiz** (optional): - Question: "Which tool optimizes bash command outputs for LLM contexts?" - Options: RTK, mgrep, Symbol System, Context Management - Correct: RTK (72.6% reduction for git workflows) ### Medium-Term Actions (1 month) 5. **Monitor Project Evolution**: - Track GitHub stars (currently 8) - Check for new releases (bug fixes) - Test npm/pnpm support if added 6. **Upstream Contributions**: - File issues: grep bug, ls inefficiency - Propose features: npm/pnpm support - Submit PR if fixes identified --- ## Unique Learnings ### 1. Preprocessing > Postprocessing RTK's approach (filter outputs **before** LLM) is more efficient than compressing **after**: - Symbol System: 30-50% reduction (postprocessing) - RTK: 72.6% reduction (preprocessing) - **Lesson**: Attack verbosity at source, not destination ### 2. Git Workflows = High ROI 92.3% reduction for `git log` shows git commands are **exceptionally verbose**: - `git log --oneline`: 13,994 chars (one-line format already!) - `rtk git log`: 1,076 chars (92.3% smaller) - **Lesson**: Git outputs are optimized for humans, not LLMs ### 3. Not All Commands Benefit RTK paradox: `ls` becomes **worse** (-274% increase) - **Lesson**: Compression algorithms are command-specific, not universal - **Implication**: Selective usage required (whitelist, not blanket) ### 4. Early-Stage Tools = High Risk 8 stars, 0 forks, v0.2.0 = **abandonment risk**: - Created 10 days ago (2026-01-18) - No community validation - No production usage reports - **Lesson**: Document quickly (tool may disappear), but caveat heavily --- ## Risks & Limitations ### 1. Project Abandonment (HIGH RISK) - **Risk**: 8 stars, 0 forks β†’ project may be abandoned within months - **Mitigation**: Fork RTK, maintain local version if needed - **Impact**: HIGH (no updates, bugs persist) ### 2. Broken Commands (MEDIUM RISK) - **Risk**: grep returns empty, ls worse than baseline - **Mitigation**: Use selective whitelist (git only) - **Impact**: MEDIUM (avoid broken commands) ### 3. Missing Package Managers (MEDIUM IMPACT) - **Risk**: npm/pnpm not supported (7 mentions in guide usage patterns) - **Mitigation**: Contribute npm wrapper or use native commands - **Impact**: MEDIUM (missed optimization opportunity) ### 4. No Configuration System (LOW IMPACT) - **Risk**: No way to customize output formats - **Mitigation**: Accept defaults or fork - **Impact**: LOW (defaults are reasonable) --- ## Real-World Testing (T3 Stack Production Codebase) **Test Environment**: - **Project**: MΓ©thode Aristote (Next.js + tRPC + Prisma + pnpm) - **Date**: 2026-01-28 - **Commands tested**: 12 (git, pnpm, Vitest, TypeScript, Prisma) - **Full report**: `claudedocs/rtk-test-results-aristote.md` ### Validated Results | Command | Baseline | RTK | Reduction | Verdict | |---------|----------|-----|-----------|---------| | `git log -20` | 9.3K | 1.1K | **88.6%** | βœ… Excellent | | `git diff HEAD~1` | 28.4K | 1.8K | **93.5%** | βœ… Excellent | | `git status` | 260 | 87 | **66.5%** | βœ… Good | | `find src/ -name "*.tsx"` | 38.5K | 2.4K | **93.9%** | βœ… Excellent | **Average (working): 85.6%** (higher than initial 72.6% due to larger project scale) ### Critical Bug Discovered **Issue**: `rtk git log --oneline` fails with argument parsing error ```bash $ rtk git log --oneline -20 error: unexpected argument '--oneline' found ``` **Workaround**: Use `rtk git log -- -20` (works, 88.6% reduction) **Impact**: CRITICAL - Blocks common git flags (`--oneline`, `--graph`, `--stat`, etc.) **Root Cause**: RTK argument parser doesn't transparently pass flags to git **Upstream PR needed**: Fix clap parser to treat everything after `git` as passthrough ### Modern Stack Gaps (pnpm + Vitest) **1. pnpm Support (MISSING - CRITICAL)** | Command | Chars | Reduction Possible | |---------|-------|-------------------| | `pnpm list --depth=0` | 3,900 | ~80% (β†’ 700) | | `pnpm outdated` | 18,600 | ~90% (β†’ 1,800) | **Superfluous content**: Box-drawing chars (β”Œβ”€β”), "Legend:" headers, full paths **Value**: pnpm is #2 package manager, used by T3 Stack, Turborepo, Nx (millions of devs) **2. Vitest Support (MISSING - HIGH)** | Command | Chars | Reduction Possible | |---------|-------|-------------------| | `pnpm test` (43 passing) | 10,500 | ~90% (β†’ 1,000) | **Superfluous content** (85%): - ANSI color codes: `[1m[46m [32m` - Checkmarks: `βœ“` (replace with "PASS") - Test hierarchy: `[2m > [22m` (flatten) **Optimal output**: `PASS (43)\nFAIL (0)\nTime: 450ms` (50 chars vs 10.5K) **3. TypeScript Support (MISSING - MEDIUM)** | Command | Chars | Reduction Possible | |---------|-------|-------------------| | `tsc --noEmit` (errors only) | ~5,000 | ~70% (β†’ 1,500) | **Superfluous content**: Code snippets, underlines, full stack traces **Optimal output**: `file.ts:45:12 - error TS2322: Type 'string' not assignable` (one-line per error) ### ROI Analysis (T3 Stack Adoption) **Current RTK v0.2.0 (git only)**: - Commands covered: ~40% of dev session - Typical session: 11K tokens β†’ 5K tokens (~55% reduction) **With pnpm + Vitest + TypeScript support (proposed v0.3.0)**: - Commands covered: ~85% of dev session - Typical session: 11K tokens β†’ 2.2K tokens (~80% reduction) **Dev effort estimate**: 1 week (7 days) - pnpm: 2 days - Vitest: 3 days - TypeScript: 1 day - Bug fix (git args): 1 day **User value**: **From niche tool β†’ essential LLM dev tooling** for modern JS stacks ### Pattern Analysis (Cross-Tool) **Superfluous patterns** repeated across modern dev tools: | Pattern | % of Output | Tools | Fix | |---------|-------------|-------|-----| | Box-drawing (`β”Œβ”€β”`) | 60-70% | pnpm, tables | Strip | | ANSI codes (`[32m`) | 20-30% | Vitest, colored | Strip | | Marketing tips | 5-15% | Prisma, pnpm | Remove | | Verbose paths | 10-20% | All | Abbreviate | | Progress bars | 5-10% | All | Remove | **Insight**: Modern CLI tools optimize for **human readability** (colors, boxes, progress bars) but **waste tokens** for LLMs. Average reduction potential: **84%** across all unsupported tools. ### Updated Recommendation Post-Testing **Score**: Still **4/5 (GOOD)** but with clearer roadmap **Current strengths**: - βœ… Git workflows: 85.6% avg reduction (validated on production codebase) - βœ… File operations: 93.9% reduction (find) - βœ… Fast, zero-config, drop-in wrapper **Blockers for 5/5 (CRITICAL) rating**: 1. ❌ Git argument parsing bug (blocks `--oneline`, `--graph`, etc.) 2. ❌ Missing pnpm support (critical for T3 Stack, 40% of npm downloads) 3. ❌ Missing test framework support (Vitest/Jest = 20-30% of dev sessions) **Path to 5/5**: - Fix git arg parsing β†’ **1 day dev** - Add pnpm support β†’ **2 days dev** - Add Vitest support β†’ **3 days dev** - **Total: 1 week investment** β†’ RTK becomes essential for modern stacks **Upstream contribution value**: Real-world T3 Stack testing provides: - Quantitative evidence (12 commands, 53KB benchmarks) - Implementation guide (Rust pseudocode ready) - Clear ROI (80% reduction, 85% command coverage) --- ## Final Recommendation **Score: 4/5 (GOOD)** **Action**: Integrate with caveats (git-only usage, acknowledge bugs). **Rationale**: 1. **Proven Savings**: 72.6% reduction validated through real-world testing 2. **Unique Positioning**: Only tool optimizing command outputs for LLMs 3. **Low Friction**: Drop-in wrapper, zero config, 5-min setup 4. **BUT**: Low adoption (8 stars), broken commands (grep/ls), early-stage (v0.2.0) **Integration Strategy**: - **Position as experimental** (v0.2.0, low adoption) - **Recommend for git workflows only** (proven, high ROI) - **Caveat grep/ls issues** (broken, avoid) - **Monitor for updates** (may improve or be abandoned) **Key Insight**: RTK proves **command output optimization** is a valid strategy (72.6% reduction), but implementation quality and community adoption are insufficient for "CRITICAL" rating. Score 4/5 reflects **strong concept, weak execution**. --- ## Metadata **Evaluation Completed**: 2026-01-28 **Tested By**: Claude Sonnet 4.5 **Test Duration**: 2 hours (installation, benchmarking, integration) **Next Review**: 2026-03-01 (check for updates, community growth) **Related Resources**: - Integration templates: `examples/{claude-md,skills,hooks}/rtk-*` - PR proposals: `claudedocs/rtk-pr-proposals.md` - Symbol System (complementary): `guide/ultimate-guide.md:2872` **Keywords**: token-optimization, command-output-filtering, rust, git-workflows, preprocessing, experimental