If you’re using AI coding assistants like Claude Code, Cursor, or Gemini CLI, here’s a stat that should make you uncomfortable: up to 80% of the tokens your AI consumes come from raw shell command output, verbose git status messages, wall-of-text test results, and bloated file listings that the model has to parse through before it can actually help you.
That’s not a model problem. It’s a plumbing problem. And RTK (Rust Token Killer) fixes it.
What Is RTK?
RTK is a single Rust binary, zero dependencies, under 10ms overhead, that sits between your AI coding agent and the terminal. It intercepts shell command output and applies four intelligent compression strategies before the result reaches the LLM’s context window:
- Smart Filtering: strips ANSI codes, progress bars, comments, whitespace, and boilerplate
- Group Aggregation: merges similar items (files by directory, errors by type)
- Intelligent Truncation: keeps relevant context, cuts redundancy
- Deduplication: collapses repeated lines with counts
The result? 60-90% fewer tokens for the exact same information.
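RTK's actual filters live inside the Rust binary; purely as an illustration, here is a minimal Python sketch of two of the four strategies above (ANSI stripping and deduplication with counts):

```python
import re
from collections import Counter

ANSI_RE = re.compile(r"\x1b\[[0-9;]*m")  # basic SGR color/style sequences

def strip_ansi(text: str) -> str:
    """Remove ANSI color/style escape sequences."""
    return ANSI_RE.sub("", text)

def dedup_lines(text: str) -> str:
    """Collapse repeated lines into 'line (xN)' entries, keeping first-seen order."""
    counts = Counter()
    order = []
    for line in text.splitlines():
        if line not in counts:
            order.append(line)
        counts[line] += 1
    return "\n".join(
        f"{line} (x{counts[line]})" if counts[line] > 1 else line
        for line in order
    )

raw = "\x1b[32mok\x1b[0m\nwarning: unused import\nwarning: unused import\n"
print(dedup_lines(strip_ansi(raw)))
# prints:
# ok
# warning: unused import (x2)
```

Group aggregation and truncation follow the same shape: pure text-to-text transforms that keep the information while shedding the tokens.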
With 42,000+ GitHub stars and growing, RTK has become essential infrastructure in the agentic coding community.
Real Numbers: Before vs. After RTK
Here’s what RTK does to a typical 30-minute AI coding session:
| Command | Calls | Standard Tokens | RTK Tokens | Savings |
|---|---|---|---|---|
| ls / tree | 10x | 2,000 | 400 | -80% |
| cat / read | 20x | 40,000 | 12,000 | -70% |
| grep / rg | 8x | 16,000 | 3,200 | -80% |
| git status | 10x | 3,000 | 600 | -80% |
| git diff | 5x | 10,000 | 2,500 | -75% |
| git log | 5x | 2,500 | 500 | -80% |
| git add/commit/push | 8x | 1,600 | 120 | -92% |
| cargo test / npm test | 5x | 25,000 | 2,500 | -90% |
| pytest | 4x | 8,000 | 800 | -90% |
| Total | — | ~118,000 | ~23,900 | -80% |
That’s ~94,000 tokens saved in a single session. At API pricing, that translates directly to real dollars.
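The arithmetic is straightforward. The $/M rate below is a placeholder, not any provider's actual price, and the sessions-per-week figure is an assumption; substitute your own numbers:

```python
# Rough cost math for the savings table above.
standard_tokens = 118_000
rtk_tokens = 23_900
price_per_million = 3.00  # USD per 1M input tokens (hypothetical rate)

tokens_saved = standard_tokens - rtk_tokens
per_session = tokens_saved / 1_000_000 * price_per_million
print(f"{tokens_saved:,} tokens saved -> ${per_session:.2f} per session")
print(f"At 20 sessions/week: ${per_session * 20:.2f} per week")
```

Small per session, but it compounds quickly for CLI-heavy workflows.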
See It In Action: Real Command Output
Want to see the difference? Here’s a side-by-side comparison from an actual project.
Standard git status (8 lines, ~120 tokens)
```
On branch master
Your branch is up to date with 'origin/master'.

Changes not staged for commit:
  (use "git add <file>..." to update what will be committed)
  (use "git restore <file>..." to discard changes in working directory)
        modified:   src/data/blog/best-ai-dictation-apps-2026-voice-to-text.mdx

no changes added to commit (use "git add" and/or "git commit -a")
```
rtk git status (3 lines, ~25 tokens)
```
* master...origin/master
~ Modified: 1 files
src/data/blog/best-ai-dictation-apps-2026-voice-to-text.mdx
```
Same information. 80% fewer tokens. The model gets everything it needs — branch name, file status, file path — without the tutorial text about how to use git add.
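The transformation is easy to approximate. This Python sketch is not RTK's actual filter; it parses git's machine-oriented `git status --porcelain=v1 --branch` format (which is already terse) into the same compact shape:

```python
def compact_status(porcelain: str) -> str:
    """Summarize `git status --porcelain=v1 --branch` output in an RTK-like
    compact form. Illustrative sketch only: untracked/added/deleted entries
    are ignored here."""
    out = []
    modified = []
    for line in porcelain.splitlines():
        if line.startswith("## "):
            out.append("* " + line[3:])      # branch...upstream header
        elif line[:2].strip() in {"M", "MM", "AM"}:
            modified.append(line[3:])        # path after the XY status code
    if modified:
        out.append(f"~ Modified: {len(modified)} files")
        out.extend(modified)
    return "\n".join(out)

raw = "## master...origin/master\n M src/data/blog/post.mdx\n"
print(compact_status(raw))
# prints:
# * master...origin/master
# ~ Modified: 1 files
# src/data/blog/post.mdx
```

The hints git prints for humans simply never make it into the model's context.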
rtk gain — Track Your Savings
RTK tracks every command and shows cumulative statistics. Here’s actual data from a real development environment:
```
RTK Token Savings (Global Scope)
════════════════════════════════════════════════════════════
Total commands:   1,908
Input tokens:     20.2M
Output tokens:    1.3M
Tokens saved:     18.9M (93.4%)
Efficiency meter: ██████████████████████░░ 93.4%

By Command
────────────────────────────────────────────────────────────
 #  Command               Count    Saved   Avg%  Impact
────────────────────────────────────────────────────────────
 1. rtk read                307    12.8M  24.1%  ██████████
 2. rtk grep                 46     5.4M  19.0%  ████░░░░░░
 3. rtk ls                  842   265.2K  67.9%  ░░░░░░░░░░
 4. rtk tsc --noEmit         82   197.1K   6.8%  ░░░░░░░░░░
 5. rtk find                 50   113.8K  51.7%  ░░░░░░░░░░
 6. rtk playwright test       1    26.9K  99.8%  ░░░░░░░░░░
 7. rtk diff                  4    26.8K  99.8%  ░░░░░░░░░░
────────────────────────────────────────────────────────────
```
18.9 million tokens saved across 1,908 commands. That’s the kind of number that changes your monthly AI bill.
How It Works Under the Hood
RTK’s architecture is elegantly simple:
```
┌──────────────┐     ┌──────────────────┐     ┌──────────────┐
│   AI Agent   │─────│    Shell Hook    │─────│  RTK Binary  │
│ (Claude etc) │     │  (auto-rewrite)  │     │    (Rust)    │
└──────────────┘     └──────────────────┘     └──────┬───────┘
                                                     │
                                             ┌───────▼───────┐
                                             │  Native Tool  │
                                             │  (git, npm..) │
                                             └───────┬───────┘
                                                     │
                                             ┌───────▼───────┐
                                             │  Raw Output   │
                                             │ (~800 tokens) │
                                             └───────┬───────┘
                                                     │
                                             ┌───────▼───────┐
                                             │  RTK Filter   │
                                             │   Pipeline    │
                                             └───────┬───────┘
                                                     │
                                             ┌───────▼───────┐
                                             │  Compressed   │
                                             │  (~60 tokens) │
                                             └───────────────┘
```
The key insight: RTK intercepts at the tool execution layer via a Bash hook. The AI agent issues git status, the hook silently rewrites it to rtk git status, and the agent receives compressed output. The agent never sees the rewrite — it just gets cleaner data.
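The rewrite logic itself is conceptually tiny. A hypothetical sketch (the real hook is a shell hook, and RTK's supported-command list is far longer than this illustrative subset):

```python
RTK_COMMANDS = {"git", "ls", "find", "grep", "cat", "tree"}  # illustrative subset

def rewrite(command: str) -> str:
    """Prefix supported commands with `rtk` so their output gets compressed."""
    first = command.split(maxsplit=1)[0] if command.strip() else ""
    if first in RTK_COMMANDS and not command.startswith("rtk "):
        return "rtk " + command
    return command

print(rewrite("git status"))   # prints: rtk git status
print(rewrite("echo hello"))   # prints: echo hello
```

Because the rewrite happens below the agent's awareness, no prompt engineering or model-side changes are needed.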
Each command passes through an 8-stage TOML filter pipeline:
- Strip ANSI escape codes
- Regex-based substitutions
- `match_output` short-circuit (e.g., “nothing to commit” → 2-word response)
- `strip_lines` / `keep_lines` filtering
- Line truncation
- `head_lines` / `tail_lines` selection
- `max_lines` cap
- `on_empty` fallback
All of this happens in under 10 milliseconds.
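RTK's filters are defined per command in `~/.config/rtk/config.toml`. A hypothetical entry might look like this (the field names and schema here are illustrative, not RTK's documented format — check the project's docs for the real one):

```toml
# Hypothetical filter for `git status` — keys shown are illustrative only
[filters."git status"]
strip_ansi   = true
match_output = [{ pattern = "nothing to commit", reply = "clean tree" }]
strip_lines  = ['^\s*\(use "git .*\)$']   # drop the how-to hints
max_lines    = 40
on_empty     = "(no output)"
```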
Installation: 30 Seconds, Any Platform
Homebrew (Recommended)
```bash
brew install rtk
```
Quick Install (Linux / macOS)
```bash
curl -fsSL https://raw.githubusercontent.com/rtk-ai/rtk/refs/heads/master/install.sh | sh
```
Cargo (from Git — avoid name collision)
```bash
cargo install --git https://github.com/rtk-ai/rtk
```
Windows
Download pre-built binaries from GitHub Releases for Windows x64.
Name Collision Warning: Running `cargo install rtk` may install “Rust Type Kit” — a completely different project. Always use the Git URL or install via Homebrew.
Verify Installation
```bash
$ rtk --version
rtk 0.38.0

$ rtk gain
# Shows your token savings dashboard
```
Setup for Your AI Coding Tool
RTK supports 12+ AI coding environments. Run one command to activate:
| Agent | Command |
|---|---|
| Claude Code / Copilot | rtk init -g |
| Gemini CLI | rtk init -g --gemini |
| Codex (OpenAI) | rtk init -g --codex |
| Cursor | rtk init --agent cursor |
| Windsurf | rtk init --agent windsurf |
| Cline / Roo Code | rtk init --agent cline |
| Kilo Code | rtk init --agent kilocode |
| Antigravity | rtk init --agent antigravity |
The -g flag installs a global hook that transparently rewrites every shell command. After setup, you never think about it again.
100+ Supported Commands
RTK ships with optimized filters for over 100 commands across every major development category:
| Category | Commands |
|---|---|
| File Operations | ls, find, cat, head, tail, tree, wc |
| Git | status, diff, log, add, commit, push, branch |
| Test Runners | cargo test, pytest, jest, playwright, go test |
| Build Tools | cargo build, npm run build, tsc, make |
| Package Managers | npm install, pip install, cargo add, pnpm |
| Linters | ruff check, eslint, clippy |
| Cloud/DevOps | AWS CLI (25 subcmds), Docker, Kubernetes |
| Search | grep, rg (ripgrep), ag |
Each command has a custom TOML filter configuration that knows exactly what to keep and what to strip.
RTK vs. Caveman: Complementary, Not Competing
If you’ve read our Caveman deep-dive, you might wonder: do I need both?
Yes. They solve different problems:
| | RTK | Caveman |
|---|---|---|
| What it compresses | Shell command output (input to AI) | AI’s response text (output from AI) |
| How it works | CLI proxy, filters before context | Prompt injection, removes filler words |
| Token savings | 60-90% on CLI output | 22-87% on AI responses |
| Written in | Rust (single binary) | Plugin/extension (JS) |
| Stars | 42K+ | 52K+ |
RTK reduces what goes IN to the model. Caveman reduces what comes OUT. Using both together creates a full compression pipeline:
```
Raw CLI output (800 tokens)
  ── RTK compresses ──▶ 60 tokens enter context
  ── Model reasons and responds
  ── Caveman compresses ──▶ 35% fewer output tokens
```
The Honest Limitations
RTK isn’t magic, and the documentation is refreshingly transparent about its boundaries:
- Native tool bypass — Claude Code’s built-in `Read`, `Grep`, and `Glob` tools don’t pass through the Bash hook. If your workflow relies heavily on these native tools, RTK’s coverage is reduced.
- Over-compression risk — Sometimes RTK strips too much. If you need exact file permissions, timestamps, or full error stack traces, the compressed output may omit them.
- Shell-only scope — RTK only intercepts shell commands. It can’t compress file reads via IDE APIs, network requests, or model-to-model communication.
- Learning curve for advanced config — While the default filters work great out of the box, customizing `~/.config/rtk/config.toml` requires understanding the filter pipeline.
Why It Matters: The Context Window Is Your Bottleneck
Here’s the deeper argument for RTK: your AI agent’s biggest constraint isn’t intelligence — it’s attention.
Even with 200K context windows, filling them with verbose git log output and wall-of-text test results means the model has less “attention bandwidth” for your actual code. Research consistently shows that LLM reasoning quality degrades as context fills up with irrelevant data.
RTK doesn’t just save money; it makes your AI think better by keeping the context window clean and focused.
As Prahlad Menon (themenonlab.blog) puts it: “None of this is the LLM’s fault. It’s a plumbing problem: raw CLI output is built for humans to scan, not for LLMs to reason over. RTK fixes the plumbing.”
Who Should Use RTK?
- Any developer using AI coding assistants, especially on pay-per-token plans
- Claude Code Max / Pro users: stretch your rate-limited quota 3x further
- Cursor Pro users: each fast-request carries less context noise
- Teams sharing API budgets: an 80% reduction means 5x more work per dollar
- Anyone running CLI-heavy workflows (git, testing, building, deploying)
Who might skip it:
- Developers whose workflows are primarily IDE-native (few shell commands)
- Those who need verbatim, unfiltered command output for compliance/audit
My Before/After Numbers
I installed RTK mostly out of curiosity — one binary, one alias in my shell config, maybe two minutes total. Then I forgot about it, which is sort of the point.
A few weeks later I checked my API costs and noticed they’d dropped. Not dramatically — from about $14 to $9 for a comparable work week — but consistently. The savings come from exactly the kind of commands I run constantly: git status (RTK strips the hints and untracked file noise), ls (drops permission columns nobody reads), and npm test (keeps failures and the summary line, discards the pass-by-pass log). Individually these feel minor, but across 15 sessions a day they add up.
The part I didn’t expect: Claude seemed to give better answers after I started using RTK. With less noise in the input context, it spent less time parsing irrelevant information. I noticed fewer back-and-forth loops where Claude would say “let me re-read that output” — there was simply less junk to re-read. That accuracy improvement might actually matter more than the cost savings.
Six weeks in, I haven’t thought about RTK once since the initial setup. It just runs in the background, trimming output before it hits the context window. That’s exactly how a good developer tool should work.
Final Verdict
RTK solves a real, measurable problem with almost zero friction. A single binary, a 30-second setup, and your AI agent suddenly operates on 80% less input noise. The 42K+ GitHub stars aren’t hype; they reflect a tool that developers install once and forget about, because it just works.
Combined with Caveman for output compression, you’re looking at a full-stack token optimization pipeline that can cut your AI coding costs by 70-90%.
The question isn’t whether you should install RTK. The question is how many tokens you’ve already wasted by not having it.
Free & Open Source | GitHub Repository | Official Website
More from AI Tool Pick:
- Caveman: The 52K-Star Ecosystem That Slashes Your AI Coding Token Bill by 65%
- Cursor vs Claude Code vs Windsurf vs Copilot, 30-Day Comparison
- 10 Best AI Tools in 2026
- ChatGPT vs Claude vs Gemini: Detailed Head-to-Head
- Free Online AI Tools: Token Counter, Cost Calculator, Prompt Builder & more