ecotokens saves real AI coding context

On one developer workstation, ecotokens recorded 19 928 hook executions between 2026-03-06 and 2026-05-27. The result: 94 282 087 tokens before filtering, 5 874 896 tokens after filtering, and 88 407 191 tokens saved. That is a 93.8% overall reduction across real shell commands and native tool results.

Real-world metric	Value
Hook executions measured	19 928
Tokens saved	88 407 191
Overall reduction	93.8%
Commands with savings	5 735 / 19 928, or 28.8%
Biggest command family	`grep`, with 55 383 168 tokens saved
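
The headline figures can be re-derived from the raw before/after counts. The sketch below is illustrative only: `savings` is a hypothetical helper written for this check, not a function from the ecotokens codebase.

```rust
/// Returns (tokens saved, percentage reduction) for a before/after pair.
/// `savings` is a hypothetical helper used only to check the arithmetic.
fn savings(before: u64, after: u64) -> (u64, f64) {
    let saved = before.saturating_sub(after);
    let pct = if before == 0 {
        0.0
    } else {
        saved as f64 / before as f64 * 100.0
    };
    (saved, pct)
}
```

With the totals above, `savings(94_282_087, 5_874_896)` gives 88 407 191 tokens saved and a reduction that rounds to 93.8%.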

Claude Code, Gemini CLI, Qwen Code, Pi, Hermes, and Codex can all dump massive command outputs and native tool results into your context window. ecotokens sits in front of those outputs, removes the noise, preserves the important bits, and records the before/after savings locally.

Built on a "set it and forget it!" philosophy: one install command, zero configuration, then automatic compression for shell commands, file reads, grep/search results, directory listings, and code-intelligence workflows.

Full methodology and per-family breakdown: docs/BENCHMARKS.md.

Features highlight

Feature	Details
PreToolUse hook	Intercepts every shell (`Bash`) command before its output reaches the model - filters, compresses, and records savings
PostToolUse hook	Intercepts native tool results (`Read`/`read_file`, `Grep`/`search_file_content`, `Glob`/`list_directory`) - outline-based compression for source files, grep trimming, glob denoising
Gain dashboard	Interactive TUI - token savings by command family or project, sparkline, diff view, history log
Multi-agent support	Works with Claude Code, Gemini CLI, Qwen Code, Pi, Hermes, and Codex out of the box
Precision guarantees	Errors, failures, and stack traces are never removed; secrets are redacted before filtering
Code intelligence	BM25 + vector search (Candle, zero-config), symbol lookup, call graph tracing, near-duplicate detection
MCP server	Exposes code-intelligence tools over stdio (`ecotokens mcp-server`) and auto-registers in agent settings on install
AI summarization (optional)	Large outputs compressed by a local Ollama model instead of being truncated
Word abbreviations (optional)	Replace common words with shorter forms (`function`→`fn`, `configuration`→`config`, …) in narrative text, and nudge the model to do the same via a SessionStart instruction
Zero config	One `ecotokens install` command - works automatically from there

Full compatibility matrix: harness feature matrix

How it works

ecotokens installs hooks that intercept tool outputs before they reach the model. Two interception points are supported:

PreToolUse / BeforeTool - fires before every shell (Bash) command:

1. Runs the command and captures its output
2. Applies a family-specific filter (git, cargo, python, …)
3. Optionally summarizes large outputs via a local AI model (Ollama)
4. Returns the compressed output to the model
5. Records the before/after token counts in a local metrics store

PostToolUse / AfterTool (Claude Code, Gemini CLI, Qwen Code) - fires after native file-tool calls:

1. Intercepts the tool result before it enters the context window
2. Applies a specialized filter (outline for source files, grep result trimming, glob path denoising)
3. Returns the compressed result to the model
4. Records the savings under the native_read, grep, or fs family
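
Both interception points share the same shape: take the raw output, filter it, hand the filtered text back, and record the before/after counts. The following is a minimal sketch of that shape, not the actual implementation. `Interception`, `intercept`, and `drop_blank_lines` are hypothetical names, the record layout is an assumption rather than the real metrics schema, and the filter is a deliberately trivial stand-in for a family filter.

```rust
/// One recorded interception (hypothetical shape, not the real metrics schema).
#[derive(Debug, PartialEq)]
struct Interception {
    family: String,
    tokens_before: usize,
    tokens_after: usize,
}

/// Filters `raw` and returns what the model would see plus the metrics record.
/// The filter and the token counter are passed in so the flow stays generic.
fn intercept(
    family: &str,
    raw: &str,
    filter: impl Fn(&str) -> String,
    count_tokens: impl Fn(&str) -> usize,
) -> (String, Interception) {
    let filtered = filter(raw);
    let record = Interception {
        family: family.to_string(),
        tokens_before: count_tokens(raw),
        tokens_after: count_tokens(&filtered),
    };
    (filtered, record)
}

/// Toy filter: drops blank lines (a stand-in for a real family filter).
fn drop_blank_lines(s: &str) -> String {
    s.lines()
        .filter(|l| !l.trim().is_empty())
        .collect::<Vec<_>>()
        .join("\n")
}
```

A caller would pass the family label, the captured output, a filter, and a token counter, then store the returned record wherever metrics are kept.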

Each agent is wired in through its own mechanism:

- Claude Code: PreToolUse + PostToolUse hooks in ~/.claude/settings.json
- Gemini CLI: BeforeTool + AfterTool hooks in ~/.gemini/settings.json
- Qwen Code: PreToolUse + PostToolUse hooks in ~/.qwen/settings.json
- Pi: a TypeScript extension (~/.pi/agent/extensions/ecotokens.ts) that intercepts tool_call (bash pre-exec) and tool_result (read/grep/find/ls post-exec) events in-process
- Hermes: a plugin that sends outputs through filter-output via the HermesTransformTerminalOutput and HermesTransformToolResult hook types
- Codex: PreToolUse + PostToolUse hooks in ~/.codex/hooks.json, plus MCP server registration in ~/.codex/config.toml

For a focused view of the runtime path, see docs/hook-filter-metrics-flow.md.

The result: the model sees clean, concise output - and you keep your context window.

Quick install

```bash
cargo install --git https://github.com/hansipie/ecotokens
```

For exact token counting (tiktoken cl100k_base instead of the character heuristic):

```bash
cargo install --git https://github.com/hansipie/ecotokens --features exact-tokens
```

Build from source

```bash
git clone https://github.com/hansipie/ecotokens.git
cd ecotokens
cargo build --release
./target/release/ecotokens --help
```

To install the locally built binary into Cargo's bin directory:

```bash
cargo install --path .
```

With exact token counting enabled via tiktoken (cl100k_base encoding):

```bash
cargo install --path . --features exact-tokens
```

By default, token counts use a fast character heuristic (chars × 0.25, ~80-85% accuracy). Enabling exact-tokens does not change filtering behavior; it only makes the token counts recorded in metrics more precise.
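
The character heuristic amounts to one token per four characters. A minimal sketch, assuming Unicode characters (not bytes) are counted and that the result is rounded up; both choices are assumptions of this example, and `estimate_tokens` is a hypothetical name.

```rust
/// Heuristic token estimate: chars x 0.25, i.e. one token per four characters.
/// Counting `char`s and rounding up are assumptions of this sketch.
fn estimate_tokens(text: &str) -> usize {
    (text.chars().count() + 3) / 4
}
```

An estimate like this is cheap enough to run on every hook invocation, which is the trade-off against the slower but exact tiktoken count.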

Installation

Claude Code

```bash
cargo install --path .
ecotokens install
```

Besides installing the hooks, this registers an MCP server entry in ~/.claude/settings.json:

```json
{
  "mcpServers": {
    "ecotokens": {
      "command": "ecotokens",
      "args": ["mcp-server"]
    }
  }
}
```

Gemini CLI

Requires Gemini CLI ≥ 0.1.0.

```bash
cargo install --path .
ecotokens install --target gemini
```

This writes BeforeTool and AfterTool hook entries into ~/.gemini/settings.json. The AfterTool hook intercepts read_file, search_file_content, and list_directory results.

It also registers the ecotokens MCP server in ~/.gemini/settings.json.

Qwen Code

Requires Qwen Code.

```bash
cargo install --path .
ecotokens install --target qwen
```

This writes PreToolUse and PostToolUse hook entries into ~/.qwen/settings.json. The PostToolUse hook intercepts read_file, search_files, and list_dir results.

It also registers the ecotokens MCP server in ~/.qwen/settings.json.

Pi

Requires Pi (@mariozechner/pi-coding-agent ≥ 0.62.0).

```bash
cargo install --path .
ecotokens install --target pi
```

This writes a TypeScript extension to ~/.pi/agent/extensions/ecotokens.ts. Pi auto-discovers it on next startup (or /reload inside an active session). The extension intercepts bash commands before execution and filters native tool results (read, grep, find, ls) after execution.

Hermes

Requires Hermes.

```bash
cargo install --path .
ecotokens install --target hermes
```

This installs an ecotokens plugin in ~/.hermes/plugins/. The plugin intercepts two Hermes hooks:

- transform_terminal_output: filters raw terminal output before Hermes truncates it
- transform_tool_result: filters non-terminal tool results (read_file, search_files, browser_snapshot, MCP tools, etc.)

Each hook calls ecotokens filter-output as a subprocess with the appropriate --hook-type, so savings are tracked separately in ecotokens gain.

Enabling the plugin — Hermes requires explicit activation. Two options:

```bash
# Option A: via Hermes CLI (requires hermes to be installed and in PATH)
hermes plugins enable ecotokens

# Option B: write directly to ~/.hermes/config.yaml (no hermes CLI needed)
ecotokens install --target hermes --enable-plugin
```

--enable-plugin adds ecotokens to plugins.enabled in ~/.hermes/config.yaml, creating the file if it does not exist and preserving all existing keys. Restart Hermes after enabling.

Per-tool filtering — tool result labels (hermes-tool:<name>) are mapped to the appropriate filter family automatically:

Tool	Family	Filter applied
`read_file`, `list_directory`, `create_file`	`fs`	file listing compaction
`search_files`, `find_files`, `search_in_file`	`grep`	match deduplication
`browser_snapshot`, `web_fetch`, `web_search`	`network`	HTML/JSON reduction
`run_python_code`, `execute_python`	`python`	traceback extraction
`delegate_task`, MCP tools, others	`generic`	200-line / 50 KB cap

Runtime variables — the generated plugin reads these from the environment:

Variable	Default	Effect
`ECOTOKENS_BIN`	path at install time	Override the ecotokens binary called by the plugin
`ECOTOKENS_HERMES_MIN_CHARS`	`2000`	Skip filtering outputs shorter than this
`ECOTOKENS_HERMES_TIMEOUT`	`10`	Subprocess timeout in seconds

The plugin is fail-open: any error, timeout, or empty output returns the original content unchanged.
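
The fail-open rule, together with the minimum-size threshold, can be sketched as follows. This is an illustration of the behavior described above, not the generated plugin's code. `filter_fail_open` is a hypothetical name, and the subprocess call is modeled as a closure that returns `Err` on any failure or timeout.

```rust
/// Applies `filter` to `original`, returning the original text unchanged when
/// the input is shorter than `min_chars`, the filter fails, or the filter
/// produces empty output. A timeout is modeled as an `Err` from the closure.
fn filter_fail_open(
    original: &str,
    min_chars: usize,
    filter: impl Fn(&str) -> Result<String, String>,
) -> String {
    if original.chars().count() < min_chars {
        return original.to_string();
    }
    match filter(original) {
        Ok(filtered) if !filtered.trim().is_empty() => filtered,
        _ => original.to_string(),
    }
}
```

The point of the design is that a broken or slow filter can only cost tokens, never lose output.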

Codex

```bash
cargo install --path .
ecotokens install --target codex
```

This installs three things:

- Plugin (~/.codex/plugins/ecotokens/.codex-plugin/plugin.json): identifies ecotokens to Codex
- Hooks (~/.codex/hooks.json): PreToolUse (bash pre-exec filtering) and PostToolUse (bash post-exec filtering)
- MCP server (~/.codex/config.toml): registers ecotokens mcp-server under [mcp_servers.ecotokens]

All targets at once

```bash
ecotokens install --target all
```

--target all covers Claude Code, Gemini CLI, Qwen Code, Pi, Hermes, and Codex in a single command.

With AI summarization

Enable AI-powered output compression via Ollama at install time:

```bash
ecotokens install --ai-summary                          # use default model (llama3.2:3b)
ecotokens install --ai-summary-model qwen2.5:3b         # specify model (implies --ai-summary)
```

This writes ai_summary_enabled and ai_summary_model to ~/.config/ecotokens/config.json. Ollama must be running and the model must be pulled (ollama pull llama3.2:3b).

Uninstall

```bash
ecotokens uninstall                    # Claude Code
ecotokens uninstall --target gemini    # Gemini CLI
ecotokens uninstall --target qwen      # Qwen Code
ecotokens uninstall --target pi        # Pi
ecotokens uninstall --target hermes    # Hermes
ecotokens uninstall --target codex     # Codex
ecotokens uninstall --target all       # all targets
```

Commands

Command	Description
`ecotokens install`	Install the PreToolUse + PostToolUse hooks and register the MCP server entry in `~/.claude/settings.json`
`ecotokens install --target hermes`	Install the Hermes plugin in `~/.hermes/plugins/`
`ecotokens install --target hermes --enable-plugin`	Install and add to `plugins.enabled` in `~/.hermes/config.yaml` directly
`ecotokens install --target codex`	Install the Codex plugin, `PreToolUse`/`PostToolUse` hooks in `~/.codex/hooks.json`, and MCP server in `~/.codex/config.toml`
`ecotokens uninstall`	Remove all hooks (PreToolUse, PostToolUse, SessionStart, SessionEnd where supported) and the MCP server entry
`ecotokens filter -- CMD [ARGS]`	Run a command, filter its output, record metrics
`ecotokens filter --cwd DIR -- CMD [ARGS]`	Same, with an explicit working directory
`ecotokens filter-output --command LABEL --exit-code N`	Filter captured output read from stdin and record metrics (used by Hermes hooks)
`ecotokens filter-output ... --hook-type transform-tool-result`	Same, attributed to the `transform_tool_result` hook in metrics
`ecotokens hook-post`	PostToolUse handler - intercept native tool results (Read, Grep, Glob)
`ecotokens gain`	Interactive TUI dashboard - savings by family or project
`ecotokens gain --period PERIOD`	Filter TUI to a time window (`all`, `today`, `week`, `month`)
`ecotokens gain --history`	Print a savings summary table for 24h / 7 days / 30 days
`ecotokens gain --json`	JSON report
`ecotokens config [--debug true|false]`	Show or update global configuration (including debug mode)
`ecotokens config --model MODEL`	Set the default model used for cost calculations (empty or unknown value lists available models)
`ecotokens index [--path DIR]`	Index a codebase for BM25 + symbolic search
`ecotokens search QUERY [--context N] [--include GLOB] [--exclude GLOB] [--no-trace]`	Search the indexed codebase with line numbers, context, and optional trace augmentation
`ecotokens outline PATH`	List symbols in a file or directory
`ecotokens symbol ID`	Look up a symbol by its stable ID
`ecotokens trace callers SYMBOL`	Find callers of a symbol
`ecotokens trace callees SYMBOL`	Find callees of a symbol
`ecotokens watch [--path DIR]`	Watch a directory and keep the index up to date
`ecotokens mcp-server [--index-dir DIR]`	Start the stdio MCP server exposing search/outline/symbol/trace/duplicates tools
`ecotokens auto-watch enable`	Start watch automatically on each Claude Code, Qwen Code, Pi, Hermes or Codex session
`ecotokens auto-watch disable`	Disable automatic watch
`ecotokens abbreviations enable`	Replace common words with abbreviations in filtered outputs + inject a matching instruction at SessionStart
`ecotokens abbreviations disable`	Turn abbreviations off (default)
`ecotokens abbreviations list`	List the active dictionary (defaults merged with user overrides)
`ecotokens duplicates`	Detect near-duplicate code blocks in the indexed codebase
`ecotokens clear --all`	Delete all recorded interceptions
`ecotokens clear --before DATE`	Delete interceptions recorded before DATE (YYYY-MM-DD)
`ecotokens clear --older-than DURATION`	Delete interceptions older than a duration (e.g. `30d`, `2w`, `1m`)
`ecotokens clear --family FAMILY`	Delete interceptions of a specific command family
`ecotokens clear --project PATH`	Delete interceptions for a specific project (use `"[undefined]"` for entries without a git root)
`ecotokens completions SHELL`	Generate a shell completion script (`bash`, `zsh`, `fish`, `powershell`, `elvish`)

Shell completions

```bash
# zsh
ecotokens completions zsh > ~/.zsh/completions/_ecotokens

# bash
ecotokens completions bash > ~/.local/share/bash-completion/completions/ecotokens

# fish
ecotokens completions fish > ~/.config/fish/completions/ecotokens.fish

# PowerShell
ecotokens completions powershell >> $PROFILE
```

Reload your shell (or open a new terminal) to activate completions.

Gain dashboard

```bash
ecotokens gain                                          # all time, uses default model from config
ecotokens gain --period today                           # today only
ecotokens gain --period week                            # last 7 days
ecotokens gain --period month --model claude-sonnet-4-6 # last 30 days, override model
ecotokens gain --history                                # summary table: 24h / 7d / 30d
ecotokens gain --history --json                         # same, as JSON
```

The model used for cost calculations defaults to the value set with ecotokens config --model (or claude-sonnet-4-6 if not configured). Pass --model to override for a single invocation.

Interactive TUI showing token savings per command family and per project, with a sparkline. The --period flag filters both the stats and the history panels.

Keybindings:

Key	Action
`j` / `u`	Navigate down / up in list
`k` / `i`	Scroll history log down / up (family log view)
`l` / `o`	Scroll detail / diff / SplitRaw BEFORE panel down / up
`L` / `O`	Scroll SplitRaw AFTER panel down / up
`p`	Switch to project view (from family view)
`f`	Switch to family view (from project view)
`d`	Cycle detail mode (details → diff → split raw) - family view only
`s`	Cycle sparkline scale (linear / log / capped)
`q` / `Esc`	Quit

Filter command

ecotokens filter runs a command directly and returns its filtered output. Useful for testing filters or wrapping commands in scripts:

```bash
ecotokens filter -- cargo test
ecotokens filter --debug -- git log --oneline -50
ecotokens filter --cwd /path/to/project -- cargo test
```

The output is compressed by the same family-specific filters used by the hook, and token savings are recorded in the metrics store.

Watch command

ecotokens watch monitors a directory and automatically re-indexes files as they change.

```bash
ecotokens watch                    # foreground, TUI progress
ecotokens watch --path ./src       # watch a specific directory
ecotokens watch --background       # fork to background
ecotokens watch --status           # show status of background process
ecotokens watch --status --json    # JSON status output
ecotokens watch --stop             # stop the background process
```

Note: Background logs are only written if global debug is enabled (ecotokens config --debug true).

Auto-watch (Claude Code, Qwen Code, Pi, Hermes, Codex)

ecotokens auto-watch integrates with agent session lifecycles to start and stop the watcher automatically where both lifecycle events are available.

```bash
ecotokens auto-watch enable    # enable auto-watch
ecotokens auto-watch disable   # disable (hooks remain installed but are no-ops)
```

When enabled, ecotokens watch --background starts automatically when a session opens, and stops when it closes on agents that expose an end-of-session event. The setting is stored in ~/.config/ecotokens/config.json (auto_watch: true/false).

Support by agent:

Agent	Mechanism	Notes
Claude Code	`SessionStart` / `SessionEnd` shell hooks in `~/.claude/settings.json`	Installed by `auto-watch enable`
Qwen Code	`SessionStart` / `SessionEnd` shell hooks in `~/.qwen/settings.json`	Installed automatically if Qwen hook is present
Pi	`session_start` / `session_end` events in the TypeScript extension	Built into the Pi extension
Hermes	`on_session_start` / `on_session_end` plugin hooks	Built into the Hermes plugin; install first with `ecotokens install --target hermes`
Codex	—	Session hooks not yet supported; auto-watch not available for Codex
Gemini CLI	—	Gemini does not expose session lifecycle hooks

Word abbreviations

```bash
ecotokens abbreviations enable    # transform narrative text + inject model instruction
ecotokens abbreviations list      # show the active dictionary
ecotokens abbreviations disable   # back to default
```

When enabled, a post-processing pass replaces full words with shorter forms in the narrative parts of tool outputs (code blocks between triple backticks are preserved). A matching additionalContext payload is emitted at SessionStart so the model adopts the same abbreviations in its own responses.

See the full list of default abbreviations in docs/abbreviations.md.

Keep the feature flag in ~/.config/ecotokens/config.json:

```json
{
  "abbreviations_enabled": true
}
```

Put custom pairs in a separate ~/.config/ecotokens/abbreviations.json file:

```json
{
  "function": "func",
  "repository": "repo"
}
```

Bonus Tools

MCP server (Claude Code, Gemini CLI, Qwen Code, Codex)

ecotokens mcp-server starts a stdio MCP server backed by the ecotokens index and trace engines.

```bash
ecotokens mcp-server
ecotokens mcp-server --index-dir ~/.config/ecotokens/index
```

Exposed tools:

- ecotokens_search: BM25 + semantic search
- ecotokens_outline: symbol outline for file/directory
- ecotokens_symbol: fetch full symbol source by stable ID
- ecotokens_trace_callers: find callers of a symbol
- ecotokens_trace_callees: find callees (with depth)
- ecotokens_duplicates: detect near-duplicate code blocks

For Claude Code, Gemini CLI, Qwen Code, and Codex, ecotokens install registers this server automatically in each target's settings file (mcpServers in JSON settings, [mcp_servers.ecotokens] in Codex's config.toml).

Search command

ecotokens search QUERY performs BM25 (+ optional semantic) search over the indexed codebase and returns results anchored to the matching line.

```bash
ecotokens search "embed_text"                        # top 5 results, 2 lines of context
ecotokens search "embed_text" --context 4            # 4 lines above and below the match
ecotokens search "error" --include "*.rs"            # Rust files only
ecotokens search "TODO" --exclude "*.md" --exclude "*.toml"
ecotokens search "find_callers" --no-trace           # pure BM25, no trace augmentation
ecotokens search "find_callers" --json               # JSON output with callers array
ecotokens search "query" --top-k 10                  # more results
```

Output format:

```rust
src/search/query.rs:29 (score: 11.068)
  27:  
  28:  pub fn search_index(opts: SearchOptions) -> tantivy::Result<Vec<SearchResult>> {
  29:      let index = Index::open_in_dir(&opts.index_dir)?;
  30:      let (_, file_path_field, content_field, kind_field, line_start_field, _) = build_schema();
  31:
```

When the query matches a symbol name, callers are automatically appended:

```bash
# Symbol match - call sites via trace
  src/main.rs:1301 [caller]  cmd_search
```

Results are automatically scoped to the current git project when using the global index - files from other indexed projects are silently filtered out.

Duplicates command

Less code means fewer tokens

ecotokens duplicates scans the indexed codebase for near-identical code blocks and reports them grouped by similarity.

```bash
ecotokens duplicates                          # default: threshold=70%, min_lines=5
ecotokens duplicates --threshold 80           # only report ≥ 80% similarity
ecotokens duplicates --min-lines 10           # ignore blocks shorter than 10 lines
ecotokens duplicates --json                   # JSON output
```

Each group shows the file paths, line ranges, similarity score, and a refactoring proposal (exact duplicate, near duplicate, or subset).

Configuration

```bash
ecotokens config           # show all settings (text)
ecotokens config --json    # show all settings (JSON)
```

Output includes:

```bash
hook_installed        : true
debug                 : false
debuglog              : false
default_model         : claude-sonnet-4-6
exclusions            : []
embed_provider        : candle (sentence-transformers/all-MiniLM-L6-v2)
ai_summary_enabled    : false
ai_summary_model      : llama3.2:3b (default)
ai_summary_url        : http://localhost:11434 (default)
abbreviations_enabled : false
```

Debug mode

Enable the global debug mode to see detailed interception logs and enable background logging for the watch command:

```bash
ecotokens config --debug true
ecotokens config --debug false
```

This updates the debug field in ~/.config/ecotokens/config.json.

Debug file logging

Enable structured per-hook logging to a file for deeper tracing of what ecotokens intercepts:

```bash
ecotokens config --debuglog true
ecotokens config --debuglog false
```

When enabled, every hook invocation appends a JSONL entry to ~/.config/ecotokens/debug.log:

```json
{"ts":"2026-05-08T12:00:00Z","uid":"a1b2c3d4","cmd":"git status","phase":"input","data":{...}}
{"ts":"2026-05-08T12:00:00Z","uid":"a1b2c3d4","cmd":"git status","phase":"output","data":{...}}
```

Each entry contains a short uid to correlate the input and output phases of the same invocation. Unlike --debug, which prints to stderr, --debuglog writes silently to disk and persists across sessions.
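
To show how the uid ties entries together, here is a rough sketch that groups log lines by uid and lists the phases seen for each. It is a std-only illustration with hypothetical helper names (`field`, `phases_by_uid`). The field extraction is deliberately naive, with no handling of escaped quotes; a real tool would more likely use a JSON parser such as serde_json.

```rust
use std::collections::BTreeMap;

/// Extracts the string value of `"key":"value"` from one JSONL line.
/// Naive on purpose: it does not handle escaped quotes inside values.
fn field<'a>(line: &'a str, key: &str) -> Option<&'a str> {
    let needle = format!("\"{}\":\"", key);
    let start = line.find(&needle)? + needle.len();
    let end = line[start..].find('"')? + start;
    Some(&line[start..end])
}

/// Groups the `phase` values of a debug log by `uid`, in log order.
fn phases_by_uid(log: &str) -> BTreeMap<String, Vec<String>> {
    let mut map: BTreeMap<String, Vec<String>> = BTreeMap::new();
    for line in log.lines() {
        if let (Some(uid), Some(phase)) = (field(line, "uid"), field(line, "phase")) {
            map.entry(uid.to_string())
                .or_insert_with(Vec::new)
                .push(phase.to_string());
        }
    }
    map
}
```

An invocation whose uid maps to only an input phase would be one that never produced a logged output, which is the kind of gap this log helps track down.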

Default model for cost calculations

The model selected here determines the per-token price used in gain reports:

```bash
ecotokens config --model claude-opus-4-7    # set default model
ecotokens config --model ""                 # list available models
ecotokens config --model unknown-model      # unknown model → lists available models
```

The model name must be present in the built-in pricing table (or added via ~/.config/ecotokens/pricing.json). Passing an empty value or an unrecognised name prints the full list and exits.

See the full list of built-in models and prices in docs/models.md.

Override any entry or add a new model by creating ~/.config/ecotokens/pricing.json:

```json
{
  "my-custom-model": { "input_usd_per_1m": 0.50, "output_usd_per_1m": 2.00 }
}
```
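
A per-million price turns into a dollar figure by a single multiplication. The sketch below assumes saved tokens are priced at the input rate, since filtered output would otherwise have entered the context as input; the README does not state which rate gain uses, and `saved_cost_usd` is a hypothetical name.

```rust
/// Dollar value of `tokens_saved` at a given price per one million tokens.
/// Pricing savings at the input rate is an assumption of this sketch.
fn saved_cost_usd(tokens_saved: u64, input_usd_per_1m: f64) -> f64 {
    tokens_saved as f64 * input_usd_per_1m / 1_000_000.0
}
```

For example, one million saved tokens at the `my-custom-model` input price of 0.50 USD per million comes to 0.50 USD.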

Word abbreviations (optional)

```bash
ecotokens abbreviations enable    # transform narrative text + inject model instruction
ecotokens abbreviations list      # show the active dictionary
ecotokens abbreviations disable   # back to default
```

Extend or override the built-in dictionary via a separate ~/.config/ecotokens/abbreviations.json file:

```json
{
  "function": "func",
  "repository": "repo"
}
```

The feature flag stays in ~/.config/ecotokens/config.json:

```json
{
  "abbreviations_enabled": true
}
```

Supported command families

Family	Examples
`git`	`git status`, `git diff`, `git log`
`cargo`	`cargo build`, `cargo test`, `cargo clippy`
`python`	`pytest`, `ruff`, `uv run`, `poetry`, `pipx`
`javascript`	`jest`, `mocha`, `yarn`
`cpp`	`gcc`, `clang`, `make`, `cmake`
`fs`	`ls`, `find`, `tree`
`markdown`	`.md` files
`config`	`.toml`, `.json`, `.yaml`
`generic`	Everything else (truncated to 200 lines / 50 KB)
`native_read`	Claude Code `Read` tool results (PostToolUse, outline-based compression)

Note: Family detection uses the basename of the first token, so commands invoked via absolute path (/usr/bin/git), venv (.venv/bin/pytest), version managers (~/.cargo/bin/cargo), or wrappers (poetry run) are correctly matched to their family.
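
Basename-based detection can be sketched in a few lines. This is an illustration rather than the project's code: `detect_family` is a hypothetical name, and the mapping covers only the example commands listed in the table above.

```rust
/// Maps a command line to a filter family using the basename of its first
/// token. Only the example commands from the families table are mapped here.
fn detect_family(command: &str) -> &'static str {
    // First whitespace-separated token, e.g. "/usr/bin/git".
    let first = command.split_whitespace().next().unwrap_or("");
    // Strip any directory prefix, e.g. "/usr/bin/git" -> "git".
    let base = first.rsplit('/').next().unwrap_or(first);
    match base {
        "git" => "git",
        "cargo" => "cargo",
        "pytest" | "ruff" | "uv" | "poetry" | "pipx" => "python",
        "jest" | "mocha" | "yarn" => "javascript",
        "gcc" | "clang" | "make" | "cmake" => "cpp",
        "ls" | "find" | "tree" => "fs",
        _ => "generic",
    }
}
```

Because only the basename is compared, `/usr/bin/git status`, `.venv/bin/pytest -q`, and `~/.cargo/bin/cargo build` resolve to the same families as their bare counterparts.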

Hermes tool result labels (hermes-tool:<name>) are mapped to the same families: read_file → fs, search_files → grep, browser_snapshot → network, run_python_code → python, everything else → generic.

Embeddings

ecotokens search uses dual BM25 + vector retrieval with score fusion (0.4 × BM25 + 0.6 × cosine). The vector index is powered by Candle - a zero-config local embedding engine. No external service required.
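
The fusion formula is a plain weighted sum. The sketch below uses the 0.4 / 0.6 weights quoted above. How BM25 scores are scaled before fusion is not documented here, so this example assumes they have already been normalized to the 0..=1 range; `fused_score` and `rank` are hypothetical names.

```rust
/// Weights quoted above: 0.4 for BM25, 0.6 for cosine similarity.
const BM25_WEIGHT: f64 = 0.4;
const COSINE_WEIGHT: f64 = 0.6;

/// Fuses a BM25 score and a cosine similarity into one ranking score.
/// Assumption: `bm25` has already been scaled to the 0..=1 range.
fn fused_score(bm25: f64, cosine: f64) -> f64 {
    BM25_WEIGHT * bm25 + COSINE_WEIGHT * cosine
}

/// Sorts (path, bm25, cosine) candidates by fused score, best first.
fn rank<'a>(mut hits: Vec<(&'a str, f64, f64)>) -> Vec<&'a str> {
    hits.sort_by(|a, b| fused_score(b.1, b.2).total_cmp(&fused_score(a.1, a.2)));
    hits.into_iter().map(|h| h.0).collect()
}
```

With these weights, a hit that is strong semantically but weak lexically can outrank one that matches keywords only, which is the practical effect of giving cosine similarity the larger share.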

Provider

Candle (default) - runs sentence-transformers/all-MiniLM-L6-v2 (384 dim) locally. The model is downloaded automatically from HuggingFace Hub on first use (~90 MB, cached in ~/.cache/huggingface/).

```bash
# Candle is active by default - nothing to configure
ecotokens index --path /your/project
ecotokens search "your query"
```

Each result includes a retrieval_source field (bm25, vector, or both) visible in JSON output.

Disable embeddings

```bash
ecotokens config --embed-provider none    # fall back to pure BM25
```

Workflow

```bash
# 1. Index your project (Candle embeddings computed automatically)
ecotokens index --path /your/project

# 2. Search with hybrid scoring
ecotokens search "your query"

# 3. JSON output with retrieval_source
ecotokens search "your query" --json
```

Model change detection

When the configured embedding model changes, ecotokens index automatically rebuilds the vector index (hnsw_index.bin) without touching the BM25 index. Embeddings for unchanged files are reused between runs.

AI summarization (optional)

When enabled, large command outputs (more than roughly 2500 tokens) are summarized by a local Ollama model instead of being truncated. If Ollama is unavailable or times out, ecotokens falls back to generic filtering.

Enable via install:

```bash
ecotokens install --ai-summary-model llama3.2:3b
```

Or update the config file directly (~/.config/ecotokens/config.json):

```json
{
  "ai_summary_enabled": true,
  "ai_summary_model": "llama3.2:3b"
}
```

Ollama must be running locally. The summarization model is called with a 3-second timeout to avoid blocking the coding agent.

Benchmarks

Measured on a real developer workstation from 2026-03-06 to 2026-05-27 (19 928 hook executions):

Metric	Value
Tokens before filtering	94 282 087
Tokens after filtering	5 874 896
Tokens saved	88 407 191
Overall reduction	93.8%
Commands with savings	5 735 / 19 928, or 28.8%
Biggest family	`grep`, 55 383 168 tokens saved
Best single run	11 323 890 tokens saved from one `grep` scan

Full benchmark report

Precision Guarantees

Filtering is aggressive on noise, conservative on signal:

- Short outputs are never modified: outputs under 200 lines or 50 KB pass through unchanged
- Errors are always preserved: error[, FAILED, E (pytest), --- FAIL: (Go), stack traces and panic messages are never removed
- Failure sections are fully kept: structured blocks (=== FAILURES ===, failures:, failure diffs) are always passed through in their entirety
- Conservative fallback: if a family filter doesn't improve the output (filtered ≥ original), the original is returned as-is
- Secrets are redacted before filtering: 33 patterns covering cloud keys, AI APIs, VCS tokens, payment secrets and more are detected and replaced before any content reaches the model. See docs/secret-patterns.md for the full list.
- UTF-8 safe truncation: truncation always happens at character boundaries, never mid-codepoint
- Head + tail preservation: when generic truncation applies, the first and last 20 lines are always kept (start context + end result)
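
Three of these guarantees are mechanical enough to sketch: the conservative fallback, UTF-8 safe truncation, and head + tail preservation. The code below is an illustration under stated assumptions, not the project's implementation. All function names are hypothetical, the omission marker text is invented, the fallback compares byte lengths (the README does not say which unit is used), and only the line cap is modeled, not the 50 KB cap.

```rust
/// Conservative fallback: keep the original unless filtering made it smaller.
/// Comparing byte lengths is an assumption of this sketch.
fn conservative<'a>(original: &'a str, filtered: &'a str) -> &'a str {
    if filtered.len() >= original.len() { original } else { filtered }
}

/// Longest prefix of `s` within `max_bytes` that does not split a code point.
fn truncate_utf8(s: &str, max_bytes: usize) -> &str {
    if s.len() <= max_bytes {
        return s;
    }
    let mut end = max_bytes;
    // Index 0 is always a boundary, so this loop terminates.
    while !s.is_char_boundary(end) {
        end -= 1;
    }
    &s[..end]
}

/// Keeps the first and last `keep` lines once `text` exceeds `max_lines`.
/// The marker line in the middle is a made-up placeholder.
fn head_tail(text: &str, max_lines: usize, keep: usize) -> String {
    let lines: Vec<&str> = text.lines().collect();
    if lines.len() <= max_lines || lines.len() <= 2 * keep {
        return text.to_string();
    }
    let omitted = lines.len() - 2 * keep;
    let mut out: Vec<String> = lines[..keep].iter().map(|l| l.to_string()).collect();
    out.push(format!("[... {} lines omitted ...]", omitted));
    out.extend(lines[lines.len() - keep..].iter().map(|l| l.to_string()));
    out.join("\n")
}
```

With the limits named above, a call such as `head_tail(output, 200, 20)` leaves short outputs untouched and reduces longer ones to the first 20 lines, a marker, and the last 20 lines.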

Requirements

- Rust ≥ 1.75 (stable)
- One or more of: Claude Code (with hook support), Gemini CLI ≥ 0.1.0, Qwen Code, Pi ≥ 0.62.0, Hermes, Codex
- Internet access on first use (Candle downloads all-MiniLM-L6-v2 ~90 MB from HuggingFace Hub; cached locally after that)
- Ollama (optional, for AI summarization only)

Contributing

Contributions are welcome! Please read the contributing guidelines before submitting a pull request.

License

MIT

ecotokens saves real AI coding context

Real-world metric	Value
Hook executions measured	19 928
Tokens saved	88 407 191
Overall reduction	93.8%
Commands with savings	5 735 / 19 928, or 28.8%
Biggest command family	`grep`, with 55 383 168 tokens saved

Full methodology and per-family breakdown: docs/BENCHMARKS.md.

ecotokens demo

Features highlight

Feature	Details
PreToolUse hook	Intercepts every shell (`Bash`) command before its output reaches the model - filters, compresses, and records savings
PostToolUse hook	Intercepts native tool results (`Read`/`read_file`, `Grep`/`search_file_content`, `Glob`/`list_directory`) - outline-based compression for source files, grep trimming, glob denoising
Gain dashboard	Interactive TUI - token savings by command family or project, sparkline, diff view, history log
Multi-agent support	Works with Claude Code, Gemini CLI, Qwen Code, Pi, Hermes, and Codex out of the box
Precision guarantees	Errors, failures, and stack traces are never removed; secrets are redacted before filtering
Code intelligence	BM25 + vector search (Candle, zero-config), symbol lookup, call graph tracing, near-duplicate detection
MCP server	Exposes code-intelligence tools over stdio (`ecotokens mcp-server`) and auto-registers in agent settings on install
AI summarization (optional)	Large outputs compressed by a local Ollama model instead of being truncated
Word abbreviations (optional)	Replace common words with shorter forms (`function`→`fn`, `configuration`→`config`, …) in narrative text, and nudge the model to do the same via a SessionStart instruction
Zero config	One `ecotokens install` command - works automatically from there

Full compatibility matrix: harness feature matrix

How it works

ecotokens installs hooks that intercept tool outputs before they reach the model. Two interception points are supported:

PreToolUse / BeforeTool - fires before every shell (Bash) command:

Runs the command and captures its output
Applies a family-specific filter (git, cargo, python, …)
Optionally summarizes large outputs via a local AI model (Ollama)
Returns the compressed output to the model
Records the before/after token counts in a local metrics store

PostToolUse / AfterTool (Claude Code, Gemini CLI, Qwen Code) - fires after native file-tool calls:

Intercepts the tool result before it enters the context window
Applies a specialized filter (outline for source files, grep result trimming, glob path denoising)
Returns the compressed result to the model
Records the savings under the native_read, grep, or fs family

For a focused view of the runtime path, see docs/hook-filter-metrics-flow.md.

The result: the model sees clean, concise output - and you keep your context window.

Quick install

hljs language-bash

cargo install --git https://github.com/hansipie/ecotokens

For exact token counting (tiktoken cl100k_base instead of the character heuristic):

hljs language-bash

cargo install --git https://github.com/hansipie/ecotokens --features exact-tokens

Build from source

hljs language-bash

git clone https://github.com/hansipie/ecotokens.git
cd ecotokens
cargo build --release
./target/release/ecotokens --help

To install the locally built binary into Cargo's bin directory:

cargo install --path .

With exact token counting enabled via tiktoken (cl100k_base encoding):

cargo install --path . --features exact-tokens

Without the exact-tokens feature, token counts use a fast character heuristic (chars × 0.25, ~80-85% accuracy). Enabling exact-tokens has no effect on filtering behavior; only the token counts recorded in metrics become more precise.
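
As a rough illustration of the character heuristic, the sketch below applies the chars × 0.25 rule. The function name `estimate_tokens` and the rounding choice are assumptions for this example, not taken from the ecotokens source.

```python
def estimate_tokens(text: str) -> int:
    """Character heuristic described above: tokens ~ chars x 0.25."""
    # Rounding to the nearest integer is an assumption of this sketch.
    return round(len(text) * 0.25)

print(estimate_tokens("a" * 400))
```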

Installation

Claude Code

cargo install --path .
ecotokens install

In addition to hook installation, this also registers an MCP server entry in ~/.claude/settings.json:

{
  "mcpServers": {
    "ecotokens": {
      "command": "ecotokens",
      "args": ["mcp-server"]
    }
  }
}
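
To make the registration step concrete, here is a minimal sketch of merging that entry into an existing settings file while keeping the other keys. `register_mcp_server` is a hypothetical helper written for this example; it is not how `ecotokens install` is implemented, and it runs against a temporary file rather than the real ~/.claude/settings.json.

```python
import json
import tempfile
from pathlib import Path

def register_mcp_server(settings_path: Path) -> dict:
    """Add the ecotokens MCP entry to a JSON settings file, preserving other keys."""
    settings = json.loads(settings_path.read_text()) if settings_path.exists() else {}
    servers = settings.setdefault("mcpServers", {})
    servers["ecotokens"] = {"command": "ecotokens", "args": ["mcp-server"]}
    settings_path.write_text(json.dumps(settings, indent=2) + "\n")
    return settings

# Demo against a throwaway file with one pre-existing (made-up) key.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "settings.json"
    path.write_text('{"theme": "dark"}')
    merged = register_mcp_server(path)

print(json.dumps(merged, indent=2))
```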

Gemini CLI

Requires Gemini CLI ≥ 0.1.0.

cargo install --path .
ecotokens install --target gemini

This writes BeforeTool and AfterTool hook entries into ~/.gemini/settings.json. The AfterTool hook intercepts read_file, search_file_content, and list_directory results.

It also registers the ecotokens MCP server in ~/.gemini/settings.json.

Qwen Code

Requires Qwen Code.

cargo install --path .
ecotokens install --target qwen

This writes PreToolUse and PostToolUse hook entries into ~/.qwen/settings.json. The PostToolUse hook intercepts read_file, search_files, and list_dir results.

It also registers the ecotokens MCP server in ~/.qwen/settings.json.

Pi

Requires Pi (@mariozechner/pi-coding-agent ≥ 0.62.0).

cargo install --path .
ecotokens install --target pi

Hermes

Requires Hermes.

cargo install --path .
ecotokens install --target hermes

This installs an ecotokens plugin in ~/.hermes/plugins/. The plugin intercepts two Hermes hooks:

transform_terminal_output — filters raw terminal output before Hermes truncates it
transform_tool_result — filters non-terminal tool results (read_file, search_files, browser_snapshot, MCP tools, etc.)

Each hook calls ecotokens filter-output as a subprocess with the appropriate --hook-type, so savings are tracked separately in ecotokens gain.

Enabling the plugin — Hermes requires explicit activation. Two options:

# Option A: via Hermes CLI (requires hermes to be installed and in PATH)
hermes plugins enable ecotokens

# Option B: write directly to ~/.hermes/config.yaml (no hermes CLI needed)
ecotokens install --target hermes --enable-plugin

--enable-plugin adds ecotokens to plugins.enabled in ~/.hermes/config.yaml, creating the file if it does not exist and preserving all existing keys. Restart Hermes after enabling.

Per-tool filtering — tool result labels (hermes-tool:<name>) are mapped to the appropriate filter family automatically:

Tool	Family	Filter applied
`read_file`, `list_directory`, `create_file`	`fs`	file listing compaction
`search_files`, `find_files`, `search_in_file`	`grep`	match deduplication
`browser_snapshot`, `web_fetch`, `web_search`	`network`	HTML/JSON reduction
`run_python_code`, `execute_python`	`python`	traceback extraction
`delegate_task`, MCP tools, others	`generic`	200-line / 50 KB cap
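
The table above amounts to a lookup from tool name to family. A minimal sketch, assuming the label is the tool name prefixed with `hermes-tool:`; the names `TOOL_FAMILIES` and `family_for_label` are hypothetical.

```python
# Tool names per family, copied from the table above.
TOOL_FAMILIES = {
    "fs": ("read_file", "list_directory", "create_file"),
    "grep": ("search_files", "find_files", "search_in_file"),
    "network": ("browser_snapshot", "web_fetch", "web_search"),
    "python": ("run_python_code", "execute_python"),
}
FAMILY_BY_TOOL = {tool: family
                  for family, tools in TOOL_FAMILIES.items()
                  for tool in tools}

def family_for_label(label: str) -> str:
    """Map a 'hermes-tool:<name>' label to a filter family, defaulting to generic."""
    prefix = "hermes-tool:"
    name = label[len(prefix):] if label.startswith(prefix) else label
    return FAMILY_BY_TOOL.get(name, "generic")

print(family_for_label("hermes-tool:web_fetch"))
print(family_for_label("hermes-tool:delegate_task"))
```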

Runtime variables — the generated plugin reads these from the environment:

Variable	Default	Effect
`ECOTOKENS_BIN`	path at install time	Override the ecotokens binary called by the plugin
`ECOTOKENS_HERMES_MIN_CHARS`	`2000`	Skip filtering outputs shorter than this
`ECOTOKENS_HERMES_TIMEOUT`	`10`	Subprocess timeout in seconds

The plugin is fail-open: any error, timeout, or empty output returns the original content unchanged.
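
A fail-open wrapper of this kind could look like the sketch below. It is an illustration, not the generated plugin: `filter_fail_open` is a hypothetical name, and falling back to `ecotokens` on PATH when `ECOTOKENS_BIN` is unset is an assumption (the real default is the path recorded at install time). The subcommand and flags are the ones listed in the Commands table.

```python
import os
import subprocess

def filter_fail_open(text: str, label: str, exit_code: int = 0) -> str:
    """Pipe text through `ecotokens filter-output`; return it unchanged on any problem."""
    min_chars = int(os.environ.get("ECOTOKENS_HERMES_MIN_CHARS", "2000"))
    timeout = float(os.environ.get("ECOTOKENS_HERMES_TIMEOUT", "10"))
    binary = os.environ.get("ECOTOKENS_BIN", "ecotokens")  # assumed fallback
    if len(text) < min_chars:
        return text  # too short to be worth filtering
    try:
        proc = subprocess.run(
            [binary, "filter-output", "--command", label, "--exit-code", str(exit_code)],
            input=text, capture_output=True, text=True, timeout=timeout,
        )
    except (OSError, subprocess.SubprocessError):
        return text  # missing binary, timeout, or other launch failure
    if proc.returncode != 0 or not proc.stdout.strip():
        return text  # error or empty output: keep the original
    return proc.stdout
```

Every failure path returns the original text, so a broken or missing binary can only cost the savings, never the content.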

Codex

cargo install --path .
ecotokens install --target codex

This installs three things:

Plugin (~/.codex/plugins/ecotokens/.codex-plugin/plugin.json) — identifies ecotokens to Codex
Hooks (~/.codex/hooks.json) — PreToolUse (bash pre-exec filtering) and PostToolUse (bash post-exec filtering)
MCP server (~/.codex/config.toml) — registers ecotokens mcp-server under [mcp_servers.ecotokens]

All targets at once

ecotokens install --target all

--target all covers Claude Code, Gemini CLI, Qwen Code, Pi, Hermes, and Codex in a single command.

With AI summarization

Enable AI-powered output compression via Ollama at install time:

ecotokens install --ai-summary                          # use default model (llama3.2:3b)
ecotokens install --ai-summary-model qwen2.5:3b         # specify model (implies --ai-summary)

This writes ai_summary_enabled and ai_summary_model to ~/.config/ecotokens/config.json. Ollama must be running and the model must be pulled (ollama pull llama3.2:3b).

Uninstall

ecotokens uninstall                    # Claude Code
ecotokens uninstall --target gemini    # Gemini CLI
ecotokens uninstall --target qwen      # Qwen Code
ecotokens uninstall --target pi        # Pi
ecotokens uninstall --target hermes    # Hermes
ecotokens uninstall --target codex     # Codex
ecotokens uninstall --target all       # all targets

Commands

Command	Description
`ecotokens install`	Install the PreToolUse + PostToolUse hooks and register the MCP server entry in `~/.claude/settings.json`
`ecotokens install --target hermes`	Install the Hermes plugin in `~/.hermes/plugins/`
`ecotokens install --target hermes --enable-plugin`	Install and add to `plugins.enabled` in `~/.hermes/config.yaml` directly
`ecotokens install --target codex`	Install the Codex plugin, `PreToolUse`/`PostToolUse` hooks in `~/.codex/hooks.json`, and MCP server in `~/.codex/config.toml`
`ecotokens uninstall`	Remove all hooks (PreToolUse, PostToolUse, SessionStart, SessionEnd where supported) and the MCP server entry
`ecotokens filter -- CMD [ARGS]`	Run a command, filter its output, record metrics
`ecotokens filter --cwd DIR -- CMD [ARGS]`	Same, with an explicit working directory
`ecotokens filter-output --command LABEL --exit-code N`	Filter captured output read from stdin and record metrics (used by Hermes hooks)
`ecotokens filter-output ... --hook-type transform-tool-result`	Same, attributed to the `transform_tool_result` hook in metrics
`ecotokens hook-post`	PostToolUse handler - intercept native tool results (Read, Grep, Glob)
`ecotokens gain`	Interactive TUI dashboard - savings by family or project
`ecotokens gain --period PERIOD`	Filter TUI to a time window (`all`, `today`, `week`, `month`)
`ecotokens gain --history`	Print a savings summary table for 24h / 7 days / 30 days
`ecotokens gain --json`	JSON report
`ecotokens config [--debug true\|false]`	Show or update global configuration (including debug mode)
`ecotokens config --model MODEL`	Set the default model used for cost calculations (empty or unknown value lists available models)
`ecotokens index [--path DIR]`	Index a codebase for BM25 + symbolic search
`ecotokens search QUERY [--context N] [--include GLOB] [--exclude GLOB] [--no-trace]`	Search the indexed codebase with line numbers, context, and optional trace augmentation
`ecotokens outline PATH`	List symbols in a file or directory
`ecotokens symbol ID`	Look up a symbol by its stable ID
`ecotokens trace callers SYMBOL`	Find callers of a symbol
`ecotokens trace callees SYMBOL`	Find callees of a symbol
`ecotokens watch [--path DIR]`	Watch a directory and keep the index up to date
`ecotokens mcp-server [--index-dir DIR]`	Start the stdio MCP server exposing search/outline/symbol/trace/duplicates tools
`ecotokens auto-watch enable`	Start watch automatically on each Claude Code, Qwen Code, Pi, Hermes or Codex session
`ecotokens auto-watch disable`	Disable automatic watch
`ecotokens abbreviations enable`	Replace common words with abbreviations in filtered outputs + inject a matching instruction at SessionStart
`ecotokens abbreviations disable`	Turn abbreviations off (default)
`ecotokens abbreviations list`	List the active dictionary (defaults merged with user overrides)
`ecotokens duplicates`	Detect near-duplicate code blocks in the indexed codebase
`ecotokens clear --all`	Delete all recorded interceptions
`ecotokens clear --before DATE`	Delete interceptions recorded before DATE (YYYY-MM-DD)
`ecotokens clear --older-than DURATION`	Delete interceptions older than a duration (e.g. `30d`, `2w`, `1m`)
`ecotokens clear --family FAMILY`	Delete interceptions of a specific command family
`ecotokens clear --project PATH`	Delete interceptions for a specific project (use `"[undefined]"` for entries without a git root)
`ecotokens completions SHELL`	Generate a shell completion script (`bash`, `zsh`, `fish`, `powershell`, `elvish`)

Shell completions

# zsh
ecotokens completions zsh > ~/.zsh/completions/_ecotokens

# bash
ecotokens completions bash > ~/.local/share/bash-completion/completions/ecotokens

# fish
ecotokens completions fish > ~/.config/fish/completions/ecotokens.fish

# PowerShell
ecotokens completions powershell >> $PROFILE

Reload your shell (or open a new terminal) to activate completions.

Gain dashboard

ecotokens gain                                          # all time, uses default model from config
ecotokens gain --period today                           # today only
ecotokens gain --period week                            # last 7 days
ecotokens gain --period month --model claude-sonnet-4-6 # last 30 days, override model
ecotokens gain --history                                # summary table: 24h / 7d / 30d
ecotokens gain --history --json                         # same, as JSON

The model used for cost calculations defaults to the value set with ecotokens config --model (or claude-sonnet-4-6 if not configured). Pass --model to override for a single invocation.

Interactive TUI showing token savings per command family and per project, with a sparkline. The --period flag filters both the stats and the history panels.

Keybindings:

Key	Action
`j` / `u`	Navigate down / up in list
`k` / `i`	Scroll history log down / up (family log view)
`l` / `o`	Scroll detail / diff / SplitRaw BEFORE panel down / up
`L` / `O`	Scroll SplitRaw AFTER panel down / up
`p`	Switch to project view (from family view)
`f`	Switch to family view (from project view)
`d`	Cycle detail mode (details → diff → split raw) - family view only
`s`	Cycle sparkline scale (linear / log / capped)
`q` / `Esc`	Quit

Filter command

ecotokens filter runs a command directly and returns its filtered output. Useful for testing filters or wrapping commands in scripts:

ecotokens filter -- cargo test
ecotokens filter --debug -- git log --oneline -50
ecotokens filter --cwd /path/to/project -- cargo test

The output is compressed by the same family-specific filters used by the hook, and token savings are recorded in the metrics store.

Watch command

ecotokens watch monitors a directory and automatically re-indexes files as they change.

ecotokens watch                    # foreground, TUI progress
ecotokens watch --path ./src       # watch a specific directory
ecotokens watch --background       # fork to background
ecotokens watch --status           # show status of background process
ecotokens watch --status --json    # JSON status output
ecotokens watch --stop             # stop the background process

Note: Background logs are only written if global debug is enabled (ecotokens config --debug true).

Auto-watch (Claude Code, Qwen Code, Pi, Hermes, Codex)

ecotokens auto-watch integrates with agent session lifecycles to start and stop the watcher automatically where both lifecycle events are available.

ecotokens auto-watch enable    # enable auto-watch
ecotokens auto-watch disable   # disable (hooks remain installed but are no-ops)

Support by agent:

Agent	Mechanism	Notes
Claude Code	`SessionStart` / `SessionEnd` shell hooks in `~/.claude/settings.json`	Installed by `auto-watch enable`
Qwen Code	`SessionStart` / `SessionEnd` shell hooks in `~/.qwen/settings.json`	Installed automatically if Qwen hook is present
Pi	`session_start` / `session_end` events in the TypeScript extension	Built into the Pi extension
Hermes	`on_session_start` / `on_session_end` plugin hooks	Built into the Hermes plugin; install first with `ecotokens install --target hermes`
Codex	—	Session hooks not yet supported; auto-watch not available for Codex
Gemini CLI	—	Gemini does not expose session lifecycle hooks

Word abbreviations

ecotokens abbreviations enable    # transform narrative text + inject model instruction
ecotokens abbreviations list      # show the active dictionary
ecotokens abbreviations disable   # back to default

See the full list of default abbreviations in docs/abbreviations.md.

Keep the feature flag in ~/.config/ecotokens/config.json:

{
  "abbreviations_enabled": true
}

Put custom pairs in a separate ~/.config/ecotokens/abbreviations.json file:

{
  "function": "func",
  "repository": "repo"
}

Bonus Tools

MCP server (Claude Code, Gemini CLI, Qwen Code, Codex)

ecotokens mcp-server starts a stdio MCP server backed by the ecotokens index and trace engines.

ecotokens mcp-server
ecotokens mcp-server --index-dir ~/.config/ecotokens/index

Exposed tools:

ecotokens_search - BM25 + semantic search
ecotokens_outline - symbol outline for file/directory
ecotokens_symbol - fetch full symbol source by stable ID
ecotokens_trace_callers - find callers of a symbol
ecotokens_trace_callees - find callees (with depth)
ecotokens_duplicates - detect near-duplicate code blocks

Search command

ecotokens search QUERY performs BM25 (+ optional semantic) search over the indexed codebase and returns results anchored to the matching line.

ecotokens search "embed_text"                        # top 5 results, 2 lines of context
ecotokens search "embed_text" --context 4            # 4 lines above and below the match
ecotokens search "error" --include "*.rs"            # Rust files only
ecotokens search "TODO" --exclude "*.md" --exclude "*.toml"
ecotokens search "find_callers" --no-trace           # pure BM25, no trace augmentation
ecotokens search "find_callers" --json               # JSON output with callers array
ecotokens search "query" --top-k 10                  # more results

Output format:

src/search/query.rs:29 (score: 11.068)
  27:  
  28:  pub fn search_index(opts: SearchOptions) -> tantivy::Result<Vec<SearchResult>> {
  29:      let index = Index::open_in_dir(&opts.index_dir)?;
  30:      let (_, file_path_field, content_field, kind_field, line_start_field, _) = build_schema();
  31:

When the query matches a symbol name, callers are automatically appended:

# Symbol match - call sites via trace
  src/main.rs:1301 [caller]  cmd_search

Results are automatically scoped to the current git project when using the global index - files from other indexed projects are silently filtered out.

Duplicates command

Less code means fewer tokens.

ecotokens duplicates scans the indexed codebase for near-identical code blocks and reports them grouped by similarity.

ecotokens duplicates                          # default: threshold=70%, min_lines=5
ecotokens duplicates --threshold 80           # only report ≥ 80% similarity
ecotokens duplicates --min-lines 10           # ignore blocks shorter than 10 lines
ecotokens duplicates --json                   # JSON output

Each group shows the file paths, line ranges, similarity score, and a refactoring proposal (exact duplicate, near duplicate, or subset).

Configuration

ecotokens config           # show all settings (text)
ecotokens config --json    # show all settings (JSON)

Output includes:

hook_installed        : true
debug                 : false
debuglog              : false
default_model         : claude-sonnet-4-6
exclusions            : []
embed_provider        : candle (sentence-transformers/all-MiniLM-L6-v2)
ai_summary_enabled    : false
ai_summary_model      : llama3.2:3b (default)
ai_summary_url        : http://localhost:11434 (default)
abbreviations_enabled : false

Debug mode

Enable the global debug mode to see detailed interception logs and enable background logging for the watch command:

ecotokens config --debug true
ecotokens config --debug false

This updates the debug field in ~/.config/ecotokens/config.json.

Debug file logging

Enable structured per-hook logging to a file for deeper tracing of what ecotokens intercepts:

ecotokens config --debuglog true
ecotokens config --debuglog false

When enabled, every hook invocation appends a JSONL entry to ~/.config/ecotokens/debug.log:

{"ts":"2026-05-08T12:00:00Z","uid":"a1b2c3d4","cmd":"git status","phase":"input","data":{...}}
{"ts":"2026-05-08T12:00:00Z","uid":"a1b2c3d4","cmd":"git status","phase":"output","data":{...}}
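
Because the log is JSONL, each line can be parsed on its own. The sketch below groups entries by `uid`, assuming (as the sample suggests) that the input and output phases of one invocation share a uid. `pair_phases` is a hypothetical helper, and the elided `data` payload is replaced by an empty object so the sample lines are valid JSON.

```python
import json

def pair_phases(lines):
    """Group debug-log entries by uid: {uid: {"input": entry, "output": entry}}."""
    pairs = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        entry = json.loads(line)
        pairs.setdefault(entry["uid"], {})[entry["phase"]] = entry
    return pairs

sample = [
    '{"ts":"2026-05-08T12:00:00Z","uid":"a1b2c3d4","cmd":"git status","phase":"input","data":{}}',
    '{"ts":"2026-05-08T12:00:00Z","uid":"a1b2c3d4","cmd":"git status","phase":"output","data":{}}',
]
pairs = pair_phases(sample)
for uid, phases in pairs.items():
    print(uid, phases["input"]["cmd"], sorted(phases))
```

To read the real file, pass an open handle on ~/.config/ecotokens/debug.log instead of `sample`.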

Default model for cost calculations

The model selected here determines the per-token price used in gain reports:

ecotokens config --model claude-opus-4-7    # set default model
ecotokens config --model ""                 # list available models
ecotokens config --model unknown-model      # unknown model → lists available models

The model name must be present in the built-in pricing table (or added via ~/.config/ecotokens/pricing.json). Passing an empty value or an unrecognised name prints the full list and exits.

See the full list of built-in models and prices in docs/models.md.

Override any entry or add a new model by creating ~/.config/ecotokens/pricing.json:

{
  "my-custom-model": { "input_usd_per_1m": 0.50, "output_usd_per_1m": 2.00 }
}
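
As a worked example of how a per-token price turns into a dollar figure, the sketch below prices the saved tokens from the benchmark at the custom model's input rate. Pricing savings at the input rate is an assumption of this sketch (filtered output would otherwise have entered the context as input); the exact formula used by `ecotokens gain` is not stated here, and `saved_usd` is a hypothetical name.

```python
def saved_usd(tokens_saved: int, input_usd_per_1m: float) -> float:
    """Dollar value of saved tokens at a given price per million input tokens."""
    return tokens_saved / 1_000_000 * input_usd_per_1m

# Same shape as the pricing.json entry above.
pricing = {"my-custom-model": {"input_usd_per_1m": 0.50, "output_usd_per_1m": 2.00}}
estimate = saved_usd(88_407_191, pricing["my-custom-model"]["input_usd_per_1m"])
print(f"{estimate:.2f} USD")
```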

Word abbreviations (optional)

ecotokens abbreviations enable    # transform narrative text + inject model instruction
ecotokens abbreviations list      # show the active dictionary
ecotokens abbreviations disable   # back to default

Extend or override the built-in dictionary via a separate ~/.config/ecotokens/abbreviations.json file:

{
  "function": "func",
  "repository": "repo"
}

The feature flag stays in ~/.config/ecotokens/config.json:

{
  "abbreviations_enabled": true
}
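
The merge of defaults and user overrides can be sketched as a dictionary update followed by whole-word replacement. This is an illustration only: `DEFAULTS` holds just the two pairs named earlier in this README, `abbreviate` is a hypothetical function, and the real matching rules (case handling, what counts as narrative text) may differ.

```python
import re

DEFAULTS = {"function": "fn", "configuration": "config"}  # two built-in pairs named earlier
USER = {"function": "func", "repository": "repo"}         # the abbreviations.json shown above

def abbreviate(text: str, dictionary: dict) -> str:
    """Replace whole words found in dictionary with their abbreviation."""
    if not dictionary:
        return text
    words = sorted(dictionary, key=len, reverse=True)  # longest first
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, words)) + r")\b")
    return pattern.sub(lambda match: dictionary[match.group(1)], text)

active = {**DEFAULTS, **USER}  # user entries override defaults
result = abbreviate("The function reads the configuration of the repository.", active)
print(result)
```

The user's `function` → `func` entry wins over the default `fn`, which is the override behavior the section describes.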

Supported command families

Family	Examples
`git`	`git status`, `git diff`, `git log`
`cargo`	`cargo build`, `cargo test`, `cargo clippy`
`python`	`pytest`, `ruff`, `uv run`, `poetry`, `pipx`
`javascript`	`jest`, `mocha`, `yarn`
`cpp`	`gcc`, `clang`, `make`, `cmake`
`fs`	`ls`, `find`, `tree`
`markdown`	`.md` files
`config`	`.toml`, `.json`, `.yaml`
`generic`	Everything else (truncated to 200 lines / 50 KB)
`native_read`	Claude Code `Read` tool results (PostToolUse, outline-based compression)

Note: Family detection uses the basename of the first token, so commands invoked via absolute path (/usr/bin/git), venv (.venv/bin/pytest), version managers (~/.cargo/bin/cargo), or wrappers (poetry run) are correctly matched to their family.
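
The basename rule in the note can be sketched in a few lines. The lookup table below is built from the examples in the families table; `detect_family` and `FAMILY_BY_COMMAND` are hypothetical names, and the real detector may recognize more commands.

```python
import os
import shlex

FAMILY_BY_COMMAND = {
    "git": "git", "cargo": "cargo",
    "pytest": "python", "ruff": "python", "uv": "python",
    "poetry": "python", "pipx": "python",
    "jest": "javascript", "mocha": "javascript", "yarn": "javascript",
    "gcc": "cpp", "clang": "cpp", "make": "cpp", "cmake": "cpp",
    "ls": "fs", "find": "fs", "tree": "fs",
}

def detect_family(command_line: str) -> str:
    """Pick a family from the basename of the first token of a command line."""
    tokens = shlex.split(command_line)
    if not tokens:
        return "generic"
    return FAMILY_BY_COMMAND.get(os.path.basename(tokens[0]), "generic")

for cmd in ("/usr/bin/git status", ".venv/bin/pytest -q",
            "~/.cargo/bin/cargo build", "poetry run pytest", "htop"):
    print(cmd, "->", detect_family(cmd))
```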

Embeddings

Provider

# Candle is active by default - nothing to configure
ecotokens index --path /your/project
ecotokens search "your query"

Each result includes a retrieval_source field (bm25, vector, or both) visible in JSON output.

Disable embeddings

ecotokens config --embed-provider none    # fall back to pure BM25

Workflow

# 1. Index your project (Candle embeddings computed automatically)
ecotokens index --path /your/project

# 2. Search with hybrid scoring
ecotokens search "your query"

# 3. JSON output with retrieval_source
ecotokens search "your query" --json

Model change detection

AI summarization (optional)

When enabled, large command outputs (> ~2500 tokens) are summarized by a local Ollama model instead of being truncated. ecotokens falls back to generic filtering if Ollama is unavailable or times out.

Enable via install:

ecotokens install --ai-summary-model llama3.2:3b

Or update the config file directly (~/.config/ecotokens/config.json):

{
  "ai_summary_enabled": true,
  "ai_summary_model": "llama3.2:3b"
}

Ollama must be running locally. The Ollama model is called with a 3-second timeout so that summarization does not block the coding agent.
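
The threshold, timeout, and fallback behavior can be sketched as follows. This is a hedged illustration rather than the ecotokens implementation: `summarize_or_fallback` and the prompt text are assumptions, the token estimate is the character heuristic, and the request uses Ollama's `/api/generate` endpoint with streaming turned off.

```python
import http.client
import json
import urllib.request

def summarize_or_fallback(text, fallback, url="http://localhost:11434",
                          model="llama3.2:3b", min_tokens=2500, timeout=3.0):
    """Summarize large text via a local Ollama server; use fallback(text) on failure."""
    if len(text) // 4 <= min_tokens:  # rough estimate: 4 characters per token
        return text                   # not large enough to summarize
    body = json.dumps({
        "model": model,
        "prompt": "Summarize this command output:\n" + text,  # hypothetical prompt
        "stream": False,
    }).encode("utf-8")
    request = urllib.request.Request(
        url + "/api/generate", data=body,
        headers={"Content-Type": "application/json"},
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            summary = json.loads(response.read())["response"].strip()
    except (OSError, ValueError, KeyError, http.client.HTTPException):
        return fallback(text)  # server down, timeout, or malformed reply
    return summary or fallback(text)
```

Here `fallback` stands for the generic filter, for example `summarize_or_fallback(output, generic_filter)` with a generic filter of your own.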

Benchmarks

Measured on a real developer workstation from 2026-03-06 to 2026-05-27 (19 928 hook executions):

Metric	Value
Tokens before filtering	94 282 087
Tokens after filtering	5 874 896
Tokens saved	88 407 191
Overall reduction	93.8 %
Commands with savings	5 735 / 19 928, or 28.8 %
Biggest family	`grep`, 55 383 168 tokens saved
Best single run	11 323 890 tokens saved from one `grep` scan
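
The derived figures in the table follow directly from the raw counts, as this short check shows (variable names are illustrative):

```python
before, after = 94_282_087, 5_874_896
saved = before - after                              # tokens saved
reduction_pct = round(100 * saved / before, 1)      # overall reduction
with_savings_pct = round(100 * 5_735 / 19_928, 1)   # commands with savings

print(saved, reduction_pct, with_savings_pct)
```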

Full benchmark report

Precision Guarantees

Filtering is aggressive on noise, conservative on signal:

Short outputs are never modified - outputs under 200 lines or 50 KB pass through unchanged
Errors are always preserved - error[, FAILED, E (pytest), --- FAIL: (Go), stack traces and panic messages are never removed
Failure sections are fully kept - structured blocks (=== FAILURES ===, failures:, failure diffs) are always passed through in their entirety
Conservative fallback - if a family filter doesn't improve the output (filtered ≥ original), the original is returned as-is
Secrets are redacted before filtering - 33 patterns covering cloud keys, AI APIs, VCS tokens, payment secrets and more are detected and replaced before any content reaches the model. See docs/secret-patterns.md for the full list.
UTF-8 safe truncation - truncation always happens at character boundaries, never mid-codepoint
Head + tail preservation - when generic truncation applies, the first and last 20 lines are always kept (start context + end result)
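
Three of these guarantees (short-output passthrough, head + tail preservation, and UTF-8 safe truncation) plus the conservative fallback can be sketched together. This is a simplified illustration, not the ecotokens code: the function names and the omission marker are hypothetical, both limits are treated as "must be within" conditions, and error-line preservation is not modeled.

```python
MAX_LINES, MAX_BYTES, KEEP = 200, 50 * 1024, 20

def truncate_utf8(text: str, max_bytes: int) -> str:
    """Cut to at most max_bytes of UTF-8 without splitting a code point."""
    data = text.encode("utf-8")
    if len(data) <= max_bytes:
        return text
    # A trailing partial sequence is dropped instead of producing invalid text.
    return data[:max_bytes].decode("utf-8", errors="ignore")

def head_tail(text: str, keep: int = KEEP) -> str:
    """Keep the first and last `keep` lines, replacing the middle with a marker."""
    lines = text.splitlines()
    if len(lines) <= 2 * keep:
        return text
    omitted = len(lines) - 2 * keep
    return "\n".join(lines[:keep] + [f"... {omitted} lines omitted ..."] + lines[-keep:])

def generic_filter(text: str) -> str:
    """Pass short outputs through; otherwise keep head + tail within the byte cap."""
    if len(text.splitlines()) <= MAX_LINES and len(text.encode("utf-8")) <= MAX_BYTES:
        return text
    return truncate_utf8(head_tail(text), MAX_BYTES)

def conservative(original: str, filter_fn) -> str:
    """Return the filtered text only if it is actually shorter than the original."""
    filtered = filter_fn(original)
    return filtered if len(filtered) < len(original) else original

long_output = "\n".join(f"line {i}" for i in range(300))
kept = conservative(long_output, generic_filter).splitlines()
print(len(kept), kept[0], kept[20], kept[-1])
```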

Requirements

Rust ≥ 1.75 (stable)
One or more of: Claude Code (with hook support), Gemini CLI ≥ 0.1.0, Qwen Code, Pi ≥ 0.62.0, Hermes, Codex
Internet access on first use (Candle downloads all-MiniLM-L6-v2 ~90 MB from HuggingFace Hub; cached locally after that)
Ollama (optional, for AI summarization only)

Contributing

Contributions are welcome! Please read the contributing guidelines before submitting a pull request.

License

MIT
