caveman

why use many token when few do trick

Before/After • Install • Levels • Skills • Benchmarks • Evals

🪨 Caveman Ecosystem · caveman talk less _{(you are here)} · cavemem remember more · cavekit build better

A Claude Code skill/plugin and Codex plugin that makes agent talk like caveman — cutting ~75% of output tokens while keeping full technical accuracy. Now with 文言文 mode, terse commits, one-line code reviews, and a compression tool that cuts ~46% of input tokens every session.

Based on the viral observation that caveman-speak dramatically reduces LLM token usage without losing technical substance. So we made it a one-line install.

Before / After

🗣️ Normal Claude (69 tokens) "The reason your React component is re-rendering is likely because you're creating a new object reference on each render cycle. When you pass an inline object as a prop, React's shallow comparison sees it as a different object every time, which triggers a re-render. I'd recommend using useMemo to memoize the object."	🪨 Caveman Claude (19 tokens) "New object ref each render. Inline object prop = new ref = re-render. Wrap in `useMemo`."
🗣️ Normal Claude "Sure! I'd be happy to help you with that. The issue you're experiencing is most likely caused by your authentication middleware not properly validating the token expiry. Let me take a look and suggest a fix."	🪨 Caveman Claude "Bug in auth middleware. Token expiry check use `<` not `<=`. Fix:"

Same fix. 75% less word. Brain still big.

Pick your level of grunt:

🪶 Lite

"Your component re-renders because you create a new object reference each render. Inline object props fail shallow comparison every time. Wrap it in useMemo."

🪨 Full

"New object ref each render. Inline object prop = new ref = re-render. Wrap in useMemo."

🔥 Ultra

"Inline obj prop → new ref → re-render. useMemo."

📜 文言文

"物出新參照，致重繪。useMemo Wrap之。"

Same answer. You pick how many word.

┌─────────────────────────────────────┐
│  TOKENS SAVED          ████████ 75% │
│  TECHNICAL ACCURACY    ████████ 100%│
│  SPEED INCREASE        ████████ ~3x │
│  VIBES                 ████████ OOG │
└─────────────────────────────────────┘

Faster response — less token to generate = speed go brrr
Easier to read — no wall of text, just the answer
Same accuracy — all technical info kept, only fluff removed (science say so)
Save money — ~71% less output token = less cost
Fun — every code review become comedy

Install

Pick your agent. One command. Done.

Agent	Install
Claude Code	`claude plugin marketplace add JuliusBrussee/caveman && claude plugin install caveman@caveman`
Codex	Clone repo → `/plugins` → Search "Caveman" → Install
Gemini CLI	`gemini extensions install https://github.com/JuliusBrussee/caveman`
Cursor	`npx skills add JuliusBrussee/caveman -a cursor`
Windsurf	`npx skills add JuliusBrussee/caveman -a windsurf`
Copilot	`npx skills add JuliusBrussee/caveman -a github-copilot`
Cline	`npx skills add JuliusBrussee/caveman -a cline`
Any other	`npx skills add JuliusBrussee/caveman`

Install once. Use in every session for that install target after that. One rock. That it.

What You Get

Auto-activation is built in for Claude Code, Gemini CLI, and the repo-local Codex setup below. npx skills add installs the skill for other agents, but does not install repo rule/instruction files, so Caveman does not auto-start there unless you add the always-on snippet below.

Feature	Claude Code	Codex	Gemini CLI	Cursor	Windsurf	Cline	Copilot
Caveman mode	Y	Y	Y	Y	Y	Y	Y
Auto-activate every session	Y	Y¹	Y	—²	—²	—²	—²
`/caveman` command	Y	Y¹	Y	—	—	—	—
Mode switching (lite/full/ultra)	Y	Y¹	Y	Y³	Y³	—	—
Statusline badge	Y⁴	—	—	—	—	—	—
caveman-commit	Y	—	Y	Y	Y	Y	Y
caveman-review	Y	—	Y	Y	Y	Y	Y
caveman-compress	Y	Y	Y	Y	Y	Y	Y
caveman-help	Y	—	Y	Y	Y	Y	Y

Note

Auto-activation works differently per agent: Claude Code uses SessionStart hooks, this repo's Codex dogfood setup uses .codex/hooks.json, Gemini uses context files. Cursor/Windsurf/Cline/Copilot can be made always-on, but npx skills add installs only the skill, not the repo rule/instruction files.

¹ Codex uses $caveman syntax, not /caveman. This repo ships .codex/hooks.json, so caveman auto-starts when you run Codex inside this repo. The installed plugin itself gives you $caveman; copy the same hook into another repo if you want always-on behavior there too. caveman-commit and caveman-review are not in the Codex plugin bundle — use the SKILL.md files directly. ² Add the "Want it always on?" snippet below to those agents' system prompt or rule file if you want session-start activation. ³ Cursor and Windsurf receive the full SKILL.md with all intensity levels. Mode switching works on-demand via the skill; no slash command. ⁴ Available in Claude Code, but plugin install only nudges setup. Standalone install.sh / install.ps1 configures it automatically when no custom statusLine exists.

Claude Code — full details

The plugin install gives you skills + auto-loading hooks. If no custom statusLine is configured, Caveman nudges Claude to offer badge setup on first session.

claude plugin marketplace add JuliusBrussee/caveman
claude plugin install caveman@caveman

Standalone hooks (without plugin): If you prefer not to use the plugin system:

# macOS / Linux / WSL
bash <(curl -s https://raw.githubusercontent.com/JuliusBrussee/caveman/main/hooks/install.sh)

# Windows (PowerShell)
irm https://raw.githubusercontent.com/JuliusBrussee/caveman/main/hooks/install.ps1 | iex

Or from a local clone: bash hooks/install.sh / powershell -File hooks\install.ps1

Uninstall: bash hooks/uninstall.sh or powershell -File hooks\uninstall.ps1

Statusline badge: Shows [CAVEMAN], [CAVEMAN:ULTRA], etc. in your Claude Code status bar.

Plugin install: If you do not already have a custom statusLine, Claude should offer to configure it on first session
Standalone install: Configured automatically by install.sh / install.ps1 unless you already have a custom statusline
Custom statusline: Installer leaves your existing statusline alone. See hooks/README.md for the merge snippet

Codex — full details

macOS / Linux:

Clone repo → Open Codex in the repo directory → /plugins → Search "Caveman" → Install
Repo-local auto-start is already wired by .codex/hooks.json + .codex/config.toml

Windows:

Enable symlinks first: git config --global core.symlinks true (requires Developer Mode or admin)
Clone repo → Open VS Code → Codex Settings → Plugins → find "Caveman" under local marketplace → Install → Reload Window
Codex hooks are currently disabled on Windows, so use $caveman to start manually

This repo also ships .codex/hooks.json and enables hooks in .codex/config.toml, so caveman auto-activates while you run Codex inside this repo on macOS/Linux. The installed plugin gives you $caveman; if you want always-on behavior in other repos too, copy the same SessionStart hook there and enable:

[features]
codex_hooks = true

Gemini CLI — full details

gemini extensions install https://github.com/JuliusBrussee/caveman

Update: gemini extensions update caveman · Uninstall: gemini extensions uninstall caveman

Auto-activates via GEMINI.md context file. Also ships custom Gemini commands:

/caveman — switch intensity level (lite/full/ultra/wenyan)
/caveman-commit — generate terse commit message
/caveman-review — one-line code review

Cursor / Windsurf / Cline / Copilot — full details

npx skills add installs the skill file only — it does not install the agent's rule/instruction file, so caveman does not auto-start. For always-on, add the "Want it always on?" snippet below to your agent's rules or system prompt.

Agent	Command	Not installed	Mode switching	Always-on location
Cursor	`npx skills add JuliusBrussee/caveman -a cursor`	`.cursor/rules/caveman.mdc`	Y	Cursor rules
Windsurf	`npx skills add JuliusBrussee/caveman -a windsurf`	`.windsurf/rules/caveman.md`	Y	Windsurf rules
Cline	`npx skills add JuliusBrussee/caveman -a cline`	`.clinerules/caveman.md`	—	Cline rules or system prompt
Copilot	`npx skills add JuliusBrussee/caveman -a github-copilot`	`.github/copilot-instructions.md` + `AGENTS.md`	—	Copilot custom instructions

Uninstall: npx skills remove caveman

Copilot works with Chat, Edits, and Coding Agent.

Any other agent (opencode, Roo, Amp, Goose, Kiro, and 40+ more)

npx skills supports 40+ agents:

npx skills add JuliusBrussee/caveman           # auto-detect agent
npx skills add JuliusBrussee/caveman -a amp
npx skills add JuliusBrussee/caveman -a augment
npx skills add JuliusBrussee/caveman -a goose
npx skills add JuliusBrussee/caveman -a kiro-cli
npx skills add JuliusBrussee/caveman -a roo
# ... and many more

Uninstall: npx skills remove caveman

Windows note: npx skills uses symlinks by default. If symlinks fail, add --copy: npx skills add JuliusBrussee/caveman --copy

Important: These agents don't have a hook system, so caveman won't auto-start. Say /caveman or "talk like caveman" to activate each session.

Want it always on? Paste this into your agent's system prompt or rules file — caveman will be active from the first message, every session:

Terse like caveman. Technical substance exact. Only fluff die.
Drop: articles, filler (just/really/basically), pleasantries, hedging.
Fragments OK. Short synonyms. Code unchanged.
Pattern: [thing] [action] [reason]. [next step].
ACTIVE EVERY RESPONSE. No revert after many turns. No filler drift.
Code/commits/PRs: normal. Off: "stop caveman" / "normal mode".

Where to put it:

Agent	File
opencode	`.config/opencode/AGENTS.md`
Roo	`.roo/rules/caveman.md`
Amp	your workspace system prompt
Others	your agent's system prompt or rules file

Usage

Trigger with:

/caveman or Codex $caveman
"talk like caveman"
"caveman mode"
"less tokens please"

Stop with: "stop caveman" or "normal mode"

Intensity Levels

Level	Trigger	What it do
Lite	`/caveman lite`	Drop filler, keep grammar. Professional but no fluff
Full	`/caveman full`	Default caveman. Drop articles, fragments, full grunt
Ultra	`/caveman ultra`	Maximum compression. Telegraphic. Abbreviate everything

文言文 (Wenyan) Mode

Classical Chinese literary compression — same technical accuracy, but in the most token-efficient written language humans ever invented.

Level	Trigger	What it do
Wenyan-Lite	`/caveman wenyan-lite`	Semi-classical. Grammar intact, filler gone
Wenyan-Full	`/caveman wenyan`	Full 文言文. Maximum classical terseness
Wenyan-Ultra	`/caveman wenyan-ultra`	Extreme. Ancient scholar on a budget

Level stick until you change it or session end.

Caveman Skills

caveman-commit

/caveman-commit — terse commit messages. Conventional Commits. ≤50 char subject. Why over what.

caveman-review

/caveman-review — one-line PR comments: L42: 🔴 bug: user null. Add guard. No throat-clearing.

caveman-help

/caveman-help — quick-reference card. All modes, skills, commands, one command away.

caveman-compress

/caveman:compress <filepath> — caveman make Claude speak with fewer tokens. Compress make Claude read fewer tokens.

Your CLAUDE.md loads on every session start. Caveman Compress rewrites memory files into caveman-speak so Claude reads less — without you losing the human-readable original.

/caveman:compress CLAUDE.md

CLAUDE.md          ← compressed (Claude reads this every session — fewer tokens)
CLAUDE.original.md ← human-readable backup (you read and edit this)

File	Original	Compressed	Saved
`claude-md-preferences.md`	706	285	59.6%
`project-notes.md`	1145	535	53.3%
`claude-md-project.md`	1122	636	43.3%
`todo-list.md`	627	388	38.1%
`mixed-with-code.md`	888	560	36.9%
Average	898	481	46%

Code blocks, URLs, file paths, commands, headings, dates, version numbers — anything technical passes through untouched. Only prose gets compressed. See the full caveman-compress README for details. Security note: Snyk flags this as High Risk due to subprocess/file patterns — it's a false positive.

Benchmarks

Real token counts from the Claude API (reproduce it yourself):

Task	Normal (tokens)	Caveman (tokens)	Saved
Explain React re-render bug	1180	159	87%
Fix auth middleware token expiry	704	121	83%
Set up PostgreSQL connection pool	2347	380	84%
Explain git rebase vs merge	702	292	58%
Refactor callback to async/await	387	301	22%
Architecture: microservices vs monolith	446	310	30%
Review PR for security issues	678	398	41%
Docker multi-stage build	1042	290	72%
Debug PostgreSQL race condition	1200	232	81%
Implement React error boundary	3454	456	87%
Average	1214	294	65%

Range: 22%–87% savings across prompts.

Important

Caveman only affects output tokens — thinking/reasoning tokens are untouched. Caveman no make brain smaller. Caveman make mouth smaller. Biggest win is readability and speed, cost savings are a bonus.

A March 2026 paper "Brevity Constraints Reverse Performance Hierarchies in Language Models" found that constraining large models to brief responses improved accuracy by 26 percentage points on certain benchmarks and completely reversed performance hierarchies. Verbose not always better. Sometimes less word = more correct.

Evals

Caveman not just claim 75%. Caveman prove it.

The evals/ directory has a three-arm eval harness that measures real token compression against a proper control — not just "verbose vs skill" but "terse vs skill". Because comparing caveman to verbose Claude conflate the skill with generic terseness. That cheating. Caveman not cheat.

# Run the eval (needs claude CLI)
uv run python evals/llm_run.py

# Read results (no API key, runs offline)
uv run --with tiktoken python evals/measure.py

Star This Repo

If caveman save you mass token, mass money — leave mass star. ⭐

🪨 The Caveman Ecosystem

Three tools. One philosophy: agent do more with less.

Repo	What	One-liner
caveman (you are here)	Output compression skill	why use many token when few do trick — ~75% fewer output tokens across Claude Code, Cursor, Gemini, Codex
cavemem	Cross-agent persistent memory	why agent forget when agent can remember — compressed SQLite + MCP, local by default
cavekit	Spec-driven autonomous build loop	why agent guess when agent can know — natural language → kits → parallel build → verified

They compose: cavekit orchestrates the build, caveman compresses what the agent says, cavemem compresses what the agent remembers. Install one, some, or all — each stands alone.

Also by Julius Brussee

Revu — local-first macOS study app with FSRS spaced repetition, decks, exams, and study guides. revu.cards

License

MIT — free like mass mammoth on open plain.

Version	Changes	Urgency	Date
v1.8.2	## Fixes - `fix(install)`: curl\|bash one-liner now actually installs skills (#370) Upstream `skills` CLI rendered its interactive skill-picker TUI under `curl\|bash` (stdin not a TTY), got no selection, exited 0, and installed nothing — while our installer cheerfully reported `done`. Now passes `--yes --all` to both the per-provider call and the auto-detect fallback so every detected agent gets all 7 caveman skills, no prompt required. - **`fix(gemini)`: strip stray YAML fences fr	High	5/12/2026
v1.8.1	## Hotfix for v1.8.0 — curl\|bash one-liner The headline install command in v1.8.0 was broken at first contact: ``` $ curl -fsSL https://raw.githubusercontent.com/JuliusBrussee/caveman/main/install.sh \| bash -s -- --only openclaw bash: line 39: BASH_SOURCE[0]: unbound variable error: unknown flag: -- ``` Two regressions — neither caught by CI because both are stdin-pipe-only. ### What broke 1. `set -u` + `${BASH_SOURCE[0]}` is fatal under curl\|bash. When bash reads from stdin, `BASH_SOUR	High	5/10/2026
v1.7.0	## v1.7.0 — Stats receipts, smart installer, cavecrew, MCP-shrink The biggest release since v1.0. Caveman now has measurement (real token receipts, not vibes), an installer that detects 30+ agents and runs each one's native install, three caveman-mode subagents that emit ~60% fewer handoff tokens than vanilla, an MCP middleware that shrinks tool descriptions in flight, and a one-command rule-file dropper for any repo. Plus a critical macOS installer fix that was silently breaking detection for	High	5/1/2026
v1.6.0	## v1.6.0 — Hardening release 11 community PRs merged plus a hardened security model for the flag file. This release fixes two real crash bugs that were silently breaking installs, two local-file-clobber vulnerabilities, and several portability gaps. ### Critical fixes - Hooks no longer crash when an ancestor `package.json` declares `"type": "module"`. Before this fix, any user with `~/.claude/package.json` set to ESM (common with several Claude Code plugins) hit `ReferenceError: require	High	4/15/2026
v1.5.1	## Runtime SKILL.md loading Activation hook now reads `skills/caveman/SKILL.md` at runtime instead of hardcoding rules inline. Edits to the source of truth propagate automatically — no duplication to go stale. - Plugin installs resolve SKILL.md relative to the plugin root - Standalone installs (hooks only, no skills dir) fall back to a built-in minimal ruleset - `commit`/`review`/`compress` modes skip SKILL.md machinery entirely — they have their own independent skill files ## Docs - **Conso	High	4/11/2026
v1.5.0	## Configurable default mode Default mode is now configurable instead of always starting at `full`. Resolution order: 1. `CAVEMAN_DEFAULT_MODE` environment variable (highest priority) 2. Config file at `~/.config/caveman/config.json` (XDG-compliant, cross-platform) 3. `'full'` (unchanged default — fully backward compatible) ```bash export CAVEMAN_DEFAULT_MODE=ultra ``` ```json { "defaultMode": "lite" } ``` All install/uninstall scripts (bash + PowerShell) updated. Invalid modes silently fal	High	4/11/2026
v1.4.1	## Fix: Codex plugin compress skill broken on Windows The `compress` skill in the Codex plugin shipped as symlinks (`plugins/caveman/skills/compress/SKILL.md` and `scripts`). On Windows and any git setup with `core.symlinks=false`, these checked out as plain text files containing the target path — causing Codex to reject the skill with "missing YAML frontmatter." ### Changes - Replaced symlinks with real file copies (scripts identical to source, SKILL.md adapted for plugin context) - Added CI	Medium	4/11/2026
v1.4.0	## Highlights - Add Claude Code statusline badge support, including shell and PowerShell badge scripts. - Add standalone Claude hook install/uninstall flows for macOS/Linux and Windows, with safer settings merge behavior and better reinstall checks. - Add always-on rule/instruction files plus sync workflow for Cursor, Windsurf, Cline, Copilot, Codex, Gemini, and repo agent docs. - Add local verification coverage for hook install/uninstall flows, synced artifacts, manifests, syntax, and caveman-	Medium	4/11/2026
v1.3.5	## What's new - Plugin-bundled hooks — SessionStart and UserPromptSubmit hooks now ship in `plugin.json`. Install as a plugin and they auto-activate — no `install.sh` needed. - Mode-aware statusline badge — flag file at `~/.claude/.caveman-active` now stores the active mode (`full`, `lite`, `ultra`, `wenyan`, `commit`, `review`). Statusline scripts can show `[CAVEMAN:ULTRA]` etc. - UserPromptSubmit mode tracker — detects `/caveman ultra`, `/caveman-commit`, etc. and updates the flag	Medium	4/9/2026
v1.3.0	## What's New ### 📜 文言文 (Wenyan) Mode Classical Chinese literary compression — same technical accuracy, different era, fewer tokens. Three levels: wenyan-lite, wenyan-full, wenyan-ultra. ``` English: "Your component re-renders because you create a new object reference each render." Caveman: "New object ref each render. Wrap in useMemo." Wenyan: "物出新參照，致重繪。useMemo Wrap之。" ``` ### 🛠️ New Skills - caveman-commit — terse commit messages in Conventional Commits format. `/caveman-commit`	High	4/8/2026
v1.2.0	## What's new - Intensity levels — choose `lite`, `full` (default), or `ultra` caveman compression. Fine-grained control over how aggressive the token savings are. - Auto-Clarity — caveman mode automatically drops compression for security warnings and destructive operations. Safety-critical output stays crystal clear. - caveman-compress skill — new standalone skill that compresses natural language memory files (CLAUDE.md, todos, preferences) into caveman format. Preserves all techni	Medium	4/6/2026
v1.1.0	## What new - Reproducible benchmark system — `benchmarks/run.py` call Claude API, measure real output token counts normal vs caveman, auto-update README table. No more fake numbers. - Real benchmark data — 10 coding prompts, actual API measurements. Average 65% token savings (range 22%–87%). - Codex plugin support — caveman now work in OpenAI Codex too. - Contributing guide + issue templates for bug reports and feature requests. ## Run benchmarks yourself ```bash cd bench	Medium	4/5/2026
v1.0.0	## 🪨 v1.0.0 — why use many token when few token do trick First release. Caveman here. Save token. Save money. Brain still big. ### What caveman do Make Claude talk like smart caveman. Cut ~75% tokens. Keep all technical substance. Code stay normal. Only English get caveman treatment. ### Feature - ~75% less token — less cost, fast response, same accuracy - One rock install — `npx skills add JuliusBrussee/caveman` - Easy on — `/caveman`, "caveman mode", "less tokens" - **Easy of	Medium	4/4/2026

caveman

Description

README

caveman

Before / After

🗣️ Normal Claude (69 tokens)

🪨 Caveman Claude (19 tokens)

🗣️ Normal Claude

🪨 Caveman Claude

🪶 Lite

🪨 Full

🔥 Ultra

📜 文言文

Install

What You Get

Usage

Intensity Levels

文言文 (Wenyan) Mode

Caveman Skills

caveman-commit

caveman-review

caveman-help

caveman-compress

Benchmarks

Evals

Star This Repo

🪨 The Caveman Ecosystem

Also by Julius Brussee

License

Release History

Dependencies & License Audit

Similar Packages

More in Prompt Engineering