freshcrate

caveman

πŸͺ¨ why use many token when few token do trick β€” Claude Code skill that cuts 65% of tokens by talking like caveman

Description

πŸͺ¨ why use many token when few token do trick β€” Claude Code skill that cuts 65% of tokens by talking like caveman

README

caveman

why use many token when few do trick

StarsLast CommitLicense Before/After β€’ Install β€’ Levels β€’ Skills β€’ Benchmarks β€’ Evals


A Claude Code skill/plugin and Codex plugin that makes agent talk like caveman β€” cutting ~75% of output tokens while keeping full technical accuracy. Now with 文言文 mode, terse commits, one-line code reviews, and a compression tool that cuts ~45% of input tokens every session.

Based on the viral observation that caveman-speak dramatically reduces LLM token usage without losing technical substance. So we made it a one-line install.

Before / After

πŸ—£οΈ Normal Claude (69 tokens)

"The reason your React component is re-rendering is likely because you're creating a new object reference on each render cycle. When you pass an inline object as a prop, React's shallow comparison sees it as a different object every time, which triggers a re-render. I'd recommend using useMemo to memoize the object."

πŸͺ¨ Caveman Claude (19 tokens)

"New object ref each render. Inline object prop = new ref = re-render. Wrap in useMemo."

πŸ—£οΈ Normal Claude

"Sure! I'd be happy to help you with that. The issue you're experiencing is most likely caused by your authentication middleware not properly validating the token expiry. Let me take a look and suggest a fix."

πŸͺ¨ Caveman Claude

"Bug in auth middleware. Token expiry check use < not <=. Fix:"

Same fix. 75% less word. Brain still big.

Pick your level of grunt:

πŸͺΆ Lite

"Your component re-renders because you create a new object reference each render. Inline object props fail shallow comparison every time. Wrap it in useMemo."

πŸͺ¨ Full

"New object ref each render. Inline object prop = new ref = re-render. Wrap in useMemo."

πŸ”₯ Ultra

"Inline obj prop β†’ new ref β†’ re-render. useMemo."

πŸ“œ 文言文

"η‰©ε‡Ίζ–°εƒη…§οΌŒθ‡΄ι‡ηΉͺ。useMemo Wrap之。"

Same answer. You pick how many word.

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚  TOKENS SAVED          β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 75% β”‚
β”‚  TECHNICAL ACCURACY    β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 100%β”‚
β”‚  SPEED INCREASE        β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ ~3x β”‚
β”‚  VIBES                 β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ OOG β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
  • Faster response β€” less token to generate = speed go brrr
  • Easier to read β€” no wall of text, just the answer
  • Same accuracy β€” all technical info kept, only fluff removed (science say so)
  • Save money β€” ~71% less output token = less cost
  • Fun β€” every code review become comedy

Install

npx skills add JuliusBrussee/caveman

npx skills supports 40+ agents β€” Claude Code, GitHub Copilot, Cursor, Windsurf, Cline, and more. To install for a specific agent:

npx skills add JuliusBrussee/caveman -a cursor
npx skills add JuliusBrussee/caveman -a github-copilot
npx skills add JuliusBrussee/caveman -a cline
npx skills add JuliusBrussee/caveman -a windsurf
npx skills add JuliusBrussee/caveman -a codex

Or with Claude Code plugin system:

claude plugin marketplace add JuliusBrussee/caveman
claude plugin install caveman@caveman

Codex:

  1. Clone repo
  2. Open Codex in repo
  3. Run /plugins
  4. Search Caveman
  5. Install plugin

Note

Windows Codex users: Clone repo β†’ VS Code β†’ Codex Settings β†’ Plugins β†’ find Caveman under local marketplace β†’ Install β†’ Reload Window. Also enable git config core.symlinks true before cloning (requires developer mode or admin).

Install once. Use in all sessions after that. One rock. That it.

Usage

Trigger with:

  • /caveman or Codex $caveman
  • "talk like caveman"
  • "caveman mode"
  • "less tokens please"

Stop with: "stop caveman" or "normal mode"

Intensity Levels

Level Trigger What it do
Lite /caveman lite Drop filler, keep grammar. Professional but no fluff
Full /caveman full Default caveman. Drop articles, fragments, full grunt
Ultra /caveman ultra Maximum compression. Telegraphic. Abbreviate everything

文言文 (Wenyan) Mode

Classical Chinese literary compression β€” same technical accuracy, but in the most token-efficient written language humans ever invented.

Level Trigger What it do
Wenyan-Lite /caveman wenyan-lite Semi-classical. Grammar intact, filler gone
Wenyan-Full /caveman wenyan Full 文言文. Maximum classical terseness
Wenyan-Ultra /caveman wenyan-ultra Extreme. Ancient scholar on a budget

Level stick until you change it or session end.

Caveman Skills

Skill What it do Trigger
caveman-commit Terse commit messages. Conventional Commits. ≀50 char subject. Why over what. /caveman-commit
caveman-review One-line PR comments: L42: πŸ”΄ bug: user null. Add guard. No throat-clearing. /caveman-review

caveman-compress

Caveman make Claude speak with fewer tokens. Compress make Claude read fewer tokens.

Your CLAUDE.md loads on every session start. Caveman Compress rewrites memory files into caveman-speak so Claude reads less β€” without you losing the human-readable original.

/caveman:compress CLAUDE.md
CLAUDE.md          ← compressed (Claude reads this every session β€” fewer tokens)
CLAUDE.original.md ← human-readable backup (you read and edit this)
File Original Compressed Saved
claude-md-preferences.md 706 285 59.6%
project-notes.md 1145 535 53.3%
claude-md-project.md 1122 687 38.8%
todo-list.md 627 388 38.1%
mixed-with-code.md 888 574 35.4%
Average 898 494 45%

Code blocks, URLs, file paths, commands, headings, dates, version numbers β€” anything technical passes through untouched. Only prose gets compressed. See the full caveman-compress README for details. Security note: Snyk flags this as High Risk due to subprocess/file patterns β€” it's a false positive.

Benchmarks

Real token counts from the Claude API (reproduce it yourself):

Task Normal (tokens) Caveman (tokens) Saved
Explain React re-render bug 1180 159 87%
Fix auth middleware token expiry 704 121 83%
Set up PostgreSQL connection pool 2347 380 84%
Explain git rebase vs merge 702 292 58%
Refactor callback to async/await 387 301 22%
Architecture: microservices vs monolith 446 310 30%
Review PR for security issues 678 398 41%
Docker multi-stage build 1042 290 72%
Debug PostgreSQL race condition 1200 232 81%
Implement React error boundary 3454 456 87%
Average 1214 294 65%

Range: 22%–87% savings across prompts.

Important

Caveman only affects output tokens β€” thinking/reasoning tokens are untouched. Caveman no make brain smaller. Caveman make mouth smaller. Biggest win is readability and speed, cost savings are a bonus.

A March 2026 paper "Brevity Constraints Reverse Performance Hierarchies in Language Models" found that constraining large models to brief responses improved accuracy by 26 percentage points on certain benchmarks and completely reversed performance hierarchies. Verbose not always better. Sometimes less word = more correct.

Evals

Caveman not just claim 75%. Caveman prove it.

The evals/ directory has a three-arm eval harness that measures real token compression against a proper control β€” not just "verbose vs skill" but "terse vs skill". Because comparing caveman to verbose Claude conflate the skill with generic terseness. That cheating. Caveman not cheat.

# Run the eval (needs claude CLI)
uv run python evals/llm_run.py

# Read results (no API key, runs offline)
uv run --with tiktoken python evals/measure.py

Snapshots committed to git. CI runs free. Every number change reviewable as diff. Add a skill, add a prompt β€” harness pick it up automatically.

Star This Repo

If caveman save you mass token, mass money β€” leave mass star. ⭐

Star History Chart

Also by Julius Brussee

  • Blueprint β€” specification-driven development for Claude Code. Natural language β†’ blueprints β†’ parallel builds β†’ working software.
  • Revu β€” local-first macOS study app with FSRS spaced repetition, decks, exams, and study guides. revu.cards

License

MIT β€” free like mass mammoth on open plain.

Release History

VersionChangesUrgencyDate
v1.6.0## v1.6.0 β€” Hardening release 11 community PRs merged plus a hardened security model for the flag file. This release fixes two real crash bugs that were silently breaking installs, two local-file-clobber vulnerabilities, and several portability gaps. ### Critical fixes - **Hooks no longer crash when an ancestor `package.json` declares `"type": "module"`.** Before this fix, any user with `~/.claude/package.json` set to ESM (common with several Claude Code plugins) hit `ReferenceError: require High4/15/2026
v1.5.1## Runtime SKILL.md loading Activation hook now reads `skills/caveman/SKILL.md` at runtime instead of hardcoding rules inline. Edits to the source of truth propagate automatically β€” no duplication to go stale. - Plugin installs resolve SKILL.md relative to the plugin root - Standalone installs (hooks only, no skills dir) fall back to a built-in minimal ruleset - `commit`/`review`/`compress` modes skip SKILL.md machinery entirely β€” they have their own independent skill files ## Docs - **ConsoHigh4/11/2026
v1.5.0## Configurable default mode Default mode is now configurable instead of always starting at `full`. Resolution order: 1. `CAVEMAN_DEFAULT_MODE` environment variable (highest priority) 2. Config file at `~/.config/caveman/config.json` (XDG-compliant, cross-platform) 3. `'full'` (unchanged default β€” fully backward compatible) ```bash export CAVEMAN_DEFAULT_MODE=ultra ``` ```json { "defaultMode": "lite" } ``` All install/uninstall scripts (bash + PowerShell) updated. Invalid modes silently falHigh4/11/2026
v1.3.0## What's New ### πŸ“œ 文言文 (Wenyan) Mode Classical Chinese literary compression β€” same technical accuracy, different era, fewer tokens. Three levels: wenyan-lite, wenyan-full, wenyan-ultra. ``` English: "Your component re-renders because you create a new object reference each render." Caveman: "New object ref each render. Wrap in useMemo." Wenyan: "η‰©ε‡Ίζ–°εƒη…§οΌŒθ‡΄ι‡ηΉͺ。useMemo Wrap之。" ``` ### πŸ› οΈ New Skills - **caveman-commit** β€” terse commit messages in Conventional Commits format. `/caveman-commit` High4/8/2026

Dependencies & License Audit

Loading dependencies...

Similar Packages

structured-prompt-skill✍️ Write effective AI prompts with this structured prompt engineering library and Claude Code skill, featuring 300+ curated examples for high-quality results.main@2026-04-21
brand-consistency-ai-skillEnsure marketing content stays on-brand by using AI to generate, audit, and validate materials across platforms with any large language model.main@2026-04-21
system_prompts_leaksExtracted system prompts from ChatGPT (GPT-5.4, GPT-5.3, Codex), Claude (Opus 4.6, Sonnet 4.6, Claude Code), Gemini (3.1 Pro, 3 Flash, CLI), Grok (4.2, 4), Perplexity, and more. Updated regularly.main@2026-04-20
agentroveYour own Claude Code UI, sandbox, in-browser VS Code, terminal, multi-provider support (Anthropic, OpenAI, GitHub Copilot, OpenRouter), custom skills, and MCP servers.v0.1.30
hermes-agentThe agent that grows with youv2026.4.16