Question 1

Which is better: Caveman or Gemini 2.5 Flash (Stable) with Thinking Mode?

Accepted Answer

Based on our expert panel, Gemini 2.5 Flash (Stable) with Thinking Mode has the stronger verdict: it received Ship, with a 100% Ship rate, while Caveman received Mixed.

Question 2

Is Caveman free?

Accepted Answer

Yes. Caveman is free and open source.

Question 3

Is Gemini 2.5 Flash (Stable) with Thinking Mode free?

Accepted Answer

Partly. Gemini 2.5 Flash (Stable) with Thinking Mode has a free tier through Google AI Studio. Pay-as-you-go use through the Gemini API is listed at about $0.15 per 1M input tokens (non-thinking) and about $3.50 per 1M input tokens (thinking mode).

Question 4

What do experts say about Caveman vs Gemini 2.5 Flash (Stable) with Thinking Mode?

Accepted Answer

Caveman: Caveman is a one-line installable Claude Code skill by Julius Brussee that instructs Claude to respond in ultra-compressed telegraphic language — short imperative verbs, no filler words, minimal articles — while preserving technical accuracy. The conceit is absurd: make Claude sound like a caveman. The result is practical: roughly 75% fewer output tokens per response.

This matters because Claude's usage limits are token-based. Power users and teams hitting rate limits on Claude Code subscriptions have found that caveman-style output dramatically extends how many interactions they can run per session. The Hacker News thread hit 333 points the day it launched, with developers sharing variations and reporting measurable drops in token consumption for coding workflows.
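The effect of shorter output on a token budget can be sketched with some simple arithmetic. The sketch below is illustrative only: the function name, the budget, and the per-interaction token counts are hypothetical, and it assumes that both input and output tokens count against the same limit. Only the 75% output reduction comes from the description above.

```python
def interactions_per_budget(token_budget, input_tokens, output_tokens,
                            output_reduction=0.0):
    """Estimate how many interactions fit in a token budget.

    Assumes (hypothetically) that input and output tokens are drawn
    from one shared budget. output_reduction is the fraction of output
    tokens saved, e.g. 0.75 for the roughly 75% figure quoted above.
    """
    compressed_output = output_tokens * (1.0 - output_reduction)
    return int(token_budget // (input_tokens + compressed_output))


# Hypothetical numbers: 1M-token budget, 500 input and 2,000 output tokens per turn.
baseline = interactions_per_budget(1_000_000, 500, 2_000)
caveman = interactions_per_budget(1_000_000, 500, 2_000, output_reduction=0.75)
print(baseline, caveman)
```

Under these assumed numbers the gain is 2.5x rather than 4x, because input tokens are not compressed. The real benefit depends on how much of each interaction is output.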

The project also spawned a fork (Caveman-Claude by om-patel5) that packages it as a higher-performance optimization layer with additional context-compression techniques. What started as a joke about caveman grammar is becoming a serious prompt-engineering pattern for token efficiency.

Gemini 2.5 Flash (Stable) with Thinking Mode: Google DeepMind has promoted Gemini 2.5 Flash to stable status, making its 'thinking mode' generally available via the Gemini API and Google AI Studio. The model delivers chain-of-thought reasoning at significantly lower latency and cost than Gemini 2.5 Pro, making it a practical choice for production reasoning workloads. Thinking mode can be toggled on or off per request, giving developers granular control over the cost-quality tradeoff.
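As a rough illustration of the per-request toggle, the sketch below builds a request body without sending it. The helper name and default budget are hypothetical, and the field names (`generationConfig`, `thinkingConfig`, `thinkingBudget`) are an assumption based on the Gemini API's REST format as commonly documented, where a budget of 0 is understood to turn thinking off for this model. Check the current API reference before relying on them.

```python
import json


def build_request_body(prompt, thinking=True, thinking_budget=1024):
    """Build a generateContent-style JSON body (hypothetical helper).

    Assumption: the API accepts generationConfig.thinkingConfig.thinkingBudget,
    and a budget of 0 disables thinking for this model.
    """
    budget = thinking_budget if thinking else 0
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {"thinkingConfig": {"thinkingBudget": budget}},
    }


# Cheap, fast request with thinking turned off.
fast = build_request_body("Summarize this changelog.", thinking=False)
# Reasoning-heavy request with a thinking budget.
deep = build_request_body("Find the bug in this function.", thinking=True)
print(json.dumps(fast, indent=2))
```

Choosing the budget per request is what lets an application spend thinking tokens only on the prompts that need them.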
