Question 1

Which is better: Caveman or Gemma 4 Multimodal Fine-Tuner?

Accepted Answer

Based on our expert panel, Caveman has the stronger verdict, with a 75% Ship rate. Both tools received an overall panel verdict of Ship.

Question 2

Is Caveman free?

Accepted Answer

Caveman pricing: Free / Open Source

Question 3

Is Gemma 4 Multimodal Fine-Tuner free?

Accepted Answer

Gemma 4 Multimodal Fine-Tuner pricing: Open Source

Question 4

What do experts say about Caveman vs Gemma 4 Multimodal Fine-Tuner?

Accepted Answer

Caveman: Caveman is a Claude Code skill and AI editor plugin that makes language models respond in compressed, fragment-based prose — dropping articles, filler, and pleasantries while keeping full technical content intact. It offers four intensity levels from Lite (removes fluff, preserves grammar) to Ultra (telegraphic shorthand) and even a classical Chinese mode (文言文) for extreme compression. The result: roughly 65–75% fewer output tokens on average.
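To make the 65–75% figure concrete, here is a minimal arithmetic sketch. The function name and the 10,000-token baseline are hypothetical and chosen only for illustration; the only inputs taken from the description above are the two reduction percentages.

```python
def tokens_after_compression(baseline_tokens: int, reduction_pct: int) -> int:
    """Return the output tokens left after removing reduction_pct percent of them."""
    if not 0 <= reduction_pct <= 100:
        raise ValueError("reduction_pct must be between 0 and 100")
    # Integer arithmetic avoids floating-point rounding surprises.
    return baseline_tokens * (100 - reduction_pct) // 100


# Hypothetical session: 10,000 output tokens at the default verbosity.
baseline = 10_000
remaining_at_65 = tokens_after_compression(baseline, 65)
remaining_at_75 = tokens_after_compression(baseline, 75)
print(f"{remaining_at_75} to {remaining_at_65} tokens remain out of {baseline}")
```

Under these assumptions, a response stream that would have cost 10,000 output tokens shrinks to somewhere between 2,500 and 3,500. Actual savings depend on the intensity level and the kind of content being generated.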

The plugin ships with companion utilities: caveman-commit for sub-50-char commit messages, caveman-review for one-line PR verdicts with inline annotations, and caveman-compress to shrink documentation fed into sessions by ~46%. Installation is a single command across Claude Code, Cursor, Windsurf, Codex, Copilot, and 40+ other editors via the skills ecosystem.

With 27k+ GitHub stars since its Product Hunt launch today, Caveman has struck a nerve with developers who are burning through token budgets on Claude's verbose default style. It's arguably the simplest ROI improvement you can apply to any AI-assisted coding workflow.

Gemma 4 Multimodal Fine-Tuner: Gemma 4 Multimodal Fine-Tuner is an open-source toolkit that lets developers fine-tune Google's Gemma 4 and 3n models across all three modalities (text, images, and audio) using only Apple Silicon hardware. It runs natively on PyTorch with Metal Performance Shaders (MPS), bypassing the NVIDIA requirement that has historically blocked Mac users from serious local fine-tuning work.
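As a rough illustration of what running on MPS means in PyTorch terms, the sketch below picks the MPS backend when it is available and falls back to the CPU otherwise. This is not the toolkit's own code; `pick_device` is a hypothetical helper, and the only PyTorch call it relies on is `torch.backends.mps.is_available()`.

```python
def pick_device() -> str:
    """Return "mps" on Apple Silicon with a working MPS backend, else "cpu"."""
    try:
        import torch  # Assumption: PyTorch may or may not be installed.
    except ImportError:
        return "cpu"
    # Older PyTorch builds have no MPS backend module, so guard the lookup.
    mps_backend = getattr(torch.backends, "mps", None)
    if mps_backend is not None and mps_backend.is_available():
        return "mps"
    return "cpu"


device = pick_device()
print(f"Training would run on: {device}")
```

The returned string can be passed to `torch.device(...)` or `.to(...)` when moving a model and its batches onto the selected backend.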

The toolkit handles the full training pipeline including dataset prep, LoRA adapters, and multi-modal data collation. It ships with working example notebooks, a validation suite, and clean abstractions that don't require deep familiarity with the underlying MPS stack. Apple Silicon's unified memory architecture actually helps here — large multimodal batches fit in memory that would otherwise require GPU VRAM splitting on CUDA setups.
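To see why LoRA adapters matter on memory-constrained hardware, the sketch below compares the trainable parameter count of a full weight matrix with that of a rank-r adapter, which adds two small matrices of shapes (d_out × r) and (r × d_in). The layer size and rank are hypothetical values for illustration, not Gemma's actual dimensions or the toolkit's defaults.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Trainable weights when fine-tuning the whole d_out x d_in matrix."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable weights for a LoRA adapter: B (d_out x rank) plus A (rank x d_in)."""
    return rank * (d_in + d_out)


# Hypothetical projection layer and adapter rank, for illustration only.
d_in, d_out, rank = 4096, 4096, 8
full = full_params(d_in, d_out)
lora = lora_params(d_in, d_out, rank)
print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.4%}")
```

For this example layer the adapter trains 65,536 weights instead of 16,777,216, well under 1% of the original. That reduction is a large part of what makes local fine-tuning feasible on a single machine.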

Posted to Hacker News on April 7 as a Show HN, it pulled 109 upvotes and 165 GitHub stars within hours. The timing is sharp: Gemma 4 just dropped days ago with new multimodal capabilities, and the community immediately wanted local fine-tuning. This fills that gap faster than Google's own tooling.

