G

Gemma 4

Google's sharpest open models — multimodal, 256K context, runs on a Raspberry Pi

PriceFree / Open Source (Apache 2.0)Reviewed2026-04-18
Verdict — Ship
3 Ships1 Skips
Visit deepmind.google

The Panel's Take

Gemma 4 is Google DeepMind's fourth-generation open model family, released April 2, 2026, under Apache 2.0. Four variants ship in the family: E2B and E4B edge models that run fully offline on phones, Raspberry Pi, and NVIDIA Jetson; a 26B Mixture-of-Experts model that activates only 3.8B parameters at inference; and a 31B Dense flagship. The 31B scores 1452 on the Arena AI text leaderboard (third among all open models), hits 89.2% on AIME 2026 math, and 85.2% on MMLU Pro — versus Gemma 3's 20.8% on AIME. All four model sizes accept text and image inputs. The edge models additionally handle native audio and video, making them the first on-device models with full multimodal coverage. Context windows reach 256K tokens on the large variants, enabling entire codebases or long documents in a single prompt. Native support for tool use, structured output, and agentic workflows is baked in from the start. For the open-source AI community, Gemma 4 is a watershed: a commercially permissive model that genuinely competes with closed-source alternatives on reasoning benchmarks. Gemma downloads crossed 400 million before this launch — Gemma 4's edge deployment story, combining on-device inference with frontier-class reasoning, looks set to make that number look small.

Share this verdict

Gemma 4 verdict: SHIP 🚀

3 ships · 1 skip from the expert panel

Full review: shiporskip.io/tool/gemma-4-google-deepmind-open-multimodal-256k-apache-2026

Weekly AI Tool Verdicts

Get the next verdict in your inbox

7 critics review a new AI tool every day. Weekly digest — free.

Embed this verdict

Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.

Ship · 7.5/10
HTML badge
<a href="https://shiporskip.io/api/badge-click/gemma-4-google-deepmind-open-multimodal-256k-apache-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/gemma-4-google-deepmind-open-multimodal-256k-apache-2026" alt="Gemma 4 Ship verdict on ShipOrSkip" width="360" height="90" /></a>
Markdown badge
[![Gemma 4 Ship verdict on ShipOrSkip](https://shiporskip.io/api/badge/gemma-4-google-deepmind-open-multimodal-256k-apache-2026)](https://shiporskip.io/api/badge-click/gemma-4-google-deepmind-open-multimodal-256k-apache-2026)
Iframe widget
<iframe src="https://shiporskip.io/embed/gemma-4-google-deepmind-open-multimodal-256k-apache-2026" title="Gemma 4 ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>

The reviews

Apache 2.0, runs on a Pi, 256K context, beats proprietary models on AIME — this is the open-source AI stack I've been waiting for. The agentic workflow support baked in natively means I'm not bolting on separate tooling. Shipping today.

Helpful?

The benchmark numbers are impressive on paper, but Gemma 3 was also hyped and underdelivered in production on complex multi-step tasks. The edge models are still unproven outside of Google's own hardware partnerships. Watch the community benchmarks before committing to a migration.

Helpful?

On-device frontier-class intelligence with native audio and video is the inflection point for ambient AI. When a $35 Raspberry Pi can run a model that beats last year's GPT-4 on math, the entire economics of edge AI applications change overnight. This is the model that makes AI infrastructure costs asymptotically cheap.

Helpful?

The document and PDF parsing, OCR, chart comprehension, and UI understanding built into every model size is huge for creative workflow automation. I can finally build tools that read design briefs, invoices, and mockups without needing a cloud API call. The offline capability means client data never leaves my machine.

Helpful?

Bookmarks

Loading bookmarks...

No bookmarks yet

Bookmark tools to save them for later