Google Gemma 4
Google's open multimodal models — vision, audio, and text under Apache 2.0
The Panel's Take
Google Gemma 4 is the most capable open model family Google has released, and the first to unify text, vision, and audio in a single architecture — all under the Apache 2.0 license. Available in four sizes (E2B, E4B, 26B MoE, 31B Dense), the lineup runs everywhere from smartphones to high-end GPUs and covers 140+ languages with context windows up to 256K. The headline stat: the 31B Dense model benchmarks above models nearly 20x its size in certain evals, making it the sharpest intelligence-per-parameter model in the open-source ecosystem as of its April 2026 release. The multimodal architecture processes documents with OCR, analyzes charts, transcribes speech, and understands video frames from a single model — no pipeline stitching required. For developers and researchers, the Apache 2.0 licensing is the real unlock. Gemma 4 is fully OSI-approved and commercially usable without restriction, building on a community of 400M+ downloads from prior Gemma versions and 100,000+ variants in the wild.
Share this verdict
Google Gemma 4 verdict: SHIP 🚀 3 ships · 1 skip from the expert panel Full review: shiporskip.io/tool/google-gemma-4-multimodal-apache-open-source-31b-agentic-2026
Weekly AI Tool Verdicts
Get the next verdict in your inbox
7 critics review a new AI tool every day. Weekly digest — free.
Compare Google Gemma 4 with Others
Embed this verdict
Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.
<a href="https://shiporskip.io/api/badge-click/google-gemma-4-multimodal-apache-open-source-31b-agentic-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/google-gemma-4-multimodal-apache-open-source-31b-agentic-2026" alt="Google Gemma 4 Ship verdict on ShipOrSkip" width="360" height="90" /></a>[](https://shiporskip.io/api/badge-click/google-gemma-4-multimodal-apache-open-source-31b-agentic-2026)<iframe src="https://shiporskip.io/embed/google-gemma-4-multimodal-apache-open-source-31b-agentic-2026" title="Google Gemma 4 ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>The reviews
“Apache 2.0 on a model that beats GPT-class performance at 31B? Ship it immediately. The MoE 26B variant is already running under 16GB VRAM for me with llama.cpp quantization. The unified multimodal arch saves a ton of pipeline complexity.”
“Google's benchmark marketing is getting harder to trust — 'beats 600B rivals' is cherry-picked. The audio modality is notably weaker than Gemini 3.1, and fine-tuning the MoE variant requires infrastructure most teams don't have. Real-world performance lags the headline numbers.”
“The 100,000-variant Gemmaverse is a real ecosystem flywheel. Every new Gemma release compresses capability curves downward — things that required cloud APIs last year now run on-device. Gemma 4's audio addition makes it the first truly comprehensive local AI.”
“A single model that can read my documents, analyze charts, transcribe my audio notes, and generate code is genuinely transformative for creative production. The Apache license means I can embed it in client deliverables without legal headaches.”