Reviews/AI MODELS/DeepSeek V4-Pro
D

DeepSeek V4-Pro

1.6T-param MoE model, 1M context, Nvidia-free — just dropped Apache 2.0

PriceOpen Source (Apache 2.0) / ~$0.30/MTok APIReviewed2026-04-24
Verdict — Ship
3 Ships1 Skips
Visit huggingface.co

The Panel's Take

DeepSeek just dropped V4-Pro and V4-Flash simultaneously — and it's a statement release. V4-Pro packs 1.6 trillion total parameters in a MoE architecture with only 49B active per token, a 1-million-token context window, and a hybrid attention system (Compressed Sparse Attention + Heavily Compressed Attention) that requires just 27% of single-token inference FLOPs compared to V3.2. Both models are Apache 2.0. The hardware story is arguably the bigger news: V4 was trained entirely on Huawei Ascend 950PR chips, zero NVIDIA. That's a geopolitical and technical milestone — it validates China's domestic AI compute stack at frontier scale. The Engram Memory System gives V4 conditional context recall (94% at 128K tokens vs ~45% for V3.2), enabling genuinely long-context reasoning. V4-Flash at 284B parameters (13B active) is the cheaper, faster sibling for production use. Pricing is expected around $0.30/M tokens for Pro. The timing — released to HN today with 99+ points within hours — confirms this as an immediate conversation in the developer community about whether open-weight frontier models have finally matched proprietary ones.

Share this verdict

DeepSeek V4-Pro verdict: SHIP 🚀

3 ships · 1 skip from the expert panel

Full review: shiporskip.io/tool/deepseek-v4-pro-1-6t-moe-1m-context-huawei-apache-2026

Weekly AI Tool Verdicts

Get the next verdict in your inbox

7 critics review a new AI tool every day. Weekly digest — free.

Embed this verdict

Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.

Ship · 7.5/10
HTML badge
<a href="https://shiporskip.io/api/badge-click/deepseek-v4-pro-1-6t-moe-1m-context-huawei-apache-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/deepseek-v4-pro-1-6t-moe-1m-context-huawei-apache-2026" alt="DeepSeek V4-Pro Ship verdict on ShipOrSkip" width="360" height="90" /></a>
Markdown badge
[![DeepSeek V4-Pro Ship verdict on ShipOrSkip](https://shiporskip.io/api/badge/deepseek-v4-pro-1-6t-moe-1m-context-huawei-apache-2026)](https://shiporskip.io/api/badge-click/deepseek-v4-pro-1-6t-moe-1m-context-huawei-apache-2026)
Iframe widget
<iframe src="https://shiporskip.io/embed/deepseek-v4-pro-1-6t-moe-1m-context-huawei-apache-2026" title="DeepSeek V4-Pro ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>

The reviews

Apache 2.0 with 1M context and frontier-level benchmarks changes the commercial calculus entirely. Self-host for sensitive workloads, use the API for production — the 49B active params means reasonable inference costs if you have the hardware.

Helpful?

Benchmark claims from DeepSeek have historically been hard to independently replicate at launch. The Huawei chip story is compelling but also means the Western open-source deployment story requires significant hardware work. And 1.6T parameters is not consumer hardware territory.

Helpful?

V4's Nvidia-free training stack is a geopolitical inflection point as much as a technical one. It proves the export control strategy isn't containing China's AI progress — and gives the global open-source community a frontier model with no licensing restrictions.

Helpful?

A 1M-token context model at $0.30/MTok Apache 2.0 means long-form creative projects — novels, screenplays, brand bibles — can finally be processed holistically. The Flash variant's low cost makes it accessible even for creative side projects with tight budgets.

Helpful?

Bookmarks

Loading bookmarks...

No bookmarks yet

Bookmark tools to save them for later