DeepSeek V4-Pro
1.6T-parameter MoE model, 1M context, Nvidia-free, newly released under Apache 2.0
The Panel's Take
DeepSeek released V4-Pro and V4-Flash simultaneously, and it is a statement release. V4-Pro has 1.6 trillion total parameters in a MoE architecture with only 49B active per token, a 1-million-token context window, and a hybrid attention system (Compressed Sparse Attention + Heavily Compressed Attention) that needs just 27% of the single-token inference FLOPs of V3.2. The Engram Memory System gives V4 conditional context recall (94% at 128K tokens vs ~45% for V3.2), enabling genuinely long-context reasoning. V4-Flash, at 284B parameters (13B active), is the cheaper, faster sibling for production use. Both models are Apache 2.0, and pricing for Pro is expected to be around $0.30 per million tokens.
The hardware story is arguably the bigger news: V4 was trained entirely on Huawei Ascend 950PR chips, with zero NVIDIA hardware. That is a geopolitical and technical milestone, because it validates China's domestic AI compute stack at frontier scale. The release reached Hacker News today and passed 99+ points within hours, making it an immediate topic in the developer community: have open-weight frontier models finally matched proprietary ones?
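To put the parameter counts in perspective, here is a rough back-of-the-envelope sketch of how sparse each model is and how much memory the weights alone might need. The total and active parameter counts come from the take above; the storage precisions (bytes per parameter) are illustrative assumptions, not something the release is confirmed to ship with, and the function names are hypothetical.

```python
# Parameter counts quoted in the panel's take; everything else is an assumption.
MODELS = {
    "V4-Pro": {"total_params": 1.6e12, "active_params": 49e9},
    "V4-Flash": {"total_params": 284e9, "active_params": 13e9},
}

# Hypothetical storage precisions (bytes per parameter), not confirmed by the source.
BYTES_PER_PARAM = {"16-bit": 2.0, "8-bit": 1.0, "4-bit": 0.5}


def active_fraction(total_params: float, active_params: float) -> float:
    """Share of all parameters that are used for each token in an MoE model."""
    return active_params / total_params


def weight_footprint_gb(total_params: float, bytes_per_param: float) -> float:
    """Memory for the weights alone in decimal GB (no KV cache, no activations)."""
    return total_params * bytes_per_param / 1e9


for name, spec in MODELS.items():
    frac = active_fraction(spec["total_params"], spec["active_params"])
    print(f"{name}: {frac:.1%} of parameters active per token")
    for precision, nbytes in BYTES_PER_PARAM.items():
        gb = weight_footprint_gb(spec["total_params"], nbytes)
        print(f"  {precision} weights: ~{gb:,.0f} GB")
```

Under these assumptions, roughly 3% of V4-Pro's parameters are active per token, yet the full 1.6T weights would still occupy about 1,600 GB at 8 bits per parameter. The sparsity lowers compute per token, but it does not by itself shrink what has to be stored.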
Verdict: SHIP (3 ships, 1 skip from the expert panel)
The reviews
“Apache 2.0 with 1M context and frontier-level benchmarks changes the commercial calculus entirely. Self-host for sensitive workloads, use the API for production; the 49B active params mean reasonable inference costs if you have the hardware.”
“Benchmark claims from DeepSeek have historically been hard to independently replicate at launch. The Huawei chip story is compelling but also means the Western open-source deployment story requires significant hardware work. And 1.6T parameters is not consumer hardware territory.”
“V4's Nvidia-free training stack is a geopolitical inflection point as much as a technical one. It proves the export control strategy isn't containing China's AI progress — and gives the global open-source community a frontier model with no licensing restrictions.”
“A 1M-token context model at $0.30/MTok Apache 2.0 means long-form creative projects — novels, screenplays, brand bibles — can finally be processed holistically. The Flash variant's low cost makes it accessible even for creative side projects with tight budgets.”
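The long-context pricing argument in the reviews can be checked with simple arithmetic. The sketch below estimates whether a manuscript fits in the 1M-token window and what it might cost to send once at the expected $0.30 per million tokens. The tokens-per-word ratio, the manuscript length, and the treatment of the price as a flat per-token input rate are all assumptions for illustration; the helper names are hypothetical and not part of any DeepSeek API.

```python
PRICE_PER_MILLION_TOKENS_USD = 0.30  # expected V4-Pro price quoted in the review
CONTEXT_WINDOW_TOKENS = 1_000_000    # context window quoted in the review
TOKENS_PER_WORD = 1.3                # assumption: rough English average, not a V4 tokenizer figure


def estimate_tokens(word_count: int, tokens_per_word: float = TOKENS_PER_WORD) -> int:
    """Very rough token estimate from a word count."""
    return round(word_count * tokens_per_word)


def input_cost_usd(tokens: int, price_per_million: float = PRICE_PER_MILLION_TOKENS_USD) -> float:
    """Cost of sending `tokens` once, assuming a flat per-token rate."""
    return tokens / 1_000_000 * price_per_million


def fits_in_context(tokens: int, window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """True if the whole text fits in a single context window."""
    return tokens <= window


manuscript_words = 100_000  # hypothetical novel length
tokens = estimate_tokens(manuscript_words)
print(f"Estimated tokens: {tokens:,}")
print(f"Fits in one context window: {fits_in_context(tokens)}")
print(f"Estimated cost per pass: ${input_cost_usd(tokens):.3f}")
```

With these assumed numbers, a 100,000-word manuscript uses a small fraction of the window and costs a few cents per pass, while filling the entire 1M-token window once would cost about $0.30 at the quoted rate.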