Arcee Trinity-Large-Thinking
399B open-weight reasoning model, 13B active params, Apache 2.0
Expert verdict
Ship
3-1
The Panel's Take
Arcee AI, a 30-person startup, has released Trinity-Large-Thinking, a 399B sparse mixture-of-experts reasoning model under Apache 2.0. Only 13B parameters activate per token, giving it inference speeds 2-3x faster than comparable dense models. In internal benchmarks and early community testing, it ranks #2 on PinchBench, trailing only Anthropic's Opus 4.6, at a list price of $0.90/M output tokens, roughly 96% cheaper than frontier closed models.
The model was trained in a $20M, 33-day run on 2,048 NVIDIA Blackwell GPUs. Arcee used a constitutional AI-style process with synthetic chain-of-thought data generated from multiple frontier models, then applied a reinforcement learning phase with outcome-based rewards on math, code, and logic benchmarks.
Trinity-Large-Thinking is the strongest open-weight reasoning model released to date on a commercial-friendly license. For companies with privacy requirements or custom deployment needs, it is a credible alternative to frontier closed APIs, especially for code generation, mathematical reasoning, and structured data tasks, where the gap between open and closed models has historically been widest.
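The headline figures above invite a quick sanity check. The sketch below is back-of-envelope arithmetic only: every input is copied from the panel's take, while the function names and the derived values are illustrative and not anything Arcee has published. In particular, the GPU-hour rate assumes the entire $20M went to compute, which is probably not the case, so treat it as an upper bound.

```python
# Back-of-envelope checks on the figures quoted in the panel's take.
# Inputs are copied from the review; derived values are rough estimates.

TOTAL_PARAMS_B = 399          # total parameters, in billions
ACTIVE_PARAMS_B = 13          # parameters active per token, in billions
PRICE_PER_M_OUTPUT = 0.90     # USD per million output tokens
DISCOUNT_VS_FRONTIER = 0.96   # "roughly 96% cheaper"
TRAINING_COST_USD = 20_000_000
TRAINING_DAYS = 33
GPU_COUNT = 2048


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's weights used for any single token."""
    return active_b / total_b


def implied_baseline_price(price: float, discount: float) -> float:
    """Comparison price implied by a fractional discount (hypothetical helper)."""
    return price / (1.0 - discount)


def cost_per_gpu_hour(cost_usd: float, days: float, gpus: int) -> float:
    """GPU-hour rate, ASSUMING the whole budget paid for compute (upper bound)."""
    return cost_usd / (days * 24 * gpus)


print(f"Active weights per token: {active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B):.1%}")
print(f"Implied frontier price:   ${implied_baseline_price(PRICE_PER_M_OUTPUT, DISCOUNT_VS_FRONTIER):.2f}/M output tokens")
print(f"Implied GPU-hour rate:    ${cost_per_gpu_hour(TRAINING_COST_USD, TRAINING_DAYS, GPU_COUNT):.2f}")
```

On these inputs, about 3.3% of the weights are active per token, the "96% cheaper" claim implies a comparison price near $22.50/M output tokens, and the training budget works out to at most roughly $12.33 per GPU-hour. None of this validates the 2-3x speed claim, which depends on memory bandwidth and serving setup rather than on the active-parameter ratio alone.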
The reviews
“A #2 benchmark result from a 30-person startup under Apache 2.0 is legitimately shocking. The sparse MoE architecture means you can run 399B at a reasonable cost — and $0.90/M output is almost too cheap to believe for this performance tier. This is going in our eval suite immediately.”
“Benchmark numbers from the releasing company always look better than real-world deployment. PinchBench is also relatively new and the community hasn't stress-tested whether it correlates with production quality. Wait for independent evals before betting a product on this.”
“This is the model that closes the open vs. closed frontier gap. When a 30-person startup can train a near-frontier reasoner for $20M on a commercial license, the economics of AI completely change. Enterprises that couldn't afford frontier APIs will rebuild their stacks around self-hosted models like this.”
“For long-form creative work requiring multi-step reasoning — worldbuilding, complex narrative planning, detailed research synthesis — a 399B model at this price point is transformative. The chain-of-thought always-on design means it actually shows its reasoning, which helps when I need to redirect it mid-task.”