Question 1

Which is better: Arcee Trinity-Large-Thinking or MiniMax M2.7?

Accepted Answer

Based on our expert panel, Arcee Trinity-Large-Thinking has a stronger verdict with a 75% Ship rate. Arcee Trinity-Large-Thinking received a panel verdict of Ship and MiniMax M2.7 received Mixed.

Question 2

Is Arcee Trinity-Large-Thinking free?

Accepted Answer

Arcee Trinity-Large-Thinking pricing: $0.90/M output tokens (API) / Self-hostable open weights

Question 3

Is MiniMax M2.7 free?

Accepted Answer

MiniMax M2.7 pricing: Free / Open Weights (self-host) / API via MiniMax

Question 4

What do experts say about Arcee Trinity-Large-Thinking vs MiniMax M2.7?

Accepted Answer

Arcee Trinity-Large-Thinking: Arcee AI, a 30-person startup, has released Trinity-Large-Thinking — a 399B sparse mixture-of-experts reasoning model under Apache 2.0. Only 13B parameters activate per token, giving it inference speed 2-3x faster than comparable dense models. In internal benchmarks and early community testing, it ranks #2 on PinchBench, trailing only Anthropic's Opus 4.6, at a list price of $0.90/M output tokens — roughly 96% cheaper than frontier closed models.

The model was trained in a $20M, 33-day run on 2,048 NVIDIA Blackwell GPUs. Arcee trained it using a constitutional AI-style process with synthetic chain-of-thought data generated from multiple frontier models, then applied a reinforcement learning phase using outcome-based rewards on math, code, and logic benchmarks.

Trinity-Large-Thinking is the strongest open-weight reasoning model released to date on a commercial-friendly license. For companies with privacy requirements or custom deployment needs, it represents a credible alternative to frontier closed APIs — especially for code generation, mathematical reasoning, and structured data tasks where the gap between open and closed models has historically been widest. MiniMax M2.7: MiniMax M2.7 is a 230B-parameter Mixture-of-Experts reasoning model released as open weights in April 2026. Only 10 billion parameters activate per token (8 of 256 experts), which enables frontier-level performance at significantly lower inference cost and latency than dense models of comparable quality. The context window stretches to 204,800 tokens — roughly 307 pages of text — with strong performance on long-horizon agentic tasks.

M2.7 is purpose-built for tool-using agents and coding workflows. It scored 50 on the Artificial Analysis Intelligence Index, placing it among the top open-weight models globally. Weights landed on Hugging Face simultaneously with an API launch and the open-sourcing of OpenRoom, MiniMax's interactive agent orchestration system — a rare move that gives developers the full stack from model to agent runtime.

MiniMax is a Shanghai-based AI company that has been quietly iterating through M1, M2, M2.5, and now M2.7 with consistent improvements. The M2.7 release represents a notable capability jump in the MoE open-weights space, particularly for developers who need a locally deployable model that can handle complex multi-step agent tasks without calling a paid API.

Arcee Trinity-Large-Thinking vs MiniMax M2.7

Arcee Trinity-Large-Thinking

MiniMax M2.7

Bookmarks