Question 1

Which is better: Auto-Arch Tournament or Devstral Medium?

Accepted Answer

Based on our expert panel, Devstral Medium has a stronger verdict with a 100% Ship rate. Auto-Arch Tournament received a panel verdict of Ship and Devstral Medium received Ship.

Question 2

Is Auto-Arch Tournament free?

Accepted Answer

Auto-Arch Tournament pricing: Open Source

Question 3

Is Devstral Medium free?

Accepted Answer

Devstral Medium pricing: Open weights (Apache 2.0, free to self-host) / API via La Plateforme (token-based, competitive with Mistral's standard pricing tiers)

Question 4

What do experts say about Auto-Arch Tournament vs Devstral Medium?

Accepted Answer

Auto-Arch Tournament: Auto-Arch Tournament is an autonomous research system where an AI agent iteratively proposes, implements, and validates microarchitectural improvements to a RISC-V CPU. Starting from a standard 5-stage pipeline, the loop runs hypotheses in parallel, each going through formal verification (53 symbolic checks), cycle-accurate simulation, multi-seed FPGA place-and-route, and CoreMark CRC validation. Only hypotheses that beat the current champion get merged; everything else gets discarded. Starting from 301 iterations/second, the system hit 577 iter/s (+92%) across 73 attempts in 9.8 hours — producing a design 26% faster and 40% smaller in LUTs than the baseline.

The insight the author drives home is that the real innovation isn't the AI agent — it's the verifier. The orchestrator is hardcoded to prevent agents from manipulating their own evaluation gates, a simple but critical design constraint that turns a creative process into a trustworthy one. Without a rigorous verification harness, agent-driven optimization becomes a confidence trick.

This is early but fascinating proof that AI-driven hardware design loops can produce commercially meaningful gains. The repo uses Claude Code or Codex as the coding agent, SystemVerilog for the RTL, and standard open-source EDA tooling (Yosys, nextpnr, Verilator). It's a compelling template for anyone building agentic optimization loops where correctness matters. Devstral Medium: Devstral Medium is a 70B-class language model from Mistral AI purpose-built for agentic software engineering tasks — multi-file editing, code navigation, and tool use in long-context coding workflows. It ships via Mistral's La Plateforme API and as open weights on Hugging Face under Apache 2.0. The model targets the gap between frontier closed models and smaller open-source coding models on agentic benchmarks like SWE-bench.

Auto-Arch Tournament vs Devstral Medium

Auto-Arch Tournament

Devstral Medium

Bookmarks