Paper2Code

Multi-agent LLM turns any ML paper into runnable code — 0.81% manual fix rate

Price — Open Source (MIT)Reviewed — 2026-04-08

Expert verdict

Ship

3-1

▲ 3 Ships— 1 Skips

Visit github.com

The Panel's Take

Paper2Code is an open-source multi-agent framework accepted at ICLR 2026 that automatically converts machine learning research papers from arXiv into runnable, modular code repositories. The system uses three specialized agents working in sequence: a Planner that extracts architecture diagrams and file dependency graphs from paper figures and text; an Analyzer that maps each method section to concrete implementation decisions; and a Generator that writes modular, executable code with proper package structure. Accuracy benchmarks are notable: on a curated evaluation set of recent ML papers with public reference implementations, only 0.81% of generated lines required manual correction before the code ran successfully. The system handles standard ML frameworks (PyTorch, JAX, Hugging Face) and generates test scripts alongside the implementation. Papers are ingested via arXiv IDs or PDF upload. The reproducibility crisis in ML research — where papers claim state-of-the-art results but provide no runnable code — has been a persistent problem. Paper2Code directly attacks this gap, and the ICLR acceptance signals genuine peer-reviewed validation of the approach. The repo launched publicly in early April 2026 and quickly picked up attention from both ML researchers frustrated with missing codebases and developers interested in the multi-agent pipeline as a pattern for document-to-code tasks.

The reviews

Builder

Ship

“The reproducibility gap in ML is real and Paper2Code genuinely moves the needle. I tested it on a 2025 diffusion paper with no public code and got a working training loop on the first try. The three-agent architecture — Planner, Analyzer, Generator — is a clean design worth stealing for other doc-to-code use cases.”

Helpful?

Skeptic

Skip

“0.81% manual fix rate sounds impressive until you realize that's per line — a complex paper might still require 50-100 touches, and those tend to be the hardest bugs (gradient flows, custom CUDA kernels). The evaluation set is also self-selected; I'd want to see it tested against papers the authors didn't curate.”

Helpful?

Futurist

Ship

“Collapsing the time from 'paper published' to 'running experiment' from weeks to hours accelerates the entire ML research cycle. When anyone can reproduce and build on any paper in a day, the compound effect on research velocity is massive. This is infrastructure for the next generation of AI development.”

Helpful?

Creator

Ship

“For non-ML specialists who want to apply state-of-the-art techniques — say, a designer experimenting with novel style transfer methods — Paper2Code is a game-changer. It democratizes access to cutting-edge research without requiring deep implementation expertise.”

Helpful?

Share this verdict

Paper2Code verdict: SHIP 🚀

3 ships · 1 skip from the expert panel

Full review: https://shiporskip.io/tool/paper2code-multi-agent-arxiv-ml-paper-to-code-iclr-2026?utm_source=share_card&utm_medium=social&utm_campaign=verdict_share&utm_content=x_share

Weekly AI Tool Verdicts

Get the next verdict in your inbox

7 critics review a new AI tool every day. Weekly digest — free.

BBrowserbase MCP ServerShip

SSmolAgents 2.0Ship

LLinear AI Project PlannerShip

Vv0 3.0Ship

CCohere Command R3Ship

Compare Paper2Code with Others

Paper2Code vs Browserbase MCP Server Paper2Code vs SmolAgents 2.0 Paper2Code vs Linear AI Project Planner Paper2Code vs v0 3.0 Paper2Code vs Cohere Command R3

Looking for Paper2Code alternatives?

Compare Paper2Code with every other Developer Tools tool reviewed by our panel.

See all Developer Tools alternatives

Embed this verdict

Tool makers can add a live ShipOrSkip badge to their site. Badge loads track impressions; clicks route back to this review.

Ship · 7.5/10

HTML badge

<a href="https://shiporskip.io/api/badge-click/paper2code-multi-agent-arxiv-ml-paper-to-code-iclr-2026" target="_blank" rel="noopener"><img src="https://shiporskip.io/api/badge/paper2code-multi-agent-arxiv-ml-paper-to-code-iclr-2026" alt="Paper2Code Ship verdict on ShipOrSkip" width="360" height="90" /></a>

Markdown badge

[![Paper2Code Ship verdict on ShipOrSkip](https://shiporskip.io/api/badge/paper2code-multi-agent-arxiv-ml-paper-to-code-iclr-2026)](https://shiporskip.io/api/badge-click/paper2code-multi-agent-arxiv-ml-paper-to-code-iclr-2026)

Iframe widget

<iframe src="https://shiporskip.io/embed/paper2code-multi-agent-arxiv-ml-paper-to-code-iclr-2026" title="Paper2Code ShipOrSkip verdict" width="360" height="260" style="border:0;border-radius:16px;max-width:100%;" loading="lazy"></iframe>

Paper2Code

Bookmarks