Question 1

Which is better: Open Agents (Vercel Labs) or pi-autoresearch?

Accepted Answer

Based on our expert panel, Open Agents (Vercel Labs) has the stronger verdict: it received Ship, with a 75% Ship rate, while pi-autoresearch received Mixed.

Question 2

Is Open Agents (Vercel Labs) free?

Accepted Answer

Open Agents (Vercel Labs) is open source, released under the MIT license.

Question 3

Is pi-autoresearch free?

Accepted Answer

pi-autoresearch is open source, released under the Apache 2.0 license.

Question 4

What do experts say about Open Agents (Vercel Labs) vs pi-autoresearch?

Accepted Answer

Open Agents (Vercel Labs): Open Agents is Vercel Labs' open-source reference implementation for building persistent cloud coding agents. It demonstrates a three-tier architecture: a chat UI layer, a durable workflow layer using the new Vercel Workflow SDK, and isolated sandbox VMs with snapshot/resume. The result is an agent that doesn't lose its state when your laptop closes — it keeps working in the cloud and you can pick up the conversation when you're back.

The reference implementation includes git operations (clone, branch, commit, PR creation), voice input via ElevenLabs integration, session sharing via a shareable URL, and a real-time log stream so you can watch what the agent is doing. It's designed to be forked and adapted rather than used as-is — think of it as Vercel's opinionated answer to "how should a cloud coding agent be architected?"

What makes this notable isn't the feature list; it's the source. Vercel is the dominant deployment platform for web developers, and when Vercel shows you how to build something, thousands of developers follow the pattern. Open Agents is likely to become the de facto reference architecture for the next generation of coding agent products built on Vercel infrastructure.

pi-autoresearch: pi-autoresearch extends the pi terminal agent with an autonomous optimization loop: the agent writes a change, runs a benchmark, uses Median Absolute Deviation (MAD) to filter out statistical noise, and either commits or reverts, then loops. No human in the loop. The cycle repeats until a time limit or convergence criterion is met.
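The commit-or-revert cycle described above can be sketched roughly as follows. This is a minimal illustration, not pi-autoresearch's actual code: the callbacks `propose_change`, `run_benchmark`, `commit`, and `revert` are hypothetical stand-ins, "lower score is better" is an assumption, and the convergence criterion (stop after `patience` consecutive rejected changes) is one plausible choice among many. The plain `score < best` comparison stands in for the MAD-based noise filter.

```python
import time
from typing import Callable, Tuple


def optimization_loop(
    propose_change: Callable[[], None],   # hypothetical: agent edits the code
    run_benchmark: Callable[[], float],   # hypothetical: returns a score, lower is better
    commit: Callable[[], None],           # hypothetical: keep the current change
    revert: Callable[[], None],           # hypothetical: discard the current change
    time_limit_s: float = 60.0,
    patience: int = 5,
    clock: Callable[[], float] = time.monotonic,
) -> Tuple[float, int]:
    """Run change/benchmark/commit-or-revert until time runs out or progress stalls."""
    best = run_benchmark()                # baseline measurement before any change
    accepted = 0
    stale = 0                             # consecutive rejected changes
    deadline = clock() + time_limit_s

    while clock() < deadline and stale < patience:
        propose_change()
        score = run_benchmark()
        if score < best:                  # placeholder for a noise-aware comparison
            commit()
            best = score
            accepted += 1
            stale = 0
        else:
            revert()
            stale += 1

    return best, accepted
```

Passing the clock in as a parameter keeps the loop easy to exercise with a fake timer; a real run would use the default.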

The technique was popularized by Karpathy's autoresearch concept for ML training, but pi-autoresearch generalizes it to any benchmarkable target. Shopify's engineering team ran it against their Liquid template engine and reported 53% faster parse/render with 61% fewer allocations after an overnight run, changes their team had been unable to land manually in months. The MAD-based noise filtering is the key innovation: it keeps the agent from committing changes that only look better because of benchmark noise, and from reverting valid improvements.
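To make the noise-filtering idea concrete, here is a small sketch of how a MAD-based accept/reject decision might look. The exact rule pi-autoresearch applies is not given here, so everything below is an assumption for illustration: the function names, the threshold multiplier `k`, and the choice to compare medians of repeated benchmark runs (lower is better).

```python
from statistics import median
from typing import Sequence


def mad(samples: Sequence[float]) -> float:
    """Median Absolute Deviation: the median distance of samples from their median."""
    m = median(samples)
    return median([abs(x - m) for x in samples])


def is_real_improvement(
    baseline: Sequence[float],
    candidate: Sequence[float],
    k: float = 3.0,  # assumed threshold: the gain must exceed k times the noise level
) -> bool:
    """Accept only if the candidate's median beats the baseline's by more than the noise."""
    gain = median(baseline) - median(candidate)      # positive means candidate is faster
    noise = max(mad(baseline), mad(candidate))       # use the noisier of the two runs
    return gain > k * noise


baseline_ms = [100, 102, 98, 101, 99]   # repeated timings of the current code
faster_ms = [80, 82, 78, 81, 79]        # clearly faster candidate
jitter_ms = [99, 103, 97, 101, 98]      # same speed, different noise

print(is_real_improvement(baseline_ms, faster_ms))  # True: gain of 20 vs noise of 1
print(is_real_improvement(baseline_ms, jitter_ms))  # False: gain of 1 vs noise of 2
```

MAD is used instead of standard deviation because a single outlier run (a GC pause, a cold cache) barely moves it, so one bad sample does not inflate the noise estimate. Note that when both runs show zero spread, this sketch accepts any positive gain.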

The project has spawned an ecosystem: pi-autoresearch-studio adds a visual timeline of accepted/rejected edits, openclaw-autoresearch ports the concept to Claw Code, and autoloop generalizes it to any agent that supports a run/test interface. At 3,500 stars, it's one of the most-forked pi extensions.
