Compare/Dirac vs Matt Pocock's Skills

AI tool comparison

Dirac vs Matt Pocock's Skills

Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.

D

Developer Tools

Dirac

Open-source coding agent that crushed TerminalBench-2 at 64.8% lower cost

Ship

75%

Panel ship

Community

Free

Entry

Dirac is an open-source AI coding agent built by Dirac Delta Labs that shot to the top of TerminalBench-2 with a 65.2% score using Gemini Flash — while costing 64.8% less than competing agents. Forked from Cline and rebuilt with a performance-first architecture, it handles file modifications, multi-file refactoring, terminal commands, and browser automation through an approval-based workflow. What sets Dirac apart is its technical substrate: hash-anchored edits replace fragile line-number targeting with stable content hashes, AST-native processing understands language structure for TypeScript, Python, and C++, and multi-file batching reduces LLM roundtrips by processing several files per call. The result is a leaner context that preserves model reasoning quality without burning through tokens. Available as both a VS Code extension and an npm CLI, Dirac supports Anthropic, OpenAI, Google, Groq, and Mistral as backends. Its Apache 2.0 license and strong TerminalBench showing on the affordable Gemini Flash model make it a compelling pick for developers who want production-grade coding assistance without the per-token bill shock.

M

Developer Tools

Matt Pocock's Skills

Reusable Claude agent skills that fix AI coding's biggest failure modes

Ship

75%

Panel ship

Community

Free

Entry

Matt Pocock — the TypeScript educator behind Total TypeScript — dropped a GitHub repo that's currently the #2 trending project on all of GitHub with 7,300+ stars in a single day. It's a curated collection of reusable agent skills for Claude Code and other coding agents, installable with one line: `npx skills@latest add mattpocock/skills`. The skills tackle the four canonical failure modes of AI-assisted development: misalignment (agents build the wrong thing), verbosity (context windows bloated with unnecessary tokens), broken code (no feedback loops), and poor design (architecture degrades over time). Each skill is a focused slash command — `/grill-me`, `/tdd`, `/diagnose`, `/improve-codebase-architecture` — that guides agents through professional engineering practices rather than just writing code. What makes this land differently is Pocock's framing: he argues software engineering fundamentals matter more than ever in the agent era, not less. The repo is built around the insight that agents need structured methodology, not just raw capability. With over 3,200 forks in 24 hours and widespread adoption reports, this is shaping up to be the de facto starting point for anyone building a serious `.claude` directory.

Decision
Dirac
Matt Pocock's Skills
Panel verdict
Ship · 3 ship / 1 skip
Ship · 3 ship / 1 skip
Community
No community votes yet
No community votes yet
Pricing
Free / Open Source (Apache 2.0)
Open Source / Free
Best for
Open-source coding agent that crushed TerminalBench-2 at 64.8% lower cost
Reusable Claude agent skills that fix AI coding's biggest failure modes
Category
Developer Tools
Developer Tools

Reviewer scorecard

Builder
80/100 · ship

Topping TerminalBench-2 while being 64.8% cheaper is the kind of benchmark that actually matters to developers. The hash-anchored editing and AST-native approach fix the two most annoying failure modes of existing coding agents — wrong line edits and syntax-blind refactors.

80/100 · ship

This is the missing manual for working with coding agents. The /tdd and /grill-me skills alone have already changed how I approach agent sessions — I actually get working code on the first pass now instead of a beautiful-looking mess that fails every test.

Skeptic
45/100 · skip

It's a Cline fork with smart optimizations — not a ground-up rethink. TerminalBench-2 scores are reproducible only if you're running similar tasks; complex real-world codebases may tell a different story. Also, requiring your own API key still means real money.

45/100 · skip

Slash commands in a shell script repo going viral is classic GitHub hype. These are just prompts dressed up as methodology — any senior engineer could write these in an afternoon, and half your team will ignore them after week two. The stars reflect Pocock's brand, not necessarily the utility.

Futurist
80/100 · ship

The race to build the cheapest, most accurate coding agent is the real infrastructure play of 2026. Dirac's multi-provider support and lean context model are exactly the primitives that make agentic coding deployable at scale — not just on powerful machines.

80/100 · ship

We're watching the emergence of a skills economy for AI agents. Pocock's repo is an early proof-of-concept that reusable, composable agent skills are a real category — the npm of agent methodology. Whoever wins this space wins a huge chunk of the developer toolchain.

Creator
80/100 · ship

The VS Code extension makes it approachable for designers who code. Approval-based workflows mean it won't silently rewrite your carefully named CSS classes. Worth trying if you've been burned by agents that act first and apologize later.

80/100 · ship

The /caveman ultra-compressed mode is genuinely clever for large codebases where token limits bite. As someone who spends half my life fighting context windows, the CONTEXT.md shared domain language approach deserves its own talk at every dev conference this year.

Weekly AI Tool Verdicts

Get the next comparison in your inbox

New AI tools ship daily. We compare them before you waste an afternoon.

Bookmarks

Loading bookmarks...

No bookmarks yet

Bookmark tools to save them for later