AI tool comparison
Nothing Ever Happens vs Qwen3.6-Max-Preview
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
AI Experiments
Nothing Ever Happens
An autonomous bot that always bets 'No' on Polymarket doom predictions—and profits
75%
Panel ship
—
Community
Free
Entry
Nothing Ever Happens is a deliberately simple autonomous trading bot that buys "No" contracts on Polymarket prediction markets—specifically targeting non-sports questions about dramatic or catastrophic events. The thesis: humans systematically overestimate the probability that scary predicted events will actually happen. The bot filters markets using LLM-based criteria to exclude sports (where outcomes are more unpredictable) and focuses on the long tail of geopolitical, tech, and social predictions that tend toward "nothing happens." Built by Sterling Crispin (an artist and technologist known for his work on Apple Vision Pro), the project is equal parts satirical commentary and functional trading system. It logs all positions, P&L, and reasoning chains so you can audit its decisions. The name references an internet phrase mocking catastrophist news cycles—"nothing ever happens" is the skeptic's rebuttal to perpetual crisis framing. The HN post hit 370 points and 180+ comments in a few hours, sparking genuine debate about whether this is a sound strategy, a fun toy, or a comment on prediction market epistemology. Real-world results aren't yet published, but the idea of using an LLM as a "doom filter" for prediction markets is novel enough to be worth watching.
AI Models
Qwen3.6-Max-Preview
Alibaba's #1-ranked agentic coding model — tops SWE-bench Pro, Terminal-Bench, and more
75%
Panel ship
—
Community
Paid
Entry
Qwen3.6-Max-Preview is Alibaba's flagship closed-weight model and currently holds the top position on five major agentic coding benchmarks: SWE-bench Pro, Terminal-Bench 2.0, SkillsBench, QwenClawBench, and QwenWebBench. Released April 20 as a preview API, it represents Alibaba's most aggressive push yet at the frontier of agentic AI. Unlike the open-weight Qwen3.6-27B and Qwen3.6-35B-A3B variants released alongside it, the Max model is proprietary and available only through the Qwen API. It's designed for complex multi-step coding tasks, autonomous terminal operation, and web-based agent workflows — the kind of tasks that require sustained planning over dozens of steps without human intervention. For the developer community, the benchmarks are eye-catching: claiming the #1 spot on SWE-bench Pro means it's outperforming Claude Opus 4.7, GPT-5, and Gemini Ultra 2.0 on autonomous software engineering tasks. Whether those numbers hold in production is the real question, but at competitive API pricing, Qwen3.6-Max is worth serious evaluation by any team running coding agents at scale.
Reviewer scorecard
“Clean architecture, good logging, and a legitimately interesting hypothesis about prediction market psychology. The LLM filtering layer for 'doom vs. non-doom' questions is a smart abstraction. Even if the strategy underperforms, the codebase is a solid template for automated Polymarket bots.”
“The SWE-bench Pro numbers are hard to ignore — if this actually resolves real GitHub issues at the rate the benchmark suggests, it's the best coding agent on the market right now. Early access reports from the terminal-bench community are positive, and the API latency is reportedly competitive with Claude. Worth evaluating seriously before your next agent project.”
“The strategy looks good in backtests but Polymarket's liquidity is thin and arbitrageurs will price this edge away quickly once it's well-known. Also: 'nothing ever happens' is survivorship bias dressed as strategy—the times something DOES happen, you're wiped out. Don't put meaningful capital here.”
“Alibaba runs their own benchmarks (QwenClawBench, QwenWebBench) that nobody outside can verify, which is a big red flag. SWE-bench Pro results need independent reproduction before taking them at face value. The 'preview' label also means API reliability, rate limits, and pricing are all subject to change — risky to build a production pipeline on.”
“Autonomous agents that trade prediction markets based on LLM-assessed epistemic calibration is a genuinely new thing. If this works at scale, it could actually make prediction markets more accurate by algorithmically correcting for human doom-bias. That's a more interesting outcome than any individual P&L.”
“The fact that a Chinese tech company is releasing frontier-level agentic models that credibly compete with OpenAI and Anthropic is the real story here. Competition at the frontier drives down prices and forces capability improvements across the board. Alibaba's aggressive release cadence suggests this is just the beginning of a sustained push.”
“Sterling Crispin making a 'nothing ever happens' bot is peak art-meets-tech. It's a functional piece of commentary on the anxiety economy—we're so primed for crisis that prediction markets misprice normalcy. The aesthetic of it is as interesting as the trading logic.”
“For creative technologists building with code, the agentic capabilities matter — a model that can autonomously navigate a codebase and implement multi-file changes opens up a new class of creative tools. If the benchmarks hold in practice, this unlocks more ambitious generative projects without a human in the loop for every step.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.