AI tool comparison
GLM-5.1 vs Nothing Ever Happens
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
AI Models
GLM-5.1
First open-source model to top SWE-bench Pro — 744B MoE, MIT, zero Nvidia
50%
Panel ship
—
Community
Paid
Entry
GLM-5.1 is Z.ai's (formerly Zhipu AI) open-weight model released April 7, 2026 under the MIT license. It's a 744-billion-parameter Mixture-of-Experts architecture with 40 billion active parameters per token, a 200K-token context window, and a 131K maximum output length — and it became the first open-source model ever to lead SWE-bench Pro, scoring 58.4% versus Claude Opus 4.6's 57.3%. The training story is almost as remarkable as the performance. GLM-5.1 was trained entirely on approximately 100,000 Huawei Ascend 910B chips using the MindSpore framework — no Nvidia hardware was used at any point. That makes it one of the first frontier-tier models to demonstrate that the CUDA monoculture isn't technically mandatory for training state-of-the-art models. Z.ai became the first publicly traded foundation model company via a Hong Kong IPO in January 2026 (~$558M raised). The model is free to download from HuggingFace and also available via API at $0.95 per million input tokens. In agentic demonstrations, it has run autonomously for eight hours straight — 655 planning and execution iterations — without human checkpoints.
AI Experiments
Nothing Ever Happens
An autonomous bot that always bets 'No' on Polymarket doom predictions—and profits
75%
Panel ship
—
Community
Free
Entry
Nothing Ever Happens is a deliberately simple autonomous trading bot that buys "No" contracts on Polymarket prediction markets—specifically targeting non-sports questions about dramatic or catastrophic events. The thesis: humans systematically overestimate the probability that scary predicted events will actually happen. The bot filters markets using LLM-based criteria to exclude sports (where outcomes are more unpredictable) and focuses on the long tail of geopolitical, tech, and social predictions that tend toward "nothing happens." Built by Sterling Crispin (an artist and technologist known for his work on Apple Vision Pro), the project is equal parts satirical commentary and functional trading system. It logs all positions, P&L, and reasoning chains so you can audit its decisions. The name references an internet phrase mocking catastrophist news cycles—"nothing ever happens" is the skeptic's rebuttal to perpetual crisis framing. The HN post hit 370 points and 180+ comments in a few hours, sparking genuine debate about whether this is a sound strategy, a fun toy, or a comment on prediction market epistemology. Real-world results aren't yet published, but the idea of using an LLM as a "doom filter" for prediction markets is novel enough to be worth watching.
Reviewer scorecard
“MIT license, top SWE-bench Pro score, $0.95/M via API. If your use case is agentic coding and you're not evaluating GLM-5.1, you're leaving real performance on the table. The 8-hour autonomous run capability is compelling for long-horizon task pipelines.”
“Clean architecture, good logging, and a legitimately interesting hypothesis about prediction market psychology. The LLM filtering layer for 'doom vs. non-doom' questions is a smart abstraction. Even if the strategy underperforms, the codebase is a solid template for automated Polymarket bots.”
“SWE-bench Pro is one benchmark. The broader coding composite (Terminal-Bench 2.0 + NL2Repo) still has Claude Opus 4.6 ahead at 57.5 vs GLM-5.1's 54.9. Running 744B locally requires hardware most teams don't own, and the API's Chinese jurisdiction will trigger compliance blockers for many organizations.”
“The strategy looks good in backtests but Polymarket's liquidity is thin and arbitrageurs will price this edge away quickly once it's well-known. Also: 'nothing ever happens' is survivorship bias dressed as strategy—the times something DOES happen, you're wiped out. Don't put meaningful capital here.”
“The Huawei chip training story matters more than the benchmark ranking. If GLM-5.1 proves you can train frontier models without Nvidia at scale, it fractures the GPU supply chain narrative that's been shaping geopolitics and AI policy discussions for years. This is a proof of concept with enormous implications.”
“Autonomous agents that trade prediction markets based on LLM-assessed epistemic calibration is a genuinely new thing. If this works at scale, it could actually make prediction markets more accurate by algorithmically correcting for human doom-bias. That's a more interesting outcome than any individual P&L.”
“For creative workflows, the 744B MoE overhead is overkill and local deployment requires datacenter-grade hardware that's nowhere near indie studio territory. The MIT license is great, but the gap between 'free to download' and 'free to actually run' is vast at this parameter count.”
“Sterling Crispin making a 'nothing ever happens' bot is peak art-meets-tech. It's a functional piece of commentary on the anxiety economy—we're so primed for crisis that prediction markets misprice normalcy. The aesthetic of it is as interesting as the trading logic.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.