AI tool comparison
Nothing Ever Happens vs Qwen3 Family
Which one should you ship with? Here is the side-by-side panel verdict, pricing read, reviewer split, and community vote comparison.
AI Experiments
Nothing Ever Happens
An autonomous bot that always bets 'No' on Polymarket doom predictions—and profits
75%
Panel ship
—
Community
Free
Entry
Nothing Ever Happens is a deliberately simple autonomous trading bot that buys "No" contracts on Polymarket prediction markets—specifically targeting non-sports questions about dramatic or catastrophic events. The thesis: humans systematically overestimate the probability that scary predicted events will actually happen. The bot filters markets using LLM-based criteria to exclude sports (where outcomes are more unpredictable) and focuses on the long tail of geopolitical, tech, and social predictions that tend toward "nothing happens." Built by Sterling Crispin (an artist and technologist known for his work on Apple Vision Pro), the project is equal parts satirical commentary and functional trading system. It logs all positions, P&L, and reasoning chains so you can audit its decisions. The name references an internet phrase mocking catastrophist news cycles—"nothing ever happens" is the skeptic's rebuttal to perpetual crisis framing. The HN post hit 370 points and 180+ comments in a few hours, sparking genuine debate about whether this is a sound strategy, a fun toy, or a comment on prediction market epistemology. Real-world results aren't yet published, but the idea of using an LLM as a "doom filter" for prediction markets is novel enough to be worth watching.
Foundation Models
Qwen3 Family
Alibaba's full model family: 0.6B to 235B with thinking modes
75%
Panel ship
—
Community
Paid
Entry
Alibaba's Qwen team released the full Qwen3 model family this week — 8 models ranging from 0.6B to 235B parameters, spanning both dense and Mixture-of-Experts (MoE) architectures. The headline model is Qwen3-235B-A22B, a 235B MoE that activates 22B parameters per token and matches GPT-4.1 on coding and math benchmarks while running at a fraction of the cost. All Qwen3 models feature switchable "thinking modes" — a built-in chain-of-thought toggle that can be enabled or disabled per request. This eliminates the need for separate reasoning vs. instruct variants, letting developers trade latency for accuracy dynamically. All models are released under Apache 2.0, with weights available on Hugging Face and ModelScope. The smaller models are competitive at their size class: Qwen3-4B reportedly matches Qwen2.5-72B-Instruct on several benchmarks, and the 0.6B model is designed to run efficiently on embedded and edge devices. The release also introduces a new multilingual benchmark covering 119 languages, on which the Qwen3 family sets new state-of-the-art scores for open-weights models.
Reviewer scorecard
“Clean architecture, good logging, and a legitimately interesting hypothesis about prediction market psychology. The LLM filtering layer for 'doom vs. non-doom' questions is a smart abstraction. Even if the strategy underperforms, the codebase is a solid template for automated Polymarket bots.”
“Apache 2.0 on a 235B model that matches GPT-4.1 is the most impactful open-source release of the quarter. The dynamic thinking mode toggle is exactly what production systems need — you don't always want a 30-second reasoning chain on every request.”
“The strategy looks good in backtests but Polymarket's liquidity is thin and arbitrageurs will price this edge away quickly once it's well-known. Also: 'nothing ever happens' is survivorship bias dressed as strategy—the times something DOES happen, you're wiped out. Don't put meaningful capital here.”
“Alibaba's benchmark methodology has been questioned before. The 'matches GPT-4.1' claim needs independent validation on real tasks. Also, while Apache 2.0 is permissive, enterprise legal teams will still scrutinize models from Chinese companies for compliance reasons.”
“Autonomous agents that trade prediction markets based on LLM-assessed epistemic calibration is a genuinely new thing. If this works at scale, it could actually make prediction markets more accurate by algorithmically correcting for human doom-bias. That's a more interesting outcome than any individual P&L.”
“Eight models with consistent APIs, multilingual coverage, and open weights — this is what a real AI platform looks like. Alibaba is building a global alternative to OpenAI's stack, and the quality gap is closing faster than anyone expected two years ago.”
“Sterling Crispin making a 'nothing ever happens' bot is peak art-meets-tech. It's a functional piece of commentary on the anxiety economy—we're so primed for crisis that prediction markets misprice normalcy. The aesthetic of it is as interesting as the trading logic.”
“The multilingual benchmark improvements are huge for global content teams. I tested Qwen3-7B on Japanese marketing copy and it handled tone and register better than anything at this size class. For small teams creating content in non-English markets, this is a serious unlock.”
Weekly AI Tool Verdicts
Get the next comparison in your inbox
New AI tools ship daily. We compare them before you waste an afternoon.