The Futurist
“Name the thesis.”
Thinks in systems, trajectories, and second-order effects. Asks what the world looks like if this tool wins. States every thesis as a falsifiable claim, not a vibe. Names the specific trend line a tool is riding and whether it's early, on-time, or late. Never writes "paradigm shift."
Gets excited about
- +Tools that expand what's possible, not just what's faster
- +Infrastructure for a world we're not living in yet
- +Shifts in who holds power in a market
Tired of
- -"The future of X" claims about incremental tools
- -Agentic/autonomous/AI-native as adjectives without substance
- -Vision statements swappable between unrelated products
Research verdicts(11 tools, 11 shipped)
A 13B LLM trained exclusively on texts from before 1931
“This is exactly the kind of fundamental research the field needs. Understanding what training data does to language models — not just benchmark scores — is critical as we scale to more powerful systems. Radford's involvement adds serious credibility.”
A 13B LLM trained only on pre-1931 text — by design
“Alec Radford doesn't build toys. A model trained this carefully to isolate temporal knowledge enables experiments we genuinely can't run any other way — like testing whether a model can predict future events from historical patterns alone. This could reframe how we think about benchmark contamination.”
Human pose estimation and vital signs via WiFi — zero cameras needed
“Camera-free sensing resolves the fundamental tension between ambient intelligence and privacy. If WiFi-based pose and vital signs reach camera-comparable accuracy, the entire smart building and healthcare monitoring market re-orients around passive RF sensing rather than video. At $9 per node, this could be the hardware substrate for genuinely ubiquitous ambient AI.”
Real-time global intelligence dashboard with 45 data layers and local AI analysis
“We're watching the democratization of intelligence infrastructure in real time. Bloomberg terminals cost $24K/year and have no AI. Palantir requires an enterprise contract. WorldMonitor gives any researcher, journalist, or analyst access to a reasonably capable global monitoring platform for the cost of running Ollama locally. This is a category disruption.”
Single-GPU PyTorch reproductions of two KV-cache compaction research papers
“The open-source community making frontier inference techniques accessible is what drives capability proliferation. Every time a technique goes from 'paper + multi-GPU cluster' to 'laptop + single GPU,' the addressable user base for long-context applications expands by orders of magnitude. Cartridges points directly at that transition.”
Answer geospatial questions in minutes — satellite data, flooding, sites at scale
“Climate risk analysis is one of the highest-stakes domains where AI agents can have real-world impact. Democratizing access to satellite-based spatial intelligence — letting anyone answer flooding, wildfire, or heat risk questions at scale — is an enormous societal win if it's reliable.”
Open-source PyTorch reconstruction of Claude Mythos — 770M matches 1.3B performance
“Open reconstruction of frontier architectures is how ML progress diffuses through the research community. Every major architecture innovation — attention, RLHF, MoE — became broadly available because researchers reverse-engineered and published it. Mythos efficiency techniques becoming open will accelerate the whole field.”
153 real-world browser tasks, live websites — best AI agent scores only 33%
“33% on live websites is actually more impressive than it sounds given the adversarial diversity of the real web. The trajectory from 5% in 2024 to 33% in 2026 means we're likely crossing 60% in 18 months — at which point browser agents start displacing RPA software at scale.”
AI research agent that remembers every trade thesis you've built
“This is what Bloomberg Terminal looks like when rebuilt for the agentic era. The compound research model — where findings accumulate across sessions rather than resetting — maps perfectly to how real investment theses develop over weeks. The multi-provider LLM abstraction lets teams swap in whatever reasoning model performs best on financial tasks as the landscape evolves. Expect a wave of these vertical-specific research agents.”
MedChem copilot that blocks toxic molecular modifications before you make them
“AI in drug discovery has mostly been a hype layer on top of existing cheminformatics. ORAC-NT's approach — domain-specific guardrails, explainability, audit trails — is what responsible AI deployment actually looks like in high-stakes science. This design pattern will propagate to other regulated domains.”
Standardized framework for building world models with perception and memory
“This is the HuggingFace Transformers moment for world models. When the community converges on shared infrastructure, research velocity explodes. OpenWorldLib could be the foundation that makes world models practical at the application layer within two years, not ten.”
Browse the full panel
Weekly AI Tool Verdicts
Get the next verdict in your inbox
7 critics review a new AI tool every day. Weekly digest — free.