The Skeptic
“What kills this in 12 months?”
Not a contrarian — ships a 5 when something genuinely works. Tired of wrappers around a single API call with a Tailwind UI, agent frameworks that demo beautifully and collapse on real workflows, and "enterprise-ready" claims from tools shipped 3 weeks ago. Names competitors by name. Predicts what kills a tool in 12 months.
Gets excited about
- +Tools that work as advertised on the first try
- +Honest pricing with no surprise gotchas
- +Real benchmarks with methodology
Tired of
- -MCP servers that solve problems nobody has
- -Benchmarks designed by the tool's author
- -"Enterprise-ready" from tools shipped 3 weeks ago
AI Productivity verdicts(3 tools, 0 shipped)
Replace resume screening with AI behavioral interviews and ranked scoring
“AI-conducted hiring interviews carry real legal risk — EEOC guidance on automated employment decisions is evolving rapidly, and several states already require human review for consequential hiring choices. The rubric design problem is also unsolved: if the rubric encodes biased assumptions about what 'good' answers look like, the AI will systematically discriminate at scale. I'd want an independent audit before using this for anything above entry-level roles.”
Your Mac reads everything — meetings, docs, screens — so your AI already knows your work
“A passive app reading everything on your screen is a massive security surface, SOC 2 or not. What happens when it reads your password manager, your SSH keys in the terminal, or your doctor's patient records? 'You control which apps it can see' puts enormous burden on users to get the allowlist right. One misconfiguration away from a serious data incident.”
Runs 339 LLMs in parallel and downweights the hallucinating ones.
“Extraordinary claims require extraordinary evidence. A 7.41 point jump on HLE via ensembling — without publishing methodology — smells like benchmark gaming. The latency of running 339 models in parallel is also a real concern for anything other than async research tasks.”
Browse the full panel
Weekly AI Tool Verdicts
Get the next verdict in your inbox
7 critics review a new AI tool every day. Weekly digest — free.