AI tool comparison
Stet vs Velo
Which one should you ship with? Here is a side-by-side comparison of the panel verdict, pricing, reviewer split, and community vote.
Productivity
Stet
Local macOS dictation that sounds like you — not like generic AI prose
Panel ship: 75%
Community: n/a
Entry: Free
Stet is an open-source macOS dictation app that transcribes speech locally, then uses AI to clean up the output while preserving your personal writing style and tone. Its core innovation is a voice model: a lightweight profile that learns from your past writing so that AI corrections don't flatten your prose into generic AI-ese. The result is meant to read as if you dictated it, not as if it was passed through an LLM.
Technically, Stet combines local Whisper-based transcription (nothing leaves your device during speech-to-text) with an optional AI refinement pass that runs on either your own API key (BYOK) or a $6.99/month subscription. The open-source release includes the voice profiling code, which makes it auditable and forkable. It is a direct response to Wispr Flow, which is closed-source and subscription-only.
For writers, podcasters, and productivity users who dictate a lot of content, voice preservation is the real differentiator. The proliferation of AI writing tools has created a recognizable 'AI voice' (flat, over-structured, and devoid of personality) that attentive readers are increasingly good at detecting. Stet's bet is that preserving your actual voice is the most valuable thing an AI writing assistant can do.
Productivity
Velo
Turn any doc, slide, or screen into an AI-narrated video message
Panel ship: 75%
Community: n/a
Entry: Free
Velo lets you record or upload slides, PDFs, docs, screen recordings, or websites and converts them into a polished video message narrated by a hyper-realistic AI avatar with lip sync, eye blinks, and natural gestures. The whole workflow runs in the browser, with no downloads required.
The problem it targets is async communication fatigue: teams are drowning in wall-of-text Slack messages and poorly produced Loom videos, but nobody has time to polish a proper recording. Velo fills the gap by letting you share a PDF, pick a voice, and ship a professional-looking walkthrough in under two minutes.
It launched on Product Hunt today and hit #1 with 464 upvotes, unusually strong traction for a non-developer tool. The avatar quality is notably better than that of earlier AI presenter tools. Early users report using it as a replacement for Loom when they want a "polished" look without showing their face or spending time on editing.
Reviewer scorecard
“Open-source, local-first transcription with BYOK is the right architecture. I've been burned by voice tools that upload my audio to servers I can't audit. The voice profile approach for preserving style is technically interesting — I want to see how it handles domain-specific jargon and code-switching between formal and casual registers.”
“The in-browser workflow is genuinely frictionless — paste a link, pick a voice, done. This is the kind of async communication tool I'd actually use instead of recording another mediocre Loom.”
“The 'sounds like you' promise needs a lot of data to actually deliver — your voice profile is only as good as the writing samples it's trained on, and most people don't have a consistent, large corpus of their own writing. For casual dictators, this might just be Whisper with extra steps. Apple's built-in dictation is free and surprisingly good now.”
“AI avatars in 2026 still read as 'uncanny valley corporate' and that's going to cap adoption in informal team settings. Also no pricing transparency at launch is a red flag — freemium often means 'free for 30 seconds of video.'”
“Voice-first computing is coming back, and the arms race for authentic AI writing assistance is heating up. The distinguishing factor won't be transcription accuracy — everyone has solved that — it will be voice fidelity. Stet is building in the right direction: local processing plus personal style models. Expect this architecture to be standard in two years.”
“Async video is eating synchronous meetings and Velo's approach — no face, no setup, just content — could accelerate that significantly for distributed teams. This is what the next generation of internal communication looks like.”
“This is genuinely exciting for writers and content creators. The homogenization of AI-assisted writing is a real aesthetic problem — everything starts sounding like the same LinkedIn post. A tool that actively fights that tendency by learning your specific voice is solving the right problem. Even if the voice model needs work, the direction is exactly right.”
“As a content creator I've been waiting for a tool that makes me look polished without a studio setup. The avatar quality here actually clears my bar — I'd use this for client-facing walkthroughs without hesitation.”