All Reviews (17)
Gemini 3.1 Flash TTS
Google's TTS API with conversational voice direction and 70+ languages
VoxCPM2
Tokenizer-free TTS: voice design, cloning, and 30 languages from 2B params
VoxCPM2
Tokenizer-free TTS: clone any voice or design one from text, 30 languages, Apache 2.0
Qwen3-TTS
Alibaba's voice cloning TTS handles 600+ languages in one model
Voxtral 4B TTS
Mistral's open-weights production TTS — 9 languages, 70ms latency, 20 voices
OmniVoice
Zero-shot TTS across 600+ languages — open source and 40x faster than real-time
VibeVoice
Microsoft's open-source frontier voice AI — 90 min TTS, 4 speakers
Udio
AI music creation with studio-quality output
ElevenLabs
AI voice cloning and text-to-speech that sounds human
Suno
AI music generation — full songs from a text prompt
Deepgram
AI speech-to-text and text-to-speech API for developers
Krisp
AI noise cancellation and meeting assistant
Synthesia
AI video generation platform for enterprise training
Whisper
OpenAI's open-source speech recognition
Murf.ai
AI voice generator for professional voiceovers
AssemblyAI
AI-powered speech intelligence
Speechmatics
Enterprise speech recognition API