Back
OpenAILaunchOpenAI2026-05-13

OpenAI Launches GPT-5.5 Instant, Realtime Translate, and Streaming Whisper Today

OpenAI is rolling out three products simultaneously today: GPT-5.5 Instant (the new default in ChatGPT and API), GPT-Realtime-Translate (live speech translation across 70+ languages), and GPT-Realtime-Whisper (streaming speech-to-text with live transcription). The multi-product drop represents OpenAI's most aggressive infrastructure upgrade since GPT-5 launched.

Original source

OpenAI is shipping three products simultaneously today, marking one of its most dense release days in months. GPT-5.5 Instant replaces GPT-5.3 Instant as the default model across all ChatGPT tiers and in the API as `chat-latest`, bringing faster inference, improved instruction-following, and better multilingual performance.

The two Realtime additions are the more significant news. GPT-Realtime-Translate handles live speech-to-speech translation across 70+ input languages into 13 output languages with reported sub-200ms latency — real-time enough for live conversations. GPT-Realtime-Whisper complements it with streaming speech-to-text that transcribes as the speaker talks rather than processing audio in chunks, enabling applications like live captioning, voice assistants, and call-center analytics.

The Realtime API, introduced with GPT-4o last year, is quietly becoming OpenAI's most commercially interesting infrastructure layer. Voice-native applications — from customer service bots to accessibility tools — are growing faster than text-based ones, and these updates give developers dramatically more capability to build on.

For enterprises, the combination of Translate and Whisper in the same Realtime API makes building multilingual voice products significantly simpler. Previously this required chaining multiple services from different providers. The competitive pressure from ElevenLabs, Deepgram, and AssemblyAI clearly accelerated today's release.

Panel Takes

The Builder

The Builder

Developer Perspective

GPT-Realtime-Translate collapsing speech translation into the same API I'm already using for completion is a genuine simplification. I've been paying three different vendors for what OpenAI just unified. Migration happening this week.

The Skeptic

The Skeptic

Reality Check

Three simultaneous launches is great PR but the model updates are incremental — 5.5 Instant is a minor revision. The Realtime features are real but specialized. For most developers today's news changes nothing about their primary workflows.

The Futurist

The Futurist

Big Picture

Streaming speech-to-text that works in real time is an accessibility and internationalization breakthrough waiting to happen. Realtime Translate at 200ms latency means natural multilingual conversation without pauses. This is the invisible infrastructure layer that makes AI disappear into daily life.

Bookmarks

Loading bookmarks...

No bookmarks yet

Bookmark tools to save them for later