OpenAI has added three new audio models to its API, pushing its Realtime API stack beyond fast speech recognition and into something closer to an actual voice worker: GPT-Realtime-2 for reasoning-heavy conversations, GPT-Realtime-Translate for live multilingual speech, and GPT-Realtime-Whisper for streaming transcription. The pitch is simple enough: let apps listen, understand, translate, and respond while […]

