OpenAI adds three realtime audio models to the API

OpenAI has added three new audio models to its API, pushing its Realtime API stack beyond fast speech recognition and into something closer to an actual voice worker: GPT-Realtime-2 for reasoning-heavy conversations, GPT-Realtime-Translate for live multilingual speech, and GPT-Realtime-Whisper for streaming transcription. The pitch is simple enough: let apps listen, understand, translate, and respond while […]