OpenAI has released three new real-time voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. GPT-Realtime-2 integrates GPT-5 level reasoning into voice interactions, supporting a 128K context window and parallel tool calls. GPT-Realtime-Translate offers real-time, low-cost simultaneous interpretation across numerous languages, drastically undercutting traditional human interpreter costs. GPT-Realtime-Whisper provides low-latency, streaming speech-to-text transcription. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT These models significantly lower the cost and increase accessibility of real-time voice translation and AI-powered voice agents, potentially disrupting the simultaneous interpretation industry and enabling more natural human-computer interaction.
RANK_REASON OpenAI announced three new voice models with advanced capabilities, including GPT-5 level reasoning and real-time translation. [lever_c_demoted from frontier_release: ic=1 ai=1.0]