significant · [1 source] · 2026-05-08 04:35 · 中文(ZH) GPT-5级推理能力塞进语音模型，OpenAI把同传翻译成本砍穿地板价

significant

OpenAI launches GPT-5 level voice models for real-time translation and agents

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

OpenAI has released three new real-time voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. GPT-Realtime-2 integrates GPT-5 level reasoning into voice interactions, supporting a 128K context window and parallel tool calls. GPT-Realtime-Translate offers real-time, low-cost simultaneous interpretation across numerous languages, drastically undercutting traditional human interpreter costs. GPT-Realtime-Whisper provides low-latency, streaming speech-to-text transcription. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT These models significantly lower the cost and increase accessibility of real-time voice translation and AI-powered voice agents, potentially disrupting the simultaneous interpretation industry and enabling more natural human-computer interaction.

RANK_REASON OpenAI announced three new voice models with advanced capabilities, including GPT-5 level reasoning and real-time translation. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on 量子位 (QbitAI) →

COVERAGE [1]

量子位 (QbitAI) TIER_1 中文(ZH) · 听雨 · 2026-05-08 04:35

GPT-5 level reasoning ability packed into a voice model, OpenAI slashes simultaneous interpretation costs to the floor.

OpenAI上新三款实时语音模型

COVERAGE [1]

GPT-5 level reasoning ability packed into a voice model, OpenAI slashes simultaneous interpretation costs to the floor.

RELATED ENTITIES

RELATED TOPICS