OpenAI released a new generation of voice models in its API on Wednesday, giving developers tools to build apps that can reason through spoken requests, translate across 70+ languages, and transcribe ...
What’s new: OpenAI released three voice AI models with real-time reasoning, translation, and transcription capabilities, aiming to make conversations more interactive and task-oriented. Who’s testing: ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
GPT-Realtime-2 brings GPT-5-class reasoning to live voice. A separate translation model covers 70+ input languages. A streaming Whisper variant handles transcription. The pricing is aggressive enough ...
OpenAI’s latest voice AI model, GPT Realtime 2, introduces advanced capabilities for natural and context-aware interactions. Built on the GPT-5-level reasoning framework, it handles complex tasks such ...
OpenAI recently outlined how it adapted WebRTC for low-latency voice AI at global scale. The new architecture replaced a ...
AI thrives on data, but feeding it the right data is harder than it seems. As enterprises scale their AI initiatives, they face the challenge of managing diverse data pipelines, ensuring proximity to ...
May 7 (Reuters) - OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based software agents more conversational and capable of completing tasks in real ...