Mistral releases Voxtral TTS model that’s fast, multilingual and small enough to be practical for voice agents.
Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self ...
OpenAI today introduced a suite of advanced audio models and tools through its API, designed to help developers create sophisticated, voice-driven applications. These updates include ...
Cohere has released Transcribe, a 2-billion-parameter open-source speech recognition model that tops the Hugging Face Open ...
A monthly overview of things you need to know as an architect or aspiring architect.
COLOGNE, Germany, Feb. 2, 2026 /PRNewswire/ -- DeepL, a global AI product and research company, today announced the general availability of DeepL Voice API. This innovative product empowers developers ...
What if you could transform hours of audio into precise, actionable text with just a few lines of code? In 2025, this is no longer a futuristic dream but a reality powered by innovative speech-to-text ...
eSpeaks’ Corey Noles talks with Rob Israch, President of Tipalti, about what it means to lead with Global-First Finance and how companies can build scalable, compliant operations in an increasingly ...
Voice Agent API Is Industry's Only Offering That Delivers The Single, Real-Time API Experience Developers Love, Combined with Full Controllability Enterprises Need. No Need to Stitch Together STT, TTS ...