OpenAI upgrades its transcription and voice-generating AI models

OpenAI is bringing new transcription and voice-generating AI models to its API that the company claims improve upon its previous releases.

For OpenAI, the models fit into its broader “agentic” vision: building automated systems that can independently accomplish tasks on behalf of users. The definition of “agent” might be in dispute, but OpenAI Head of Product Olivier Godement described one interpretation as a chatbot that can speak with a business’s customers.

“We’re going to see more and more agents pop up in the coming months” Godement told TechCrunch during a briefing. “And so the general theme is helping customers and developers leverage agents that are useful, available, and accurate.”

OpenAI claims that its new text-to-speech model, “gpt-4o-mini-tts,” not only delivers more nuanced and realistic-sounding speech but is also more “steerable” than its previous-gen speech-synthesizing models. Developers can instruct gpt-4o-mini-tts on how to say things in natural language — for example, “speak like a mad scientist” or “use a serene voice, like a mindfulness teacher.”

Here’s a “true crime-style,” weathered voice:

OpenAI transcription results — The results from OpenAI transcription benchmarking.Image Credits:OpenAI

#OpenAI #upgrades #transcription #voicegenerating #models

OpenAI upgrades its transcription and voice-generating AI models

Elon Musk holds unprecedented Pentagon talks, wants leakers prosecuted

Ceres Power Holdings plc (CPWHF) Q4 2024 Earnings Call Transcript

Valve removes video game demo suspected of being malware

MDPL: No Reason To Pay Those Hefty Fees For Now (BATS:MDPL)

Trump administration seeks to disqualify judge in law firm case

Entain: Missouri And Alberta Delayed

enCore Energy (EU) Faces Securities Class Action As Investors Scrutinize Financial Reporting and Leadership Shakeup – Hagens Berman

Elon Musk holds unprecedented Pentagon talks, wants leakers prosecuted

Ceres Power Holdings plc (CPWHF) Q4 2024 Earnings Call Transcript

Elon Musk holds unprecedented Pentagon talks, wants leakers prosecuted

Ceres Power Holdings plc (CPWHF) Q4 2024 Earnings Call Transcript

Elon Musk holds unprecedented Pentagon talks, wants leakers prosecuted

Ceres Power Holdings plc (CPWHF) Q4 2024 Earnings Call Transcript