Amazon launches Nova Sonic AI for natural voice interactions

The new AI handles real conversations better, with a 4.2% word error rate across languages and a 1.09s response time, outperforming GPT-4o in noisy environments.

Amazon has unveiled Nova Sonic, a new AI model designed to process and generate human-like speech, positioning it as a rival to OpenAI and Google’s top voice assistants. The company claims it outperforms competitors in speed, accuracy, and cost, and it is reportedly 80% cheaper than GPT-4o.

Already powering Alexa+, Nova Sonic excels in real-time conversation, handling interruptions and noisy environments better than legacy AI assistants.

Unlike older voice models, Nova Sonic can dynamically route requests, fetching live data or triggering external actions when needed. Amazon says it achieves a 4.2% word error rate across multiple languages and responds in just 1.09 seconds, faster than OpenAI’s GPT-4o.

Developers can access it via Bedrock, Amazon’s AI platform, using a new streaming API.

The launch signals Amazon’s push into artificial general intelligence (AGI), AI that mimics human capabilities.

Rohit Prasad, head of Amazon’s AGI division, hinted at future models handling images, video, and sensory data. This follows last week’s preview of Nova Act, an AI for browser tasks, suggesting Amazon is accelerating its AI rollout beyond Alexa.

For more information on these topics, visit diplomacy.edu.