Amazon brings AI voice and video generation models to rival ChatGPT, Gemini

Amazon's Nova Reel 1.1 is exclusively available in US, with a wider rollout on the horizon
An undated image. — Amazon
An undated image. — Amazon

In the face of intensifying competition in the realm of artificial intelligence (AI), a recent entrant, Amazon, has dramatically stepped up its game by introducing voice AI models, Nova Sonic and an updated Nova Reel.

The disclosure of Amazon's latest AI models appears to be the company's latest bid to rival ChatGPT's Advanced Voice Mode and Gemini Live, two renowned AI voice chatbots from the leading tech giants.

Amazon's Nova Sonic AI model

Amazon's Nova Sonic comes as a revolutionary force for real-time speech processing and AI voice generation. 

What sets Nova Sonic apart from typical AI models running separate systems for speech recognition, text conversion, and audio generation is that it leverages a unique unified model architecture

Integrating a unified model ensures improved flow and quality of the voice responses generated. With this meticulous system in its foundations, Amazon's latest AI model becomes unmatched at recognising tone and intention, providing more natural and contextually relevant answers.

Amazon's Nova Reel AI model

Amazon also showcased version 1.1 of its Nova Reel, an updated iteration of its video generation AI model. The latest version of Amazon's Nova Reel increases the quality and latency of its preceding model, now letting users compose videos of up to two minutes in length.

The centre of attraction here is that it's impeccable at being consistent with styles across multiple six-second scenes.

It's worth noting that Nova Reel 1.1 is exclusively available in the US, with a wider rollout on the horizon.