Google’s Veo 3 escalates the AI video race with native audio generation

Google is also launching a new AI filmmaking tool called Flow, and integrating the Veo 3 and Imagen 4 AI models into its Gemini assistant app

May 20, 2025 / 23:18 IST
Alongside Veo 3, Google has introduced Imagen 4, a new version of its image generation model, which can create images in a range of aspect ratios and at up to 2K resolution.

Google is adding audio generation to Veo, its text-to-video artificial intelligence (AI) model, as it aims to compete with OpenAI's Sora, Meta's Movie Gen, and startups such as Runway and Stability AI.

On May 20, Google unveiled Veo 3, the latest version of its video generation model, at its annual developer conference, Google I/O 2025.

The new model, which succeeds Veo 2, can generate sound effects, ambient noise, and dialogue from a text prompt, such as traffic on a city street, birds singing in a park, or a conversation between characters.

"We're emerging from the silent era of video generation... This opens up a whole new world of possibilities," said Google DeepMind CEO Demis Hassabis.