Google introduces text-to-video AI model Veo to take on OpenAI's Sora

Unveiled at Google I/O 2024, Veo can generate high-quality 1080p resolution videos that can go beyond a minute.

Vikas SN

May 14, 2024 / 23:39 IST

A screenshot of a video created by Google Veo (Image credit: Google)

Google on May 14 introduced Veo, its most advanced video generation model that can generate high-definition video in a range of cinematic and visual styles, at its annual developer conference Google I/O 2024.

This announcement comes amidst intensifying competition in the artificial intelligence (AI) video generation models space from rivals such as OpenAI's Sora, Facebook parent Meta's Emu Video, and startups such as Runway, and Stability AI.

Sora, in particular, has wowed people with its realistic visuals since its debut in February.

Veo generates high-quality 1080p resolution videos that can go beyond a minute. The company claims the model has an advanced understanding of natural language and visual semantics, and can generate video that closely represents the user's creative vision — accurately rendering details in longer prompts and capturing tone.

It also understands cinematic terms like "timelapse" or "aerial shots of a landscape" and can create consistent and coherent footage where people, animals and objects move realistically throughout shots. One can further edit these generated videos using additional prompts.

Veo will be available to select creators as a private preview inside Google's AI video generator, VideoFX, which is part of the company's Labs initiative, over the coming weeks. People can sign up to join the waitlist. Google said it will also bring some of Veo's capabilities to YouTube Shorts and other products in the future.

New Imagen 3 model

Google also introduced a new version of its image generation model Imagen, which the company says is its highest quality text-to-image model to date.

Called Imagen 3, the model can produce photorealistic, lifelike images with incredible level of detail and far fewer distracting visual imperfections than prior models, the company executives said in the blogpost.

Imagen 3 better understands natural language, the intent behind the user's prompt, and also incorporates small details from longer prompts, they said.

"It’s also our best model yet for rendering text, which has been a challenge for image generation models. This capability opens up possibilities for generating personalized birthday messages, title slides in presentations and more" said Google executives Collins and Eck.

Imagen 3 will be available to select creators as a private preview inside Google's AI image generator ImageFX through its Labs initiative. People can sign up to try the model by joining the waitlist. Imagen 3 will be soon available to developers and enterprises through Vertex AI, the managed AI app developer platform of Google's Cloud unit.

Apart from these developments, Google announced that Grammy winning musician Wyclef Jean, electronic musician Marc Rebillet and Grammy nominated songwriter Justin Tranter have released demo song recordings created with the help of the company's music AI tools on their YouTube channels.

The tech giant is also expanding its watermarking tool SynthID, that embeds digital watermarks AI-generated images and audio, to more formats including text and video. All videos generated by Veo on VideoFX will also be watermarked by SynthID, the company said.

Event alert: Moneycontrol and CNBC TV18 are hosting the ultimate event on artificial intelligence, bringing together entrepreneurs, ecosystem enablers, policymakers, industry leaders, and innovators on May 17 in Gurugram. Click here to register and gain access to the AI Alliance Delhi-NCR Chapter.

Vikas SN covers Big Tech, streaming, social media and gaming industry

first published: May 14, 2024 11:39 pm

Discover the latest Business News, Sensex, and Nifty updates. Obtain Personal Finance insights, tax queries, and expert opinions on Moneycontrol or download the Moneycontrol App to stay updated!

Subscribe to Tech Newsletters

Al Edge Newsletter On Saturdays

Find the best of Al News in one place, specially curated for you every weekend.
MC Tech 3 Newsletter Daily-Weekdays

Stay on top of the latest tech trends and biggest startup news.

Google introduces text-to-video AI model Veo to take on OpenAI's Sora

Unveiled at Google I/O 2024, Veo can generate high-quality 1080p resolution videos that can go beyond a minute.

Related Stories

Subscribe to Tech Newsletters

Trending news