Moneycontrol PRO
HomeTechnology‘AI that helps you in the real world’: Google’s DeepMind boss on merging Gemini and Veo AI models

‘AI that helps you in the real world’: Google’s DeepMind boss on merging Gemini and Veo AI models

Google DeepMind CEO Demis Hassabis explained how the company plans to combine its two powerful AI tools: Gemini, which is built to understand and generate text, images, and audio, and Veo, which creates videos.

April 11, 2025 / 16:01 IST
Google

Google is dreaming big when it comes to artificial intelligence. And now, it’s taking a major step toward building AI that actually understands the physical world — by teaching it through YouTube videos.

In a recent episode of the Possible podcast, Google DeepMind CEO Demis Hassabis explained how the company plans to combine its two powerful AI tools: Gemini, which is built to understand and generate text, images, and audio, and Veo, which creates videos. Together, they could form a super-smart assistant that doesn’t just answer questions, but actually understands how the world works.

“We’ve always built Gemini to be multimodal from the beginning,” said Hassabis. “And the reason we did that is because we have a vision for this idea of a universal digital assistant — one that actually helps you in the real world.”

In simpler terms, Google wants to create an AI that doesn’t just chat or draw pictures — but watches, listens, and learns from everything, including videos.

And yes, a big chunk of that video training is likely coming from YouTube. Since Google owns the platform, it has access to an enormous library of content showing real-life activities — cooking, building, sports, science experiments, you name it. “Basically, by watching YouTube videos — a lot of YouTube videos — [Veo 2] can figure out, you know, the physics of the world,” Hassabis said.

The goal? Smarter AI that can “see” and interpret the world more like humans do. Think of an assistant that doesn’t just give you a recipe but understands how ingredients behave when cooked, because it has seen thousands of videos showing that in action.

Google isn’t alone in this race. OpenAI and Amazon are also working on similar “omni” models — AI systems that can handle anything from text to images to sounds, all at once.

Of course, all of this requires massive amounts of data. Google has said some of its models may be trained on YouTube content, depending on creator agreements. The company even updated its terms of service last year to allow broader use of content for training AI.

Invite your friends and family to sign up for MC Tech 3, our daily newsletter that breaks down the biggest tech and startup stories of the day

MC Tech Desk Read the latest and trending tech news—stay updated on AI, gadgets, cybersecurity, software updates, smartphones, blockchain, space tech, and the future of innovation.
first published: Apr 11, 2025 04:00 pm

Discover the latest Business News, Sensex, and Nifty updates. Obtain Personal Finance insights, tax queries, and expert opinions on Moneycontrol or download the Moneycontrol App to stay updated!

Subscribe to Tech Newsletters

  • On Saturdays

    Find the best of Al News in one place, specially curated for you every weekend.

  • Daily-Weekdays

    Stay on top of the latest tech trends and biggest startup news.

Advisory Alert: It has come to our attention that certain individuals are representing themselves as affiliates of Moneycontrol and soliciting funds on the false promise of assured returns on their investments. We wish to reiterate that Moneycontrol does not solicit funds from investors and neither does it promise any assured returns. In case you are approached by anyone making such claims, please write to us at grievanceofficer@nw18.com or call on 02268882347