Moneycontrol PRO
Loans
Loans
HomeTechnologyGoogle DeepMind unveils Gemini Robotics, its new AI models to power humanoid and other real-world robots

Google DeepMind unveils Gemini Robotics, its new AI models to power humanoid and other real-world robots

Google DeepMind introduces Gemini Robotics, an AI-powered model designed for humanoid robots, enabling advanced vision, language, and action capabilities to perform real-world tasks with enhanced dexterity, interactivity, and spatial reasoning.

March 12, 2025 / 22:06 IST
Gemini robotics

Google has taken a step forward in the world of AI development with its all-new Gemini Robotics and Gemini Robotics-ER AI models. Google DeepMind has introduced Gemini, the  AI models based on Gemini 2.0 that are designed to implement AI into real-world robotics These models, according to the blog post, bring vision-language-action (VLA) capabilities and embodied reasoning (ER) to robots, enabling them to perform complex physical tasks with greater adaptability, interactivity, and dexterity. Google DeepMind is partnering with Apptronik to integrate these advancements into humanoid robots.

Gemini Robotics: Vision-language-action model

Gemini Robotics extends the capabilities of Gemini 2.0 by integrating physical actions as an output modality. The company says that It allows robots to interact dynamically with their environment and adapt to changes in real-time.

The model generalises across new environments, objects, and instructions, more than doubling performance on generalization benchmarks compared to previous VLA models. It also understands and responds to natural language commands in multiple languages, adapting its actions based on changing conditions.

Moreover, the model enables fine motor control, allowing robots to perform precise tasks such as origami folding or packing items into a bag.

Initially trained on the ALOHA 2 bi-arm robotic platform, Gemini Robotics is adaptable to various robot types, including Franka-based systems and the Apptronik Apollo humanoid robot.

Gemini Robotics-ER: Enhancing spatial reasoning

On the other hand, the Gemini Robotics-ER focuses on spatial reasoning, allowing roboticists to integrate it with low-level controllers for real-world applications. The model improves 2D and 3D object detection, state estimation, and spatial understanding for better robotic navigation and object interaction. It can autonomously generate control code, achieving 2x-3x higher success rates than previous models. The model leverages in-context learning, refining its responses based on human demonstrations.

Safety and responsible AI

Google DeepMind has not forgotten the responsible use of AI with its two new AI models. The company has integrated the safety measures into Gemini Robotics-ER by enabling it to assess whether an action is safe before execution. The company is also releasing the ASIMOV dataset to evaluate the semantic safety of robotic actions.

Gemini Robotics-ER is being tested by trusted partners, including Agile Robots, Agility Robots, Boston Dynamics, and Enchanted Tools. Google DeepMind aims to refine these models to advance AI-driven robotics applications.

Invite your friends and family to sign up for MC Tech 3, our daily newsletter that breaks down the biggest tech and startup stories of the day

MC Tech Desk Read the latest and trending tech news—stay updated on AI, gadgets, cybersecurity, software updates, smartphones, blockchain, space tech, and the future of innovation.
first published: Mar 12, 2025 10:04 pm

Discover the latest Business News, Sensex, and Nifty updates. Obtain Personal Finance insights, tax queries, and expert opinions on Moneycontrol or download the Moneycontrol App to stay updated!

Subscribe to Tech Newsletters

  • On Saturdays

    Find the best of Al News in one place, specially curated for you every weekend.

  • Daily-Weekdays

    Stay on top of the latest tech trends and biggest startup news.

Advisory Alert: It has come to our attention that certain individuals are representing themselves as affiliates of Moneycontrol and soliciting funds on the false promise of assured returns on their investments. We wish to reiterate that Moneycontrol does not solicit funds from investors and neither does it promise any assured returns. In case you are approached by anyone making such claims, please write to us at grievanceofficer@nw18.com or call on 02268882347