Google DeepMind Introduces New AI Models That Let Robots Perform Real-World Tasks Without Training

On the evening of March 12, Beijing time.Google DeepMind Launch of two new models AI Modelsthat is designed to help robots accomplish moreMultiple real-world missions.

Google DeepMind Introduces New AI Models That Let Robots Perform Real-World Tasks Without Training

One of them, called Gemini Robotics, is avisual language action modelThe ability to make robotsUnderstanding new situations without specialized training.

Gemini Robotics is based on the latest version of Google's flagship AI model, Gemini 2.0, which builds on Gemini's multimodal world understanding capabilities by adding new modalities for physical action, according to Carolina Parada, senior director of robotics at Google DeepMind. Gemini Robotics builds on Gemini's multimodal world-understanding capabilities and applies them to the real world by adding new modalities for physical action.

The model makes progress in the three core areas that Google DeepMind believes are necessary to build efficient robots: versatility, interaction, and flexibility. In addition to being able to cope with new contexts, Gemini Robotics performs better at interacting with humans and the environment, and is able to perform more precise physical operations, such asFolding paper or opening bottle caps.

The other is the Gemini Robotics-ER (Embodied Reasoning) model, which the company describes as an advanced visual language model capable of "Understanding a complex and dynamic world”.

Parada further explains that when you're filling a bento box.Where to place items on the table and how to do itThe Gemini Robotics-ER is designed for this type of reasoning task, and robotics experts can use the model to interface with existing low-level control systems, opening up new capabilities driven by the Gemini Robotics-ER.

Vikas Sindhwani, a researcher at Google DeepMind, said the company is developing a "layered security policy" and said the Gemini Robotics-ER model has been trained to assess whether an action is safe or not in a given situation. The company has also released new benchmarks and frameworks to advance security research in AI. According to 1AI, last year, Google DeepMind Introduced the "Robot Constitution", which is a code of conduct for robots inspired by Isaac Asimov.

According to The Verge, Google DeepMind has partnered with Apptronik to "build the next generation of humanoid robots". In addition, Google has opened up the Gemini Robotics-ER model to "trusted testers" including Agile Robots, Agility Robotics, Boston Dynamics and Enchanted Tools, Parada said: "We're focused on building intelligence that understands and acts in the physical world, and we're looking forward to applying this technology to multiple domains and manifestations."

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.
Information

With fines of up to €35 million, Spain 'cracks down' on firms that fail to label AI-generated content

2025-3-13 10:35:14

Information

Lucent Technologies Launches Open-Sora 2.0, an Open Source Video Generation Model with Performance Close to OpenAI Sora

2025-3-13 17:31:14

Search