Psi R0, the First End-to-End Reinforcement Learning-Based Embodiment Model for Dual Dexterous Hands to Collaborate on Complex Operations

December 30th.Spiritual IntelligencereleaseThe first end-to-end reinforcement learning (RL) basedbody model Psi R0.

1AI has learned that the model supportsDual dexterity for complex operationsThe Psi R0 can be used to generate an intelligent body with reasoning ability to accomplish and close the loop of long-range dexterous operation tasks by mixing and training multiple skills in tandem. Moreover, Psi R0 can also generalize across item and scene levels.

Taking an e-commerce scenario as an example, the packing of goods is a typical long-distance task, requiring tens of thousands of items to be grabbed, scanned, placed, and tied in plastic bags, etc. The Psi R0 is able to complete this series of actions smoothly with a pair of dexterous hands (officially known asThis series of movements can replace a complete workstation at the customer's site.), becoming the first embodied robot trained to perform long-range dexterous manipulation tasks based on reinforcement learning.

Officially, the RL-based Psi R0 model uses massive simulation data to train a two-handed operating intelligence, and connects multiple skills in tandem through a bidirectional training framework, which is the first in the industry to complete long-range tasks in open environments, with strong generalization capabilities and high robustness.

This skill training framework abstracts key information from object spatio-temporal trajectories to construct a generalized objective function, thus solving the problem of difficult reward function design. In the post-training phase, the success rate of long-range tasks is further improved by aligning a small amount of high-quality real-machine data.

In addition, the transfer feasibility function in the bi-directional training framework plays an important role in fine-tuning the skills to improve the success rate and generalization of the tandem, and at the same time gives the model the ability to switch skills autonomously, so that it can quickly adjust its strategy when it encounters operational failures to ensure a high success rate.

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

Psi R0, the First Reinforcement Learning-Based End-to-End Embodiment Model for Dual Dexterous Hands to Perform Complex Operations

Xunlei to set up AI global headquarters in Hangzhou

Big Models DeepSeek: No one authorized to participate in institutional investor exchanges, online rumors of exchanges are not true

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

Related content:

Xunlei to set up AI global headquarters in Hangzhou

Big Models DeepSeek: No one authorized to participate in institutional investor exchanges, online rumors of exchanges are not true

Krea AI will launch a video generation function with a simpler and more beautiful interface

Musk: Tesla's humanoid robot Optimus may be sold by the end of next year

'Her' Creator Alexis Conneau Announces Departure from OpenAI, Soul of OpenAI GPT-4o Resigns to Start His Own Business

OpenAI Releases AI Model with Reasoning Capabilities, OpenAI o1 Model Debuts

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow