Microsoft open source multimodal AI Agent "Magma": shopping can automatically order, but also predict the behavior of video characters

Feb. 26, 2012 - Early this morning, Beijing time.MicrosoftIn the official websiteOpen SourceBeMultimodality AI Agent Base model --MagmaMagma has a lot more to offer than a traditional Agent. Compared to traditional Agents, Magma hasMultimodal capabilities across digital, physical worldsIn addition to automatically processing different types of data such as images, video, and text, Magma has built-in psychological prediction capabilities that enhance the ability to understand the spatial and temporal dynamics of future video frames and accurately predict the intentions and future behavior of people or objects in the video.

Users can use Magma toAutomatically place e-commerce orders and check the weather; it can alsoAutomatically operated physical robots, or get help in playing real chess.

According to the official description, Magma is able to help AI-driven assistants or robots understand their surroundings and act accordingly. For example, it can help domestic robotsLearn how to organize items you've never seen before, or help virtual assistantsGenerate step-by-step user interface navigation instructions for unfamiliar tasks.

Magma is one of the foundational models of VLA (IT House Note: Visual Linguistic Action) capable of adapting to new tasks in digital and physical environments, effectively learning from massive amounts of publicly available visual and linguistic data to fuse linguistic, spatial, and temporal intelligences to cope with complex tasks and environments in the digital and physical world.

With open source link: https://microsoft.github.io/Magma/

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.
Information

Google Launches Gemini Code Assist Personal Edition Programming Tool, Available for Free

2025-2-26 11:33:16

Information

OpenAI Deep Research feature available to ChatGPT Plus subscribers with 10 queries per month

2025-2-26 11:36:09

Search