DeepSeek 等秒变操控电脑 AI智能体，微软开源工具 OmniParser V2.0 发布

February 17th.Microsoft OmniParser It is an AI tool for parsing and recognizing on-screen interactive icons by purely visual GUI-based intelligences, previously paired with GPT-4V to significantly enhance recognition capabilities.

DeepSeek and other AI intelligences that control computers in seconds, Microsoft's open-source tool OmniParser V2.0 is released.

On February 12, Microsoft released on its official website the OmniParser Latest Version V2.0In addition, OpenAI (4o / o1 / o3-mini) is available,DeepSeek(R1), Qwen (2.5VL) and Anthropic (Sonnet) models into AI intelligences that can manipulate computers.

Compared to version V1, OmniParser V2 has been trained using larger scale interactive element detection data and icon feature caption data, resulting in higher accuracy and faster inference in detecting smaller interactable UI elements, with a latency reduction of 60%.

In the high-resolution Agent benchmark test ScreenSpot Pro.V2+GPT-4o had an accuracy of 39.6%, while the GPT-4o raw accuracy was only 0.8%.

In order to be able to experiment faster with different intelligences setups, theMicrosoft has also open-sourced OmniTool, a Dockerized Windows system that integrates a set of basic tools needed for intelligences, covering functions such as screen understanding, localization, action planning and execution, and a key tool for turning large models into intelligent bodies.

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

DeepSeek and other AI intelligences that control computers in seconds, Microsoft's open-source tool OmniParser V2.0 is released.

Google to drop diversity program, says no longer banning AI weaponization "good for society"

ByteDance's Chinese AI IDE "Trae" now supports Windows, with built-in GPT-4o for free!

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

Related content:

Google to drop diversity program, says no longer banning AI weaponization "good for society"

ByteDance's Chinese AI IDE "Trae" now supports Windows, with built-in GPT-4o for free!

Microsoft GitHub Copilot Enterprise Edition is now available for $39 per person per month

Microsoft sues an organization for illegally hacking its AI services and bypassing security to generate harmful content

Human jobs under threat of replacement: OpenAI revealed to be releasing 'doctoral-level' super AI intelligence this month

OpenAI Launches Operator, the First AI Intelligence Body That Controls Computers to Automate Tasks, Booking Tickets and Shopping Online on Their Behalf

Please enter the code

....Payment confirmation in progress....

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow