Facade Intelligence Releases MiniCPM-o 2.6 Full Modal Model, Called "End-Side GPT-4o"

January 16th.Wall-facing intelligencePublic announced today the launch of the "MiniCPM-o 2.6" end-sideholomodal modelWith a parameter of 8B, it is claimed that the performance is comparable to GPT-4o and Claude-3.5-Sonnet.

It utilizes an end-to-end multimodal architecture that can simultaneously process multiple types of data such as text, images, audio and video to generate high-quality text and speech output. Officially, it has a total parameter count of 8B, visual, speech and multimodal streaming capabilitiesAchieved GPT-4o-202405 rating, one of the richest models in the open source community in terms of modal support and performance.

MiniCPM-o 2.6 supportBilingual voice dialog with configurable voicesThe program also features advanced capabilities such as emotion/speed/style control, end-to-end voice cloning, role-playing, and more.

According to the official introduction, MiniCPM-o 2.6 is alsoThe first support in the iPad Multimodal real-time streaming interactions on end-side devices such as theThe multimodal macromodel of GPT-4o-20240 is a large model of multimodality. With an average score of 70.2 on the OpenCompass list (combining 8 mainstream multimodal benchmarks), it outperforms mainstream commercial closed-source multimodal macromodels such as GPT-4o-202405, Gemini 1.5 Pro, and Claude 3.5 Sonnet in terms of single-graph comprehension with a size in the order of 8B.

1AI Attached open source address:

GitHub: https://github.com/OpenBMB/MiniCPM-o
huggingface: https://huggingface.co/openbmb/MiniCPM-o-2_6

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

Facade Intelligence Releases MiniCPM-o 2.6 Full Modal Model, Called "End-Side GPT-4o"

Altman: General Artificial Intelligence Spearheaded by OpenAI, Humans Will No Longer Be the Smartest on Earth

Tmall Elf hardware team merges with Quark product team to explore AI glasses and other hardware, sources say

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

Related content:

Altman: General Artificial Intelligence Spearheaded by OpenAI, Humans Will No Longer Be the Smartest on Earth

Tmall Elf hardware team merges with Quark product team to explore AI glasses and other hardware, sources say

AI company Mianbi Intelligence completes a new round of financing worth hundreds of millions of yuan

Huawei Hubble invested in a domestic AI large-scale model company for the first time: Mianbi Intelligence completed hundreds of millions of yuan in financing, and Zhihu CTO Li Dahai became CEO

The open source MiniCPM 2.0 series of models from Mianbi Intelligent has significantly enhanced its OCR and other capabilities

Mianbi Intelligent released the MiniCPM 3.0 client-side model: it can run with 2GB of memory and its performance exceeds GPT-3.5

Please enter the code

... .Payment confirmation in progress....

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow