No Questions Asked Core Dome Open Sources World's First End-Side Omnimodal Understanding Model Megrez-3B-Omni, Supports Image, Audio, and Text Understanding

December 16th.No Question Core DomeAnnounced today,Open SourceMegrez-3B-Omni, a full-modal understanding miniaturization model in the no-questions-asked core dome end-side solution, and its language-only model version, Megrez-3B-Instruct.

No Questions Asked Core Dome Open Sources World's First End-Side Omnimodal Understanding Model Megrez-3B-Omni, Supports Image, Audio, and Text Understanding

Officially, Megrez-3B-Omni is a full-modal understanding model made for the end, with the ability to process image, audio, and text modal data simultaneously:

  • existgraphic understandingOn the other hand, Megrez-3B-Omni is currently one of the most accurate image understanding models on several mainstream test sets such as OpenCompass, MME, MMMU, and OCRBench.
  • existtext comprehensionOn the other hand, Megrez-3B-Omni achieves the optimal accuracy of the end-to-end model on several authoritative test sets such as C-EVAL, MMLU / MMLU Pro, AlignBench, and so on.
  • existspeech understandingIn terms of this, Megrez-3B-Omni supports voice input in both Chinese and English, and is also capable of handling complex multi-round dialog scenarios, as well as supporting voice questioning of input images or text, enabling free switching between different modes.

Officials claim that the unimodal version of Megrez-3B-Instruct achieves a significant improvement in inference speed compared to its predecessor and other end-side macrolanguage models.Maximum inference speed can be ahead of the same precision model 300%.

The relevant links are as follows:

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.
HeadlinesInformation

Wuhan University's Artificial Intelligence Institute established, Xiaomi Group says it looks forward to cooperation

2024-12-16 16:42:19

Information

UK Tests AI Cameras to Catch Drunk Drivers: Can Judge by Driving Behavior and Road Use

2024-12-16 16:45:00

Search