Mistral Releases Pixtral Large Multimodal AI Model: Beats GPT-4o on Complex Math Reasoning and Chart/Document Understanding

November 19: Mistral AI announced a new multimodal AI model, Pixtral Large, yesterday (Nov. 18). The model has 124 billion parameters, is built on Mistral Large 2, and is designed mainly for processing text and images.

Pixtral Large is now available under the Mistral Research License for research and educational use, and under the Mistral Commercial License for commercial use.

Pixtral Large is the second model in Mistral AI's multimodal family. IT House, citing the official press release, reports that the model performed well on standard multimodal benchmarks such as MathVista, DocVQA, and VQAv2. On MathVista in particular it reached 69.4% accuracy, outperforming all competing models.

Pixtral Large also outperformed GPT-4o and Gemini-1.5 Pro in both the ChartQA and DocVQA tests.

The model pairs a 123B-parameter multimodal decoder with a 1B-parameter vision encoder, supports a 128K context window, and can process at least 30 high-resolution images. Pixtral Large excels not only at handling visual data, but also at complex reasoning and chart understanding.
