TechCrunch, a technology media outlet, reported yesterday (September 11) that French AI startup Mistral has released Pixtral 12B, the company's first multimodal AI model, capable of processing images and text simultaneously.
The Pixtral 12B model has 12 billion parameters and is about 24 GB in size. Parameter count roughly corresponds to a model's problem-solving ability, and models with more parameters usually perform better than models with fewer.
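The reported download size is consistent with the weights being stored in a 16-bit format; as a quick illustrative check (the storage format is an assumption, not stated in the report):

```python
# Rough size check: 12 billion parameters at 2 bytes each (bfloat16/float16).
params = 12_000_000_000
bytes_per_param = 2  # assumption: 16-bit weights
size_gb = params * bytes_per_param / 1e9
print(f"{size_gb:.0f} GB")  # ~24 GB, matching the reported size
```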
Pixtral 12B is built on Mistral's text model Nemo 12B and can answer questions about an arbitrary number of images at arbitrary sizes.
Similar to other multimodal models such as Anthropic's Claude series and OpenAI's GPT-4o, Pixtral 12B should in principle be able to perform tasks such as captioning images and counting the objects in a photo.
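As a rough illustration of that kind of query, here is a minimal sketch that assumes the community checkpoint (linked at the end of this article) loads through Hugging Face transformers' LLaVA-style interface; the prompt and image URL are placeholders, and whether this particular repo loads directly this way is an assumption:

```python
import requests
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

# Assumption: this repo exposes a transformers LLaVA-style interface.
model_id = "mistral-community/pixtral-12b-240910"
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(model_id, device_map="auto")

# Placeholder image; any number of images can be passed in the content list.
image = Image.open(requests.get("https://example.com/photo.jpg", stream=True).raw)
chat = [{
    "role": "user",
    "content": [
        {"type": "text", "text": "How many dogs are in this photo?"},
        {"type": "image"},  # matched to the image passed below
    ],
}]
prompt = processor.apply_chat_template(chat, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(out, skip_special_tokens=True)[0])
```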
Users can download, fine-tune, and use the Pixtral 12B model under the Apache 2.0 license.
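A minimal sketch of fetching the weights for local use with the standard Hugging Face tooling (the target directory is arbitrary):

```python
from huggingface_hub import snapshot_download

# Download the Apache-2.0-licensed weights for local use or fine-tuning.
snapshot_download(
    repo_id="mistral-community/pixtral-12b-240910",
    local_dir="pixtral-12b",  # arbitrary local target directory
)
```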
Pixtral 12B will soon be available for open beta testing through Mistral's chatbot, Le Chat, and its API platform, La Plateforme, said Sophia Yang, Mistral's head of developer relations, in a post on X.
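Once API access opens, a request would presumably resemble Mistral's existing chat-completions format; the sketch below assumes that format, and the model name is a guess rather than something stated in the report:

```python
import os
import requests

# Assumption: Pixtral is served through Mistral's existing chat-completions
# endpoint; the model name "pixtral-12b-2409" is a placeholder guess.
resp = requests.post(
    "https://api.mistral.ai/v1/chat/completions",
    headers={"Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}"},
    json={
        "model": "pixtral-12b-2409",
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image."},
                {"type": "image_url", "image_url": "https://example.com/photo.jpg"},
            ],
        }],
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```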
In terms of technical specifications, Pixtral 12B is equally impressive: a 40-layer network, a hidden dimension of 14,336, 32 attention heads, and a dedicated 400-million-parameter vision encoder that supports images at up to 1024x1024 resolution.
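Collected as a configuration sketch, with field names that are assumptions loosely following the params.json convention of Mistral releases (only the numeric values come from the report):

```python
# Reported Pixtral 12B hyperparameters; key names are illustrative.
pixtral_12b = {
    "n_layers": 40,        # transformer layers (from the report)
    "hidden_dim": 14336,   # hidden dimension (from the report)
    "n_heads": 32,         # attention heads (from the report)
    "vision_encoder": {
        "params": 400_000_000,   # ~400M dedicated vision encoder (from the report)
        "max_image_size": 1024,  # up to 1024x1024 images (from the report)
    },
}
```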
On benchmarks such as MMMU, MathVista, ChartQA, and DocVQA, it outperforms a number of well-known multimodal models, including Phi-3 and Qwen2 7B, underscoring its strong performance.
Hugging Face address:
https://huggingface.co/mistral-community/pixtral-12b-240910