In a blog post on August 21st, NVIDIA released the Mistral-NeMo-Minitron 8B small language AI model, which combines high accuracy with computational efficiency. The model can run on GPU-accelerated data centers, clouds, and workstations.
Last month, NVIDIA and Mistral AI released the open-source Mistral NeMo 12B model. Building on it, NVIDIA has now followed up with the smaller Mistral-NeMo-Minitron 8B, an 8-billion-parameter model that can run on workstations with NVIDIA RTX graphics cards.
NVIDIA says that Mistral-NeMo-Minitron 8B was obtained by width-pruning Mistral NeMo 12B and then lightly retraining it with knowledge distillation, a technique published in the paper "Compact Language Models via Pruning and Knowledge Distillation".
Pruning shrinks a neural network by removing the model weights that contribute the least to accuracy. During distillation, the team then retrained the pruned model on a small dataset, recovering much of the accuracy lost in the pruning step.
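To make the two ideas concrete, here is a minimal, hypothetical NumPy sketch, not NVIDIA's actual pipeline: width pruning drops whole neurons (columns of a weight matrix) with the lowest importance score, and a distillation loss measures how far the pruned "student" output has drifted from the original "teacher" output.

```python
# Illustrative sketch of width pruning and a distillation loss.
# All names here are invented for the example; this is not NVIDIA's code.
import numpy as np

def width_prune(w, keep_neurons):
    """Drop the columns (neurons) of a weight matrix with the smallest L2 norm."""
    norms = np.linalg.norm(w, axis=0)               # importance score per neuron
    keep = np.sort(np.argsort(norms)[-keep_neurons:])
    return w[:, keep]

def softmax(logits):
    e = np.exp(logits - logits.max())
    return e / e.sum()

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL divergence between temperature-softened teacher and student outputs."""
    p = softmax(teacher_logits / temperature)
    q = softmax(student_logits / temperature)
    return float(np.sum(p * np.log(p / q)))

rng = np.random.default_rng(0)
w = rng.normal(size=(8, 6))                         # toy layer with 6 neurons
print(width_prune(w, 3).shape)                      # → (8, 3): half the neurons removed

teacher = np.array([2.0, 0.5, -1.0])
student = np.array([1.8, 0.7, -0.9])
print(distillation_loss(teacher, student))          # small, nonzero loss to minimize
```

In real distillation the student is trained to minimize this loss (often mixed with the ordinary next-token loss) across a large corpus of teacher outputs; the sketch only shows the shape of the computation.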
For its size, Mistral-NeMo-Minitron 8B leads the pack on nine popular language-modeling benchmarks. These benchmarks cover a wide range of tasks, including language comprehension, common-sense reasoning, mathematical reasoning, summarization, coding, and the ability to generate truthful answers.