Kai-Fu Lee's AI company Zero One Everything announced the open source Yi-9B model, claiming to have the strongest mathematical capabilities in the same series of codes

Zero One Everything 01AI" official WeChat account announced tonightOpen Source Yi-9B Model, officially called the Yi seriesModelThe "Science Champion" in the series - Yi-9B is the model with the strongest code and mathematical capabilities in the current Yi series models.RealityThe actual parameter is 8.8B, and the default context length is 4K tokens..

This model is based on Yi-6B (trained with 3.1T tokens) and uses 0.8T tokens for further training. The data is as of June 2023.

According to reports, in terms of comprehensive ability (Mean-All),Yi-9B's performance is "better than other similar sizedOpen SourceThe best model, surpassing DeepSeek-Coder, DeepSeek-Math, Mistral-7B, SOLAR-10.7B and Gemma-7B.

In terms of coding capability (Mean-Code), Yi-9B’s performance is second only to DeepSeek-Coder-7B, and surpasses Yi-34B, SOLAR-10.7B, Mistral-7B, and Gemma-7B.

In terms of mathematical capabilities (Mean-Math), Yi-9B's performance is second only to DeepSeek-Math-7B, and surpasses SOLAR-10.7B, Mistral-7B, and Gemma-7B.

In terms of common sense and reasoning ability (Mean-Text), Yi-9B performs on par with Mistral-7B, SOLAR-10.7B, and Gemma-7B.

The official said,Yi-9B (BF 16) and its quantized version Yi-9B (Int8) can be easily deployed on consumer-grade graphics cards, low cost of use and developer-friendly.

Kai-Fu Lee's AI company Zero One Everything announced the open source Yi-9B model, claiming to have the strongest mathematical capabilities in the same series of codes

The companyKai-Fu LeeThe chairman and CEO of Sinovation Ventures led the team to found the company, which has already launched two models, Yi-34B and Yi-6B.Open Source Big Model, claiming to be completely open to academic research and simultaneously open to free commercial applications.

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.
Information

NVIDIA's official blog launches the "Decoding AI" column: RTX AI has high computing power, low latency, and local deployment is safer

2024-3-7 9:31:57

Information

Some are happy, some are sad: In January this year, the number of new AI-related jobs in the United States increased by 42% compared to when ChatGPT was released, and the number of IT jobs decreased by 31% compared to the same period

2024-3-7 9:34:57

Search