360 open-sources its 360 Brain 7B-parameter large model, with support for long text input of 500,000 words

360 recently released the open source 360 Brain 7B, a large model with 7 billion parameters. It was trained on a corpus of 3.4 trillion tokens, mainly Chinese, English, and code, and is released in three text lengths: 4K, 32K, and 360K. 360 said that 360K (about 500,000 words) is the longest text length among current domestic open source models.

360 said it verified the model's performance on the mainstream OpenCompass evaluation datasets, including C-Eval, AGIEval, MMLU, CMMLU, HellaSwag, MATH, GSM8K, HumanEval, MBPP, BBH, and LAMBADA. The capabilities examined included natural language understanding, knowledge, mathematical calculation and reasoning, code generation, and logical reasoning. The 360 model ranked first on four of these datasets and third on average.

In the LongBench test (a multi-task, bilingual Chinese-English benchmark for evaluating the long text comprehension of large language models), 360 selected the tasks most closely related to Chinese long text applications for evaluation: Chinese single-document question answering, multi-document question answering, summarization, and few-shot tasks. The 360Zhinao-7B-Chat-32K model achieved the highest average score.

In the English NeedleInAHaystack test (which inserts a key piece of information at different positions in a long text and then asks questions about it to test a large model's long text ability), 360Zhinao-7B-Chat-360K achieved an accuracy of more than 98%. 360 also constructed a Chinese NeedleInAHaystack test based on the SuperCLUE-200K evaluation benchmark, on which the model likewise achieved an accuracy of more than 98%.

In addition to the model weights, the model's fine-tuning training code, inference code and a full set of tools are also open source, allowing developers of large models to use it "out of the box".

Zhou Hongyi has previously said that the large model industry is competing on text length, and that 1 million words will soon be the norm. "We plan to open source this capability, so there is no need for everyone to reinvent the wheel. The 360K is mainly for the sake of reputation." He also called himself a "believer in open source" who believes in the power of open source.
