Tongyi Qianwen (Qwen) announced today that, after months of work, the Qwen series has been upgraded from Qwen1.5 to Qwen2, and the new models have been open-sourced simultaneously on Hugging Face and ModelScope.
The main updates in Qwen2 are as follows:
- Pre-trained and instruction-tuned models in 5 sizes: Qwen2-0.5B, Qwen2-1.5B, Qwen2-7B, Qwen2-57B-A14B, and Qwen2-72B;
- In addition to Chinese and English, high-quality data covering 27 more languages has been added to the training data;
- Leading performance on multiple benchmark evaluations;
- Significantly improved coding and mathematics capabilities;
- Extended context length support, up to 128K tokens (Qwen2-72B-Instruct).
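Since the checkpoints are published on Hugging Face and ModelScope, the instruction-tuned models can be used directly with the transformers library. The following is a minimal sketch, assuming the Hugging Face repository id Qwen/Qwen2-7B-Instruct and the standard chat-template API; other sizes follow the same pattern.

```python
# Minimal sketch: loading an instruction-tuned Qwen2 checkpoint with transformers.
# The repository id "Qwen/Qwen2-7B-Instruct" is assumed; swap it for other sizes.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2-7B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # use the dtype stored in the checkpoint
    device_map="auto",    # requires `accelerate`; places weights on available GPUs
)

messages = [{"role": "user", "content": "Give me a short introduction to large language models."}]
# Build the prompt with the model's own chat template.
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256)
# Strip the prompt tokens and decode only the newly generated reply.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```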
Basic model information
The Qwen2 series includes pre-trained and instruction-tuned models in 5 sizes: Qwen2-0.5B, Qwen2-1.5B, Qwen2-7B, Qwen2-57B-A14B, and Qwen2-72B.
Model | Qwen2-0.5B | Qwen2-1.5B | Qwen2-7B | Qwen2-57B-A14B | Qwen2-72B |
---|---|---|---|---|---|
Parameters | 0.49B | 1.54B | 7.07B | 57.41B | 72.71B |
Non-Embedding Parameters | 0.35B | 1.31B | 5.98B | 56.32B | 70.21B |
GQA | True | True | True | True | True |
Tie Embedding | True | True | False | False | False |
Context length | 32K | 32K | 128K | 64K | 128K |
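The split between "Parameters" and "Non-Embedding Parameters" in the table can be approximated by excluding the embedding and output-projection weights when counting. A rough sketch, assuming the usual transformers module names (embed_tokens, lm_head):

```python
# Rough sketch: reproducing the "Parameters" vs. "Non-Embedding Parameters" split.
# Module names ("embed_tokens", "lm_head") follow the usual transformers convention,
# assumed to apply here; tied embeddings are counted once by named_parameters().
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2-0.5B")

total = sum(p.numel() for _, p in model.named_parameters())
embedding = sum(
    p.numel()
    for name, p in model.named_parameters()
    if "embed_tokens" in name or "lm_head" in name
)

print(f"total: {total / 1e9:.2f}B, non-embedding: {(total - embedding) / 1e9:.2f}B")
```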
In the Qwen1.5 series, only the 32B and 110B models used GQA. This time, models of all sizes use GQA, so everyone can benefit from its faster inference and lower GPU memory usage.
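Whether a checkpoint uses GQA can be read off its configuration: the number of key/value heads is smaller than the number of query heads. A minimal sketch, assuming the config fields num_attention_heads and num_key_value_heads exposed by transformers:

```python
# Minimal sketch: checking for grouped-query attention (GQA) in a model config.
# Field names (num_attention_heads / num_key_value_heads) follow the transformers
# convention and are assumed to be present on the Qwen2 configuration.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained("Qwen/Qwen2-7B")
print("query heads:", cfg.num_attention_heads)
print("key/value heads:", cfg.num_key_value_heads)
print("uses GQA:", cfg.num_key_value_heads < cfg.num_attention_heads)
```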
Model Evaluation
Compared with Qwen1.5, Qwen2 achieves substantially better performance at large model scales. We conducted a comprehensive evaluation of Qwen2-72B.
In evaluations of the pre-trained base model, Qwen2-72B significantly outperforms leading open-source models such as Llama-3-70B, as well as Qwen1.5's largest model, Qwen1.5-110B, across a wide range of capabilities including natural language understanding, knowledge, coding, mathematics, and multilingual ability.