DeepSeek open-sources the DeepSeek-V2-Chat-0628 model, improving code and mathematical reasoning capabilities

The Chatbot Arena, organized by LMSYS, recently released its latest leaderboard update. DeepSeek-V2-0628 ranked 11th overall, surpassing all other open-source models, including Llama3-70B, Qwen2-72B, Nemotron-4-340B, and Gemma2-27B, and earning the title of the world's top open-source model.

Compared with the 0507 open-source Chat version, DeepSeek-V2-0628 has comprehensively improved its capabilities in code, mathematical reasoning, instruction following, role playing, JSON output, and other areas.

Chatbot Arena is a globally recognized, authoritative blind-testing platform for large models that relies on blind human evaluation to keep its rankings fair. In this evaluation, DeepSeek-V2-0628 demonstrated world-class problem-solving capabilities in the Hard Prompt, Code, Longer Query, and Math categories, performing at the same level as top models such as GPT-4-Turbo-0409 and Claude 3 Opus.

DeepSeek-V2-0628 not only performs well internationally but also ranks second among all Chinese domestic models, demonstrating its strong competitiveness. In addition, DeepSeek-V2-0628 was launched on June 28, 2024, and is available through API and web services at a very competitive price.
