Apple AI releases 7 billion parameter open-source language model DCLM with improved accuracy and reduced computing resource usage

Recently, Apple's artificial intelligence team, in collaboration with the University of Washington and other institutions, launched an open-source language model called DCLM. The model has 7 billion parameters and was trained on up to 2.5 trillion tokens, helping it better understand and generate language.

So, what is a language model? In simple terms, it is a program that can analyze and generate language, helping with tasks such as translation, text generation, and sentiment analysis. For these models to perform well, they need high-quality training datasets. However, obtaining and organizing such data is not easy: irrelevant or harmful content must be filtered out and duplicate information removed.
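To make the idea of dataset filtering concrete, here is a minimal, illustrative sketch of the kind of rule-based cleaning a curation pipeline might apply. The thresholds and blocklist below are hypothetical and far simpler than the filters DCLM actually uses.

```python
# Minimal sketch of rule-based text filtering for a training corpus.
# The thresholds and blocklist entries are illustrative, not DCLM's actual rules.

BLOCKLIST = {"spam-keyword", "offensive-term"}  # placeholder entries

def keep_document(text: str) -> bool:
    """Return True if a document passes some simple quality checks."""
    words = text.split()
    if len(words) < 50:                              # too short to be useful
        return False
    if any(w.lower() in BLOCKLIST for w in words):   # crude harmful-content filter
        return False
    unique_ratio = len(set(words)) / len(words)
    if unique_ratio < 0.3:                           # highly repetitive text
        return False
    return True

docs = ["example document one ...", "example document two ..."]
cleaned = [d for d in docs if keep_document(d)]
```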

To meet this challenge, Apple's research team launched DataComp for Language Models (DCLM), a dataset optimization tool for language models. They recently open-sourced the DCLM models and datasets on the Hugging Face platform. The releases include DCLM-7B, DCLM-1B, dclm-7b-it, DCLM-7B-8k, dclm-baseline-1.0, and dclm-baseline-1.0-parquet, allowing researchers to run large numbers of experiments on the platform to find the most effective data-curation strategies.
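For readers who want to try the released weights, a minimal sketch of loading one of the checkpoints with the Hugging Face transformers library might look like the following. The repo id and loading options here are assumptions based on the collection linked below; the model card may require additional dependencies or settings to load correctly.

```python
# Sketch: loading a DCLM checkpoint from the Hugging Face Hub.
# The repo id "apple/DCLM-7B" is an assumption; check the collection page
# for the exact id and any extra dependencies the model card lists.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "apple/DCLM-7B"  # hypothetical identifier
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

prompt = "The quick brown fox"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```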


https://huggingface.co/collections/mlfoundations/dclm-669938432ef5162d0d0bc14b

The core advantage of DCLM lies in its structured workflow. Researchers can choose models of different sizes as needed, ranging from 412 million to 7 billion parameters, and can experiment with different data-curation methods, such as deduplication and filtering. Through these systematic experiments, researchers can clearly evaluate the quality of different datasets. This not only lays a foundation for future research, but also helps clarify how improving the dataset improves model performance.
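As one concrete example of the curation methods mentioned above, exact deduplication can be done by hashing normalized document text and dropping repeats. The sketch below is an illustrative simplification; a production pipeline such as DCLM's would also use fuzzy, near-duplicate detection.

```python
# Sketch: exact deduplication by hashing normalized document text.
# Real pipelines typically also apply fuzzy (near-duplicate) methods such as MinHash.
import hashlib

def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivially different copies hash the same."""
    return " ".join(text.lower().split())

def deduplicate(docs):
    seen = set()
    unique_docs = []
    for doc in docs:
        digest = hashlib.sha256(normalize(doc).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique_docs.append(doc)
    return unique_docs

corpus = ["Hello   world", "hello world", "A different document"]
print(deduplicate(corpus))  # the second copy of "hello world" is dropped
```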

For example, using the benchmark dataset built with DCLM, the research team trained a 7 billion parameter language model that achieved 64% 5-shot accuracy on the MMLU benchmark. That is 6.6 percentage points higher than the previous state of the art while using 40% less computing resources, and the DCLM baseline model's performance is also comparable to Mistral-7B-v0.3 and Llama 3 8B, which required far more compute to train.
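For context, 5-shot MMLU accuracy of this kind is typically measured with a standard evaluation harness. Below is a hedged sketch using EleutherAI's lm-evaluation-harness Python API, assuming the model loads through the ordinary Hugging Face backend; the repo id and arguments are assumptions, not the DCLM team's published evaluation setup.

```python
# Sketch: measuring 5-shot MMLU accuracy with EleutherAI's lm-evaluation-harness.
# The repo id and arguments are assumptions; this is not the DCLM team's exact setup.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                               # Hugging Face model backend
    model_args="pretrained=apple/DCLM-7B",    # hypothetical repo id
    tasks=["mmlu"],
    num_fewshot=5,
)
print(results["results"])  # per-task and aggregate accuracy numbers
```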

The launch of DCLM provides a new benchmark for language model research, helping scientists to systematically improve model performance while reducing the required computing resources.

 
