Meta AI develops MobileLLM, a compact language model for mobile devices with only 350 million parameters

Meta AI researchers have introduced MobileLLM, a language model designed for efficient operation on smartphones and other resource-constrained devices. The study, published on June 27, 2024, challenges the prevailing assumption that effective AI models must be large.

The research team, which includes members of Meta Reality Labs, PyTorch, and Meta AI Research (FAIR), focused on optimizing models with fewer than a billion parameters, a fraction of the parameters of models like GPT-4, which are estimated to have more than a trillion.

The main innovations of MobileLLM include (a rough sketch follows the list):

  1. Prioritizing model depth over width
  2. Implementing embedding sharing and grouped-query attention
  3. Using an immediate block-wise weight sharing technique
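The snippet below is a minimal, hypothetical PyTorch sketch, not Meta's released code, of how these ideas might fit together: a deep-and-thin stack of blocks, grouped-query attention in which several query heads share one key/value head, input/output embedding sharing, and immediate block-wise weight sharing by executing each block twice. All class names, dimensions, and layer counts are illustrative assumptions.

```python
# Illustrative sketch only -- not MobileLLM's actual implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GroupedQueryAttention(nn.Module):
    """Attention where several query heads share one key/value head."""
    def __init__(self, dim, n_heads, n_kv_heads):
        super().__init__()
        assert n_heads % n_kv_heads == 0
        self.n_heads, self.n_kv_heads = n_heads, n_kv_heads
        self.head_dim = dim // n_heads
        self.q_proj = nn.Linear(dim, n_heads * self.head_dim, bias=False)
        self.k_proj = nn.Linear(dim, n_kv_heads * self.head_dim, bias=False)
        self.v_proj = nn.Linear(dim, n_kv_heads * self.head_dim, bias=False)
        self.o_proj = nn.Linear(n_heads * self.head_dim, dim, bias=False)

    def forward(self, x):
        b, t, _ = x.shape
        q = self.q_proj(x).view(b, t, self.n_heads, self.head_dim).transpose(1, 2)
        k = self.k_proj(x).view(b, t, self.n_kv_heads, self.head_dim).transpose(1, 2)
        v = self.v_proj(x).view(b, t, self.n_kv_heads, self.head_dim).transpose(1, 2)
        # Repeat K/V so each group of query heads reuses the same key/value head.
        rep = self.n_heads // self.n_kv_heads
        k = k.repeat_interleave(rep, dim=1)
        v = v.repeat_interleave(rep, dim=1)
        out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        return self.o_proj(out.transpose(1, 2).reshape(b, t, -1))

class TinyDeepLM(nn.Module):
    """Deep-and-thin decoder with a tied (shared) input/output embedding."""
    def __init__(self, vocab=32000, dim=512, n_unique_layers=15, n_heads=8, n_kv_heads=2):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        self.blocks = nn.ModuleList(
            [GroupedQueryAttention(dim, n_heads, n_kv_heads) for _ in range(n_unique_layers)]
        )
        self.norm = nn.LayerNorm(dim)

    def forward(self, tokens):
        h = self.embed(tokens)
        for blk in self.blocks:
            # Immediate block-wise weight sharing: run each block twice in a row,
            # doubling effective depth without adding parameters.
            # (Feed-forward sublayers are omitted for brevity.)
            h = h + blk(h)
            h = h + blk(h)
        h = self.norm(h)
        # Embedding sharing: reuse the input embedding matrix as the output projection.
        return h @ self.embed.weight.T

if __name__ == "__main__":
    tokens = torch.randint(0, 32000, (1, 16))
    print(TinyDeepLM()(tokens).shape)  # torch.Size([1, 16, 32000])
```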

These design choices allow MobileLLM to outperform previous models of similar size by 2.7% to 4.3% on common benchmark tasks. While these single-digit improvements may seem small, they represent significant progress in the highly competitive field of language model development.

Notably, on certain API call tasks, the 350 million parameter version of MobileLLM demonstrated accuracy comparable to the much larger 7 billion parameter LLaMA-2 model, suggesting that for certain specific applications, more compact models may provide similar functionality while using fewer computational resources.

The development of MobileLLM coincides with growing interest in more efficient AI models. As progress on very large language models shows signs of slowing, researchers are increasingly exploring the potential of more compact, specialized designs. Despite the “LLM” in the name, the focus on efficiency and on-device deployment puts MobileLLM in the same category as what some researchers call small language models (SLMs).

While MobileLLM is not yet available to the public, Meta has open-sourced the pre-training code, allowing other researchers to build on its work. As this technology develops, it could bring more advanced AI capabilities to personal devices, although the timeline and specific features remain uncertain.
