Mobvoi Open-Sources Part of the Training Data for Its Large Model "Sequence Monkey"

Mobvoi announced that it will release part of the training data for its hyperscale language model "Sequence Monkey" to the public, under the name "Sequence Monkey Open Source Dataset 1.0".

Sequence Monkey is one of Mobvoi's core technologies. According to the company, it has strong general-purpose representation and reasoning capabilities and has performed well in areas such as question answering, natural language processing, machine translation, and text summarization, improving productivity and data-processing capability.

To promote continued progress in large language model technology, Mobvoi decided to open-source part of its training data. The "Sequence Monkey Open Source Dataset 1.0" includes a general Chinese text corpus, a corpus of classical poetry with modern-language translations, and a text generation corpus. The company says these corpora have been carefully selected and organized to ensure high quality and an easy-to-use data format. The dataset is also released under a permissive license, which makes it easy for developers and researchers to access.

With this release, Mobvoi hopes to attract more people and teams to the research and application of large language models and to advance the technology together. The company believes that the open-source dataset will encourage academic exchange and cooperation and accelerate innovation in related fields.

Project address: https://github.com/mobvoi/seq-monkey-data

