"The first model in China that can match the voice capabilities of GPT-4o", Lingo voice AI model opens for internal testing

West Lake invested by Jinke TomcatXinchenIn August this year, Xinchen was launched Lingo Voice big model, the first end-to-end speech model in China, was launched today (August 24)Closed betareserve.

In the announcement released on August 21, the official introduction stated that compared with traditional TTS, the end-to-end speech big model is a more comprehensive technology. It not only can recognize speech, but also integrates natural language processing, intent recognition, dialogue management, and speech synthesis. It realizes the complete interactive process from speech input to speech feedback, greatly enriching the depth and breadth of human-computer interaction.

Xinchen Lingo voice model is the first model in China that has the same voice capabilities as GPT-4o, the technical capabilities have the following three notable characteristics:

Native speech understanding:As an end-to-end model, Lingo can not only recognize text information in speech, but also accurately capture other important features such as emotion, tone, pitch, and even ambient sound, helping the model to understand the speech content more comprehensively, thereby providing a more natural and vivid interactive experience.
Multiple voice styles:Lingo can adaptively adjust the speed, pitch, and noise intensity of speech according to the context and user instructions, and can generate voice responses in a variety of styles such as conversation, singing, and crosstalk, effectively improving the flexibility and adaptability of the model in different application scenarios.
Voice Mode Super Compression:Lingo uses a voice codec with a compression rate of hundreds of times, which can compress voice to an extremely short length, significantly reducing computing and storage costs while helping the model generate high-quality voice content.

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

"The first model in China with voice capabilities comparable to GPT-4o", Lingo voice AI model opens for internal testing

Adam, a humanoid robot, is working for the first time at Walmart: it can provide 200 cups of tea and coffee a day

Meta releases Sapiens visual model to enable AI to analyze and understand human actions in images/videos

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

Related content:

Adam, a humanoid robot, is working for the first time at Walmart: it can provide 200 cups of tea and coffee a day

Meta releases Sapiens visual model to enable AI to analyze and understand human actions in images/videos

ByteDance Dreamina video generation officially opens for internal testing on a first-come, first-served basis

Chrome browser will have a built-in AI model Gemini Nano, the new version will start internal testing

SenseTime launches Vimi video generation large model C-end application Vimi camera open for internal testing

Launched on August 30! iFlytek Spark Voice Model Update "Extremely Fast Super Anthropomorphic Interaction"

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow