Smart Spectrum announces the launch of GLM-4-Voice, an end-to-end emotional voice model. According to the official announcement, the model can understand emotions, express emotion, and empathize with users; it adjusts itself accordingly, supports multiple languages and dialects, offers lower latency, and can be interrupted at any time. It is now available to users in the "Zhipu Qingyan" app.
The GLM-4-Voice is described as having the following features:
- Emotional expression and emotional resonance: its voice carries different emotions and subtle nuances, such as happiness, sadness, anger, and fear.
- Adjustable speaking rate: within the same round of conversation, you can ask it to speak faster or slower.
- Interruption at any time with flexible commands: it adjusts the content and style of its voice output based on real-time user commands, supporting more flexible dialog interactions.
- Multi-language and multi-dialect support: GLM-4-Voice currently supports Chinese and English speech as well as dialects from across China, and is especially strong in Cantonese and the Chongqing and Beijing dialects.
- Combined with video calling, it can see as well as talk: a video calling feature will be available soon.
In addition, AutoGLM is equipped with phone-use capability, allowing it to simulate human operation of a phone from simple text or voice commands. It is not limited to simple task scenarios or API calls, does not require users to manually build complex and cumbersome workflows, and its operating logic resembles that of a human.
GLM-4-Voice has been open-sourced at the same time, and is officially described as Smart Spectrum's first open-source end-to-end multimodal model. IT Home attaches the address below.
Code Repository:
- https://github.com/THUDM/GLM-4-Voice