On the 30th local time, OpenAI announced that, effective immediately, it is opening the GPT-4o voice mode (Alpha version) to a subset of ChatGPT Plus users, with a rollout to all ChatGPT Plus subscribers planned for this fall.
In May of this year, OpenAI CTO Mira Murati said of the model:
"With GPT-4o, we trained a new unified model end-to-end across text, vision, and audio, meaning that all inputs and outputs are processed by the same neural network.
Since GPT-4o is our first model to combine all of these modalities, we are still in the early stages of exploring the model's capabilities and its limitations."
OpenAI had originally planned to invite a small group of ChatGPT Plus users to test the GPT-4o voice mode at the end of June this year, but in June it announced a delay, saying it needed more time to polish the model and to improve its ability to detect and reject certain content.
According to previously disclosed information, the voice mode based on GPT-3.5 has an average voice response latency of 2.8 seconds, while GPT-4's is 5.4 seconds, which falls short for voice conversation; the upcoming GPT-4o is expected to cut this latency dramatically, enabling nearly seamless dialogue.
The GPT-4o voice mode responds quickly and sounds much like a real person. OpenAI further claims that it can sense emotional tones in speech, including sadness, excitement, or singing.
OpenAI spokesperson Lindsay McCallum said that ChatGPT cannot imitate other people's voices, including those of private individuals and public figures, and will block output that differs from its preset voices.