This morning Beijing time, OpenAI announced on X (Twitter) that the much-anticipated ChatGPT voice assistant feature will be delayed because the company needs to ensure it can "securely and efficiently" handle requests from millions of users.
The gist of the notice is as follows:
We want to share an update on the advanced voice mode we demonstrated in the Spring Update, which we're still very excited about:
We had planned to start rolling out the beta to a small group of ChatGPT Plus users in late June, but it will take another month to reach launch standards. For example, we are improving the model's ability to detect and reject certain content. We are also working hard to improve the user experience and prepare the infrastructure to scale to millions of users while maintaining real-time responsiveness.
As part of our iterative deployment strategy, we will start testing with a small number of users, collect feedback, and expand based on that feedback. We plan to make it available to all Plus users in the autumn. The specific timing depends on whether we can meet high safety and reliability standards. We are also working on launching new video and screen sharing capabilities and will inform you promptly.
ChatGPT's advanced voice mode can understand and respond to emotions and nonverbal cues, bringing us closer to having real-time, natural conversations with AI. Our mission is to bring you these new, carefully designed experiences.
This means that users who want to "talk" with the ChatGPT voice assistant will have to wait a while longer. On May 14 this year, OpenAI released a new GPT-4o model, which can understand the user's voice questions and answer them by voice.