OpenAI releases GPT-4o, an all-around model that can hear, see, speak and is free

OpenAI up to dateReleased its flagship large model GPT-4oThe model is not only free to use, but also has the combined ability to hear, see and speak, providing a silky smooth and latency-free interactive experience, as if you were having a video call with a human being.

OpenAI releases GPT-4o, an all-around model that can hear, see, speak and is free

Features of GPT-4o

  • Omnipotent Input and Output: The GPT-4o is capable of accepting any combination of text, audio and images as input and generating corresponding text, audio and image output.
  • Fast Response:The model responds to audio inputs in 232 ms to 320 ms, which is consistent with the speed of human dialog.
  • Free and Open:GPT-4o will be free and open to all users, including all the features of the ChatGPT Plus member version, such as visualization, networking, memorization, code execution, and so on.

During the livestream, CTO Murati demonstrated GPT-4o's real-time interactive capabilities, including interrupting conversations at any time and responding with a rich tone of voice.

Researcher William Fedus revealed that the GPT-4o was one of the models that was previously A/B tested in the Big Model Arena and had higher performance than the GPT-4-Turbo.

API Provision

GPT-4o will also be available as an API at 50% off, with double the speed and five times the number of calls per unit of time.

Netizens are already envisioning application scenarios for GPT-4o, such as helping blind or partially sighted people better understand the world.

Demo Highlights

OpenAI President Brockman demonstrated GPT-4o's real-time translation capabilities, as well as conversations and singing between two ChatGPTs during the livestream.

Technical details

GPT-4o is a new model trained end-to-end where all inputs and outputs are processed by the same neural network, which is a significant improvement over previous speech models.

future outlook

Although OpenAI has not released a detailed technical report, the successful demonstration of GPT-4o has attracted widespread attention and discussion.

The release of OpenAI's GPT-4o model not only demonstrates the company's AIup to dateprogress, and also provides the public with a powerful and easy-to-use AI tool. As the technology continues to advance, we can expect GPT-4o to bring even richer and more innovative application scenarios in the future.

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.
Information

OpenAI Venture Fund continues to expand and support multiple artificial intelligence startups

2024-5-14 9:38:40

Information

Smartisan Notepad iOS version updated to v4.0: Added AI writing function, 88.8 yuan per year

2024-5-14 9:41:12

Search