OpenAI has released its latest flagship large model, GPT-4o. The model is not only free to use but can also hear, see, and speak, providing a smooth interactive experience with almost no perceptible latency, much like a video call with a human being.
Features of GPT-4o
- Omni-modal Input and Output: GPT-4o accepts any combination of text, audio, and images as input and generates any combination of text, audio, and image output.
- Fast Response: The model responds to audio input in as little as 232 ms, with an average of 320 ms, which is comparable to human response time in conversation.
- Free Access: GPT-4o will be available to all users free of charge, including features of the ChatGPT Plus subscription such as vision, web browsing, memory, and code execution.
During the livestream, CTO Mira Murati demonstrated GPT-4o's real-time interactive capabilities, including the ability to be interrupted at any point in a conversation and to respond in an expressive range of tones.
Researcher William Fedus revealed that GPT-4o was one of the models previously A/B tested in the Chatbot Arena, where it outperformed GPT-4 Turbo.
API Availability
GPT-4o will also be available through the API. Compared with GPT-4 Turbo, it costs half as much, runs twice as fast, and has a rate limit five times higher.
Users online are already imagining applications for GPT-4o, such as helping blind or partially sighted people better understand their surroundings.
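To make the API ratios quoted in this section concrete, here is a minimal Python sketch that applies them to a baseline. The baseline figures (price, speed, and request limit) are hypothetical placeholders chosen for illustration only, not published numbers; only the three ratios (half the price, double the speed, five times the rate limit) come from the announcement as described here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ApiProfile:
    """Illustrative API characteristics. All concrete values are hypothetical."""
    price_per_million_tokens: float  # USD, placeholder value
    tokens_per_second: float         # placeholder value
    requests_per_minute: int         # placeholder value


def apply_announced_ratios(baseline: ApiProfile) -> ApiProfile:
    """Scale a baseline by the ratios quoted in the article:
    50% of the price, 2x the speed, 5x the rate limit."""
    return ApiProfile(
        price_per_million_tokens=baseline.price_per_million_tokens * 0.5,
        tokens_per_second=baseline.tokens_per_second * 2,
        requests_per_minute=baseline.requests_per_minute * 5,
    )


# Hypothetical baseline, not real pricing or limits.
baseline = ApiProfile(
    price_per_million_tokens=10.0,
    tokens_per_second=20.0,
    requests_per_minute=100,
)
gpt4o = apply_announced_ratios(baseline)
print(gpt4o)
```

Swapping in the real figures from the provider's pricing and rate-limit pages gives a quick estimate of how the same workload would cost and scale on the new model.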
Demo Highlights
During the livestream, OpenAI President Greg Brockman demonstrated GPT-4o's real-time translation capabilities, as well as two ChatGPT instances conversing and singing with each other.
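Technical Details
GPT-4o is a new model trained end-to-end, so all inputs and outputs are processed by the same neural network. This is a significant improvement over the previous voice mode, which chained separate models for transcription, text generation, and speech synthesis, adding latency and losing information such as tone of voice along the way.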
Future Outlook
Although OpenAI has not released a detailed technical report, the successful demonstration of GPT-4o has attracted widespread attention and discussion.
The release of GPT-4o not only demonstrates OpenAI's latest progress in AI but also gives the public a powerful, easy-to-use AI tool. As the technology continues to advance, GPT-4o can be expected to enable richer and more innovative applications.