OpenAI secretly tested GPT-4o, and it topped the chatbot arena rankings

OpenAI William Fedus, an employee of LMSYS, confirmed on social media platform X on Monday that ChatbotsThe mysterious chatbot "gpt-chatbot" that performed well in the Chatbot Arena is the new artificial intelligence model they just released. GPT-4oFedus also revealed that GPT-4o topped the Arena leaderboard in the test, achieving the highest score ever.

“GPT-4o is our most advanced cutting-edge model,” Fedus wrote on Twitter. “We’ve been testing a version of it in Arena under the name ‘im-also-a-good-gpt2-chatbot’.”

OpenAI secretly tested GPT-4o, and it topped the chatbot arena rankings

Chatbot Arena is a website where visitors can talk to two random AI language models at the same time, without knowing which is which, and then choose the model that provides the better response.

Starting in April this year, OpenAI tested multiple versions of GPT-4o in the arena. The model first appeared under the name "gpt2-chatbot", then became "im-a-good-gpt2-chatbot", and finally "im-also-a-good-gpt2-chatbot".

Since GPT-4o was released today, multiple sources have revealed that the model has topped LMSYS’s internal leaderboard by a huge margin, surpassing the previous top-ranked models Claude 3 Opus and GPT-4 Turbo.

lmsys.org The official account of shared a chart and wrote: "The 'gpt2-chatbot' series model has just soared to the top of the list, surpassing all other models by a significant margin (about 50 Elo), and it has become the most powerful model in the arena. This is an internal screenshot. The public version of 'gpt-4o' has now entered the arena and will soon appear on the public leaderboard!"

OpenAI secretly tested GPT-4o, and it topped the chatbot arena rankings

As of press time, "im-also-a-good-gpt2-chatbot" has an Elo score of 1309, ahead of GPT-4-Turbo-2023-04-09 with 1253 and Claude 3 Opus with 1246. Claude 3 and GPT-4 Turbo had been competing for the top spot on the leaderboard until the three "gpt2-chatbots" showed up and messed things up.

statement:The content is collected from various media platforms such as public websites. If the included content infringes on your rights, please contact us by email and we will deal with it as soon as possible.

{{userData.name}}Verify

OpenAI secretly tested GPT-4o, and it topped the chatbot arena rankings

To counter GPT-4o, Google launches Astra project: low-latency chat interaction within mobile phone camera

Tencent's Hunyuan Wenshengtu model is open source: equipped with the first Chinese-English bilingual DiT architecture, free for commercial use

AI Weibo

AI Applications

5000+ AI applications! Updated daily

AIAICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

To counter GPT-4o, Google launches Astra project: low-latency chat interaction within mobile phone camera

Tencent's Hunyuan Wenshengtu model is open source: equipped with the first Chinese-English bilingual DiT architecture, free for commercial use

The New York Times sued Microsoft and OpenAI for using its articles to train large models on copyright grounds

OpenAI releases the all-round model GPT-4o, which is free for all users!

OpenAI CEO: GPT-5 will be special and may be similar to a "virtual brain"

Big news! ChatGPT gets a major upgrade, OpenAI releases GPT-4o Mini

AI Applications

5000+ AI applications! Updated daily

AIAICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow