December 7 news: OpenAI has launched a 12-day "shipmas" event featuring a series of new features, products, and demos. On the second day of the event, OpenAI unveiled Reinforcement Fine-Tuning (RFT), which it describes as the first technique of its kind, intended to help developers and machine learning engineers build expert models for complex, domain-specific tasks.
The technique improves a model's reasoning ability and accuracy on domain-specific tasks through a new model customization approach: developers fine-tune a model on a high-quality set of tasks and grade the model's responses against reference answers.
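As a rough illustration of that workflow, the sketch below shows what such a task set might look like as JSONL records that pair a prompt with a reference answer. The field names (`messages`, `reference_answer`) and file layout are assumptions for illustration only, not a confirmed OpenAI schema.

```python
import json

# Hypothetical RFT-style task set: each record pairs a domain-specific
# prompt with a reference answer used later for grading.
# The field names here are illustrative assumptions, not OpenAI's schema.
tasks = [
    {
        "messages": [
            {"role": "user", "content": "A 45-year-old patient presents with ..."}
        ],
        "reference_answer": "FOXE1",
    },
    {
        "messages": [
            {"role": "user", "content": "Which clause of this policy excludes ..."}
        ],
        "reference_answer": "Section 4(b)",
    },
]

# Write the task set to a JSONL file, one task per line.
with open("rft_tasks.jsonl", "w", encoding="utf-8") as f:
    for task in tasks:
        f.write(json.dumps(task) + "\n")
```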
Introduction to Reinforcement Fine-Tuning
1AI notes OpenAI's official description: developers can customize OpenAI's models using anywhere from dozens to thousands of high-quality tasks and grade the model's responses against provided reference answers. According to OpenAI, this reinforces how the model reasons through similar problems and improves its accuracy on tasks specific to that domain.
Unlike standard fine-tuning, RFT uses reinforcement learning algorithms, which OpenAI says can lift model performance from a high-school level to expert, PhD-level ability on these tasks.
RFT differs from supervised fine-tuning in that, instead of having the model imitate its inputs, it teaches the model to reason in entirely new ways: by scoring the model's answers and reinforcing correct lines of reasoning, RFT can significantly improve performance with as few as a handful of examples.
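To make the "score the answer, reinforce the reasoning" idea concrete, here is a minimal sketch of a grader that compares a model's final answer with the reference answer and returns a reward between 0 and 1. The function name and the exact grading rule are assumptions for illustration; OpenAI has not published the graders it uses internally.

```python
def grade_response(model_answer: str, reference_answer: str) -> float:
    """Return a reward in [0, 1] for a single task.

    Illustrative grader, not OpenAI's: an exact match scores 1.0,
    an answer that merely contains the reference scores 0.5,
    everything else scores 0.0.
    """
    predicted = model_answer.strip().lower()
    reference = reference_answer.strip().lower()
    if predicted == reference:
        return 1.0
    if reference in predicted:
        return 0.5
    return 0.0


# During RFT, rewards like these would feed a reinforcement learning update
# that strengthens the chains of reasoning leading to high-scoring answers.
print(grade_response("The gene is FOXE1.", "FOXE1"))  # 0.5
print(grade_response("FOXE1", "FOXE1"))               # 1.0
```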
RFT lets users create purpose-built models from their own golden datasets and apply them to areas that require specialized knowledge, such as law, finance, engineering, and insurance.
Who Reinforcement Fine-Tuning Is Aimed At
OpenAI is inviting applications from research institutions, universities, and enterprises, particularly organizations where experts currently perform a narrow set of complex tasks and would benefit from AI assistance.
OpenAI says reinforcement fine-tuning works well on tasks whose outcomes have an objectively "correct" answer that most experts would agree on, so it expects the technique to do best in fields such as law, insurance, healthcare, finance, and engineering.
Participants will get early access to the alpha version of the reinforcement fine-tuning API and will be able to test it on their domain-specific tasks; OpenAI also encourages participants to share datasets and work together to improve OpenAI's models.
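For context, the sketch below shows how a fine-tuning job is created today with the OpenAI Python SDK. The alpha reinforcement fine-tuning API was not public at the time of this announcement, so any RFT-specific parameters (such as a grader or reference-answer configuration) are omitted here; the illustrative task-set schema above would also need adapting to whatever format the final API expects.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Upload the JSONL task set prepared earlier.
training_file = client.files.create(
    file=open("rft_tasks.jsonl", "rb"),
    purpose="fine-tune",
)

# Create a fine-tuning job. This is the standard (supervised) endpoint;
# the alpha RFT API is expected to take additional grading configuration
# that OpenAI had not published at the time of writing.
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-4o-mini-2024-07-18",  # placeholder base model for illustration
)

print(job.id, job.status)
```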
OpenAI expects to release reinforcement fine-tuning publicly in early 2025.
OpenAI CEO Sam Altman said that reinforcement fine-tuning "works surprisingly well; it's one of my biggest surprises of 2024."