Epoch AI has launched FrontierMath, a new math benchmark that tests the mathematical reasoning of leading large models. Models including GPT-4o have solved fewer than 2% of the problems on the benchmark. Designed by more than 60 leading mathematicians, the benchmark covers areas ranging from number theory to algebraic geometry and is extremely difficult. In the future, the project will add more problems and refine the evaluation process. (Quantum Bits)