AI Stats
AIME 2024
About this Benchmark
Unfortunately there isn't a description for this benchmark yet.
Model Performance
[Chart: AIME 2024, top 20 of 36 models]
Models Using This Benchmark (36)
OpenAI (12 models)
o3 Preview: 96.7%
o4 Mini: 93.4%
o3: 91.6%
o3-mini: 87.3%
o1-pro: 86.0%
o1: 74.3%
GPT-4.1 Mini: 49.6%
GPT-4.1: 48.1%
o1-preview: 42.0%
GPT-4.5: 36.7%
GPT-4.1 Nano: 29.4%
GPT-4o: 13.1%
Qwen (5 models)
DeepSeek (4 models)
Anthropic (3 models)
Google (3 models)
Microsoft (3 models)
xAI (3 models)
IBM (2 models)
Moonshot (1 model)