Loading...
Loading...
AI Stats
Home
Comparisons
Providers
Models
Benchmarks
Prices
Open menu
AIME 2024
42
Total Models
71.26
Average Score
13.10 - 96.70
Score Range
1
Max Score Achievable
Top 10 Model Performance
Top 10 of 42
Models Using This Benchmark
(42)
OpenAI
(15 models)
o3 Preview
openai
96.70%
gpt-oss-120b
openai
96.60%
gpt-oss-20b
openai
96.00%
o4 Mini
openai
93.40%
o3 Pro
openai
93.00%
o3
openai
91.60%
o3-mini
openai
87.30%
o1 pro
openai
86.00%
o1
openai
74.30%
GPT-4.1 Mini
openai
49.60%
GPT-4.1
openai
48.10%
o1 preview
openai
42.00%
GPT-4.5
openai
36.70%
GPT-4.1 Nano
openai
29.40%
GPT-4o
openai
13.10%
Qwen
(5 models)
Qwen3 235B A22B
qwen
85.70%
Qwen3 32B
qwen
81.40%
Qwen3 30B A3B
qwen
80.40%
QwQ-32B
qwen
79.50%
QwQ-32B-Preview
qwen
50.00%
DeepSeek
(4 models)
R1
deepseek
91.40%
R1
deepseek
79.80%
DeepSeek-V3 0324
deepseek
59.40%
DeepSeek-V3
deepseek
39.20%
xAI
(4 models)
Grok 3 Beta
x-ai
95.80%
Grok 3 Mini Beta
x-ai
95.80%
Grok 3 Mini
x-ai
90.70%
Grok 3
x-ai
60.00%
Microsoft
(3 models)
Phi 4 Reasoning Plus
microsoft
81.30%
Phi 4 Reasoning
microsoft
75.30%
Phi 4 Mini Reasoning
microsoft
57.50%
Anthropic
(2 models)
Claude 3.7 Sonnet
anthropic
80.00%
Claude 3.5 Sonnet
anthropic
16.00%
Google
(2 models)
Gemini 2.5 Flash Preview
google
88.00%
Gemini 2.0 Flash
google
32.00%
IBM
(2 models)
Granite 3.3 8B Base
ibm
81.20%
Granite 3.3 8B Instruct
ibm
81.20%
Mistral
(2 models)
Magistral Medium
mistral
73.60%
Magistral Small
mistral
70.70%
Moonshot
(2 models)
Kimi k1.5
moonshotai
77.50%
Kimi K2 Instruct
moonshotai
69.60%
MiniMax
(1 model)
MiniMax M1
minimax
86.00%