o3 Pro
OpenAI
Highlights
Top benchmark results for openai/o3-pro-2025-06-10.
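Aider-Polyglot: 0.85 (rank #2)
AIME 2024: 0.93 (rank #6)
ARC-AGI-1: 0.59 (rank #7)
ARC-AGI-2: 0.05 (rank #8)
Codeforces: 2748 (rank #1)
Confabulations: 14.22 (rank #14)
GPQA Diamond: 0.84 (rank #10)
NYT Connections: 0.87 (rank #2)
Thematic Generalisation: 1.82 (rank #6)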
Benchmark table
Detailed scores across tracked benchmarks.
| Benchmark | Category | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|
| Aider-Polyglot | code | 0.85 | High Reasoning Effort | No | Source |
| AIME 2024 | math | 0.93 | - | Yes | Source |
| ARC-AGI-1 | - | 0.59 | High Reasoning Effort | No | Source |
| ARC-AGI-2 | - | 0.05 | High Reasoning Effort | No | Source |
| Codeforces | - | 2748 | - | Yes | Source |
| Confabulations | - | 14.22 | Medium Reasoning Effort | No | Source |
| GPQA Diamond | general-knowledge | 0.84 | - | Yes | Source |
| NYT Connections | - | 0.87 | Medium Reasoning Effort | No | Source |
| Thematic Generalisation | - | 1.82 | Medium Reasoning Effort | No | Source |
Benchmark comparisons
ARC-AGI-1
Score 0.59, rank #7 of 31 models.