GPT-5.1
OpenAI
Highlights
Top benchmark results for openai/gpt-5-1-2025-11-12.
Benchmark table
Detailed scores across tracked benchmarks.
| Benchmark | Category | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|
| ARC-AGI-1 | - | 0.73 | High Reasoning Effort | No | Source |
| ARC-AGI-2 | - | 0.18 | High Reasoning Effort | No | Source |
| FACTS Benchmark Suite | - | 0.70 | - | Yes | Source |
| MathArena Apex | - | 0.01 | High Reasoning Effort | No | Source |
| Vending Bench 2 | - | 1473 | - | No | Source |
Benchmark comparisons
ARC-AGI-2
Score: 0.18
Rank: #3 of 29 models