Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| OpenAI | GPT 5 | 07 Aug 2025 | 95.20% | High Reasoning Effort | Yes | Source |
| OpenAI | GPT 5 Mini | 07 Aug 2025 | 84.30% | High Reasoning Effort | Yes | Source |
| OpenAI | GPT 4.1 | 14 Apr 2025 | 57.20% | - | Yes | Source |
| OpenAI | o4 Mini | 16 Apr 2025 | 56.40% | High Reasoning Effort | Yes | Source |
| OpenAI | o3 | 16 Apr 2025 | 55% | High Reasoning Effort | Yes | Source |
| OpenAI | GPT 4.1 Mini | 14 Apr 2025 | 47.20% | - | Yes | Source |
| OpenAI | GPT 5 Nano | 07 Aug 2025 | 43.20% | High Reasoning Effort | Yes | Source |
| OpenAI | GPT 4.1 Nano | 14 Apr 2025 | 36.60% | - | Yes | Source |