Individual benchmark scores plotted by date.
| Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|
| GPT 5 | 07 Aug 2025 | 86.70% | High Reasoning Effort | Yes | Source |
| o3 | 16 Apr 2025 | 84.90% | High Reasoning Effort | Yes | Source |
| o4 Mini | 16 Apr 2025 | 79.50% | High Reasoning Effort | Yes | Source |
| GPT 4.1 | 14 Apr 2025 | 78.70% | - | Yes | Source |
| GPT 5 Mini | 07 Aug 2025 | 78.50% | High Reasoning Effort | Yes | Source |
| GPT 4.1 Mini | 14 Apr 2025 | 68.40% | - | Yes | Source |
| GPT 5 Nano | 07 Aug 2025 | 65.70% | High Reasoning Effort | Yes | Source |
| GPT 4.1 Nano | 14 Apr 2025 | 55.20% | - | Yes | Source |