Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Nova 2 Pro | 02 Dec 2025 | 0.93 | - | Yes | Source | |
| Gemini 3.1 Pro Preview | 19 Feb 2026 | 0.91 | - | Yes | - | |
| Claude Opus 4.5 | 24 Nov 2025 | 0.89 | Avg@5, 64k Thinking | Yes | Source | |
| GPT 5 | 07 Aug 2025 | 0.81 | With Thinking, Pass @ 1 | Yes | Source | |
| GPT 5 Mini | 07 Aug 2025 | 0.78 | High Reasoning Effort | Yes | Source | |
| Nova 2 Lite | 02 Dec 2025 | 0.77 | - | Yes | Source | |
| GPT 5 Nano | 07 Aug 2025 | 0.62 | High Reasoning Effort | Yes | Source |