Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Gemini 3.1 Pro Preview | 19 Feb 2026 | 0.81 | Search + Python + Browse | Yes | - | |
| GPT 5 | 07 Aug 2025 | 0.78 | With Thinking, Pass @ 1 | Yes | Source | |
| Gemini 3.1 Flash Lite Preview | 03 Mar 2026 | 0.77 | - | Yes | Source | |
| GPT 5 Mini | 07 Aug 2025 | 0.74 | High Reasoning Effort | Yes | Source | |
| Nova 2 Pro | 02 Dec 2025 | 0.64 | - | Yes | Source | |
| GPT 5 Nano | 07 Aug 2025 | 0.63 | High Reasoning Effort | Yes | Source | |
| Nova 2 Lite | 02 Dec 2025 | 0.62 | - | Yes | Source |