Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Claude Opus 4.7 | 16 Apr 2026 | 77.30% | - | Yes | Source | |
| Qwen 3.6 Plus | 01 Apr 2026 | 74.10% | - | Yes | Source | |
| GLM 5.1 | - | 71.80% | Public Set | Yes | Source | |
| Gemini 3.1 Pro Preview | 19 Feb 2026 | 69.20% | - | Yes | - | |
| GLM 5 | 11 Feb 2026 | 67.80% | - | Yes | Source | |
| GPT 5 Pro | 07 Aug 2025 | 67.20% | inferred family alias from gpt-5.4 (score=0.4083; benches=19) | Yes | Source | |
| GPT 5.4 | 05 Mar 2026 | 67.20% | - | Yes | Source | |
| GPT 5 Search API | 14 Oct 2025 | 67.20% | inferred family alias from gpt-5.4 (score=0.3050; benches=19) | Yes | Source | |
| Claude Opus 4.6 | 05 Feb 2026 | 62.70% | High effort | Yes | Source | |
| Nova 2 Pro | 02 Dec 2025 | 61.60% | - | Yes | Source | |
| Claude Sonnet 4.6 | 17 Feb 2026 | 61.30% | - | Yes | Source | |
| GPT 5.2 Chat | 11 Dec 2025 | 60.60% | inferred alias from gpt-5.2-2025-12-11 | Yes | Source | |
| GPT 5.4 Mini | 17 Mar 2026 | 57.70% | - | Yes | Source | |
| Gemini 3 Flash Preview | 17 Dec 2025 | 57.40% | - | Yes | Source | |
| GPT 5.4 Nano | 17 Mar 2026 | 56.10% | - | Yes | Source | |
| Nova 2 Lite | 02 Dec 2025 | 24.60% | - | Yes | Source |