Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Claude Sonnet 4.6 | 17 Feb 2026 | 63.30 | Max thinking; Vals AI | Yes | Source | |
| GPT 5.4 Pro | 05 Mar 2026 | 61.50 | - | Yes | Source | |
| Claude Opus 4.6 | 05 Feb 2026 | 60.70 | LLM Stats (ZeroEval) | Yes | Source | |
| GPT 5 Pro | 07 Aug 2025 | 56.00 | LLM Stats (ZeroEval) | inferred family alias from gpt-5.4 (score=0.4083; benches=19) | Yes | Source | |
| GPT 5 Search API | 14 Oct 2025 | 56.00 | LLM Stats (ZeroEval) | inferred family alias from gpt-5.4 (score=0.3050; benches=19) | Yes | Source | |
| GPT 5.4 | 05 Mar 2026 | 56.00 | - | Yes | Source |