Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| GPT 5 Pro | 07 Aug 2025 | 54.60 | LLM Stats (ZeroEval) | inferred family alias from gpt-5.4 (score=0.4083; benches=19) | Yes | Source | |
| GPT 5 Search API | 14 Oct 2025 | 54.60 | LLM Stats (ZeroEval) | inferred family alias from gpt-5.4 (score=0.3050; benches=19) | Yes | Source | |
| GPT 5.4 | 05 Mar 2026 | 54.60 | - | Yes | Source | |
| Gemini 3 Flash Preview | 17 Dec 2025 | 49.40 | LLM Stats (ZeroEval) | Yes | Source | |
| MiniMax M2.7 | 18 Mar 2026 | 46.30 | LLM Stats (ZeroEval) | Yes | Source | |
| GPT 5.2 Chat | 11 Dec 2025 | 46.30 | LLM Stats (ZeroEval) | inferred alias from gpt-5.2-2025-12-11 | Yes | Source | |
| MiniMax M2.1 | 23 Dec 2025 | 43.50 | LLM Stats (ZeroEval) | Yes | Source | |
| GPT 5.4 Mini | 17 Mar 2026 | 42.90 | - | Yes | Source | |
| Qwen 3.6 Plus | 01 Apr 2026 | 39.80 | LLM Stats (ZeroEval) | Yes | Source | |
| Qwen 3.5 397B A17B | 16 Feb 2026 | 38.30 | LLM Stats (ZeroEval) | Yes | Source | |
| GPT 5.4 Nano | 17 Mar 2026 | 35.50 | - | Yes | Source | |
| DeepSeek V3.2 Speciale | 01 Dec 2025 | 35.20 | LLM Stats (ZeroEval) | Yes | Source |