Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| | Mistral Large 1.0 | 26 Feb 2024 | 0.52 | LLM Stats (ZeroEval); inferred family alias from mistral-large-latest (score=0.3650; benches=5) | Yes | Source |
| | LearnLM 1.5 Pro Experimental | 19 Nov 2024 | 0.46 | inferred family alias from gemini-1.5-pro (score=0.3700; benches=23) | Yes | Source |
| | Gemini 1.5 Pro 001 | 23 May 2024 | 0.46 | inferred alias from gemini-1.5-pro | Yes | Source |
| | Gemini 1.5 Pro Exp (2024-08-01) | 01 Aug 2024 | 0.46 | inferred alias from gemini-1.5-pro | Yes | Source |
| | Gemini 1.5 Pro Exp (2024-08-27) | 27 Aug 2024 | 0.46 | inferred alias from gemini-1.5-pro | Yes | Source |
| | Gemini Robotics ER 1.5 Preview | 25 Sep 2025 | 0.46 | inferred family alias from gemini-1.5-pro (score=0.3717; benches=23) | Yes | Source |
| | Gemini 1.5 Flash 001 | 23 May 2024 | 0.35 | inferred alias from gemini-1.5-flash | Yes | Source |
| | Gemini 1.5 Flash Preview | 14 May 2024 | 0.35 | inferred alias from gemini-1.5-flash | Yes | Source |