Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Gemini 1.5 Flash 8B | 15 Mar 2024 | 0.86 | LLM Stats (ZeroEval) | Yes | Source | |
| Gemini 1.5 Flash 8B Exp (2024-09-24) | 24 Sept 2024 | 0.86 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-flash-8b | Yes | Source | |
| Gemini 1.5 Flash 8B Exp (2024-08-27) | 27 Aug 2024 | 0.86 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-flash-8b | Yes | Source | |
| Gemini 1.5 Flash 001 | 23 May 2024 | 0.10 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-flash | Yes | Source | |
| Gemini 1.5 Flash Preview | 14 May 2024 | 0.10 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-flash | Yes | Source | |
| Gemini 1.5 Pro Exp (2024-08-01) | 01 Aug 2024 | 0.07 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-pro | Yes | Source | |
| Gemini 1.5 Pro 001 | 23 May 2024 | 0.07 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-pro | Yes | Source | |
| Gemini 1.5 Pro Exp (2024-08-27) | 27 Aug 2024 | 0.07 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-pro | Yes | Source | |
| LearnLM 1.5 Pro Experimental | 19 Nov 2024 | 0.07 | LLM Stats (ZeroEval) | inferred family alias from gemini-1.5-pro (score=0.3700; benches=23) | Yes | Source | |
| Gemini Robotics ER 1.5 Preview | 25 Sept 2025 | 0.07 | LLM Stats (ZeroEval) | inferred family alias from gemini-1.5-pro (score=0.3717; benches=23) | Yes | Source | |
| Gemini 1.0 Pro Vision 001 | 15 Feb 2024 | 0.06 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-1.0-pro | No | Source | |
| Gemini 1.0 Nano | 06 Dec 2023 | 0.06 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-1.0-pro | No | Source | |
| Gemini 1.0 Pro | 06 Dec 2023 | 0.06 | LLM Stats (ZeroEval) | No | Source | |
| Qwen 2.5 Omni 7B | - | 0.04 | LLM Stats (ZeroEval) | Yes | Source | |
| Qwen 2.5 Coder 7B | - | 0.04 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen2.5-omni-7b (score=0.4700; benches=45) | Yes | Source | |
| Qwen 2.5 Coder 3B | - | 0.04 | LLM Stats (ZeroEval) | inferred family alias from qwen2.5-omni-7b (score=0.3000; benches=45) | Yes | Source | |
| Qwen 2.5 Math 7B | - | 0.04 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen2.5-omni-7b (score=0.4767; benches=45) | Yes | Source | |
| Qwen 2.5 Math 7B PRM800K | - | 0.04 | LLM Stats (ZeroEval) | inferred family alias from qwen2.5-omni-7b (score=0.3696; benches=45) | Yes | Source | |
| Qwen 2.5 Math PRM 7B | - | 0.04 | LLM Stats (ZeroEval) | inferred family alias from qwen2.5-omni-7b (score=0.4092; benches=45) | Yes | Source | |
| Qwen 2.5 Omni 3B | - | 0.04 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen2.5-omni-7b (score=0.4933; benches=45) | Yes | Source |