Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Seed 2.0 Pro | 14 Feb 2026 | 89.50 | Seed2 official benchmark table | VideoMME ‡ | Yes | Source | |
| Seed 2.0 Lite | 14 Feb 2026 | 87.70 | Seed2 official benchmark table | VideoMME ‡ | Yes | Source | |
| Kimi K2.5 | 27 Jan 2026 | 87.40 | LLM Stats (ZeroEval) | Yes | Source | |
| Gemini 2.5 Computer Use Preview | 07 Oct 2025 | 84.80 | LLM Stats (ZeroEval) | inferred family alias from gemini-2.5-pro (score=0.3960; benches=16) | Yes | Source | |
| Gemini 2.5 Pro Experimental (2025-03-25) | 25 Mar 2025 | 84.80 | LLM Stats (ZeroEval) | inferred alias from gemini-2.5-pro | Yes | Source | |
| Gemini 2.5 Pro Preview TTS (2025-12-10) | 10 Dec 2025 | 84.80 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-pro | Yes | Source | |
| Gemini Embedding 2 Preview | 10 Mar 2026 | 84.80 | LLM Stats (ZeroEval) | manual fallback alias from gemini-2.5-pro | Yes | Source | |
| Qwen 3.6 Plus | 01 Apr 2026 | 84.20 | LLM Stats (ZeroEval) | Yes | Source | |
| Seed 2.0 Mini | 14 Feb 2026 | 81.20 | Seed2 official benchmark table | VideoMME ‡ | Yes | Source | |
| Gemini 1.5 Pro 001 | 23 May 2024 | 78.60 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-pro | Yes | Source | |
| Gemini 1.5 Pro Exp (2024-08-27) | 27 Aug 2024 | 78.60 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-pro | Yes | Source | |
| Gemini 1.5 Pro Exp (2024-08-01) | 01 Aug 2024 | 78.60 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-pro | Yes | Source | |
| Gemini Robotics ER 1.5 Preview | 25 Sept 2025 | 78.60 | LLM Stats (ZeroEval) | inferred family alias from gemini-1.5-pro (score=0.3717; benches=23) | Yes | Source | |
| LearnLM 1.5 Pro Experimental | 19 Nov 2024 | 78.60 | LLM Stats (ZeroEval) | inferred family alias from gemini-1.5-pro (score=0.3700; benches=23) | Yes | Source | |
| Gemini 1.5 Flash 001 | 23 May 2024 | 76.10 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-flash | Yes | Source | |
| Gemini 1.5 Flash Preview | 14 May 2024 | 76.10 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-flash | Yes | Source | |
| Qwen 3 VL 30B A3B Instruct | - | 74.50 | LLM Stats (ZeroEval) | Yes | Source | |
| Qwen 3 VL 30B A3B Thinking | - | 73.30 | LLM Stats (ZeroEval) | Yes | Source | |
| Qwen 3 8B | - | 71.80 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen3-vl-8b-thinking (score=0.4600; benches=50) | Yes | Source | |
| Qwen 3 Embedding 8B | - | 71.80 | LLM Stats (ZeroEval) | inferred family alias from qwen3-vl-8b-thinking (score=0.3850; benches=50) | Yes | Source | |
| Qwen 3 Guard Gen 8B | - | 71.80 | LLM Stats (ZeroEval) | inferred family alias from qwen3-vl-8b-thinking (score=0.3400; benches=50) | Yes | Source | |
| Qwen 3 Guard Stream 8B | - | 71.80 | LLM Stats (ZeroEval) | inferred family alias from qwen3-vl-8b-thinking (score=0.3371; benches=50) | Yes | Source | |
| Qwen 3 Reranker 8B | - | 71.80 | LLM Stats (ZeroEval) | inferred family alias from qwen3-vl-8b-thinking (score=0.3850; benches=50) | Yes | Source | |
| Qwen 3 VL 8B Thinking | - | 71.80 | LLM Stats (ZeroEval) | Yes | Source | |
| Qwen 3 VL Embedding 8B | - | 71.80 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen3-vl-8b-thinking (score=0.5232; benches=50) | Yes | Source | |
| Qwen 3 VL Reranker 8B | - | 71.80 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen3-vl-8b-thinking (score=0.5275; benches=50) | Yes | Source | |
| Qwen 3 VL 8B Instruct | - | 71.40 | LLM Stats (ZeroEval) | Yes | Source | |
| Gemini 1.5 Flash 8B | 15 Mar 2024 | 66.20 | LLM Stats (ZeroEval) | Yes | Source | |
| Gemini 1.5 Flash 8B Exp (2024-09-24) | 24 Sept 2024 | 66.20 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-flash-8b | Yes | Source | |
| Gemini 1.5 Flash 8B Exp (2024-08-27) | 27 Aug 2024 | 66.20 | LLM Stats (ZeroEval) | inferred alias from gemini-1.5-flash-8b | Yes | Source | |
| Phi 4 multimodal instruct | 01 Feb 2025 | 55.00 | LLM Stats (ZeroEval) | Yes | Source |