Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| DeepSeek V2 (2024-06-28) | 28 Jun 2024 | 79.70 | LLM Stats (ZeroEval) | inferred family alias from deepseek-v3 (score=0.4159; benches=20) | Yes | Source | |
| DeepSeek OCR | 20 Oct 2025 | 79.70 | LLM Stats (ZeroEval) | inferred family alias from deepseek-v3 (score=0.3000; benches=20) | Yes | Source | |
| DeepSeek V4 | - | 79.70 | LLM Stats (ZeroEval) | inferred high-confidence family alias from deepseek-v3 (score=0.5818; benches=20) | Yes | Source | |
| Gemini 2.5 Computer Use Preview | 07 Oct 2025 | 72.70 | LLM Stats (ZeroEval) | inferred family alias from gemini-2.5-pro (score=0.3960; benches=16) | Yes | Source | |
| Gemini 2.5 Pro Experimental (2025-03-25) | 25 Mar 2025 | 72.70 | LLM Stats (ZeroEval) | inferred alias from gemini-2.5-pro | Yes | Source | |
| Gemini 2.5 Pro Preview TTS (2025-12-10) | 10 Dec 2025 | 72.70 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-pro | Yes | Source | |
| Gemini Embedding 2 Preview | 10 Mar 2026 | 72.70 | LLM Stats (ZeroEval) | manual fallback alias from gemini-2.5-pro | Yes | Source | |
| o3 mini | 30 Jan 2025 | 60.40 | LLM Stats (ZeroEval) | Yes | Source | |
| o4 mini Deep Research | 26 Jun 2025 | 58.20 | LLM Stats (ZeroEval) | inferred modality/version alias from o4-mini | Yes | Source | |
| o4 Mini | 16 Apr 2025 | 58.20 | LLM Stats (ZeroEval) | Yes | Source | |
| Gemini 2.5 Flash Preview (2025-09-25) | 25 Sept 2025 | 56.70 | LLM Stats (ZeroEval) | inferred alias from gemini-2.5-flash | Yes | Source | |
| Gemini 2.5 Flash Preview TTS (2025-05-20) | 20 May 2025 | 56.70 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-flash | Yes | Source | |
| Gemini 2.5 Flash Exp Native Audio Thinking Dialog | - | 56.70 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-flash | Yes | Source | |
| Gemini 2.5 Flash Image Preview (Nano Banana) | 25 Aug 2025 | 56.70 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-flash | Yes | Source | |
| Gemini 2.5 Flash Image (Nano Banana) | 02 Oct 2025 | 56.70 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-flash | Yes | Source | |
| Gemini Live 2.5 Flash Preview | 09 Apr 2025 | 56.70 | LLM Stats (ZeroEval) | inferred high-confidence family alias from gemini-2.5-flash (score=0.5083; benches=14) | Yes | Source | |
| Gemini 2.5 Flash Native Audio Preview (2025-09-23) | - | 56.70 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-flash | Yes | Source | |
| Gemini 2.5 Flash Preview Native Audio Dialog | - | 56.70 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-flash | Yes | Source | |
| Gemini 2.5 Flash Preview TTS (2025-12-10) | 10 Dec 2025 | 56.70 | LLM Stats (ZeroEval) | inferred modality/version alias from gemini-2.5-flash | Yes | Source | |
| GPT 4.5 | 27 Feb 2025 | 44.90 | LLM Stats (ZeroEval) | Yes | Source | |
| GPT 4o Audio (2024-10-01) | 01 Oct 2024 | 18.20 | LLM Stats (ZeroEval) | inferred modality/version alias from gpt-4o-2024-08-06 | Yes | Source | |
| GPT 4o Audio (2025-06-03) | 03 Jun 2025 | 18.20 | LLM Stats (ZeroEval) | inferred modality/version alias from gpt-4o-2024-08-06 | Yes | Source | |
| GPT 4o (2024-08-06) | 06 Aug 2024 | 18.20 | LLM Stats (ZeroEval) | Yes | Source | |
| GPT 4o Audio (2024-12-17) | 17 Dec 2024 | 18.20 | LLM Stats (ZeroEval) | inferred modality/version alias from gpt-4o-2024-08-06 | Yes | Source | |
| GPT 4o Realtime Preview (2024-10-01) | 01 Oct 2024 | 18.20 | LLM Stats (ZeroEval) | inferred modality/version alias from gpt-4o-2024-08-06 | Yes | Source | |
| GPT 4o Realtime Preview (2025-06-03) | 03 Jun 2025 | 18.20 | LLM Stats (ZeroEval) | inferred modality/version alias from gpt-4o-2024-08-06 | Yes | Source | |
| GPT 4o Search Preview | 11 Mar 2025 | 18.20 | LLM Stats (ZeroEval) | inferred modality/version alias from gpt-4o-2024-08-06 | Yes | Source | |
| GPT 4o Transcribe | 20 Mar 2025 | 18.20 | LLM Stats (ZeroEval) | inferred modality/version alias from gpt-4o-2024-08-06 | Yes | Source | |
| GPT 4o Transcribe Diarize | 15 Oct 2025 | 18.20 | LLM Stats (ZeroEval) | inferred modality/version alias from gpt-4o-2024-08-06 | Yes | Source |