AI Stats
SimpleQA
Total Models: 14
Average Score: 33.84%
Score Range: 12.10% - 54.30%
Max Score Achievable: 1 (100%)
Top 10 Model Performance (top 10 of 14 models)
Models Using This Benchmark (14)
Google (7 models)
  Gemini 2.5 Pro Preview (google): 54.00%
  Gemini 2.5 Pro Experimental (google): 52.90%
  Gemini 2.5 Pro Preview (google): 50.80%
  Gemini 2.0 Flash (google): 29.90%
  Gemini 2.5 Flash Preview (google): 29.70%
  Gemini 2.5 Flash Preview (google): 26.90%
  Gemini 2.5 Flash Lite Preview (google): 13.00%

Moonshot (2 models)
  Kimi K2 Base (moonshotai): 35.30%
  Kimi K2 Instruct (moonshotai): 31.00%

xAI (2 models)
  Grok 3 Beta (x-ai): 43.60%
  Grok 3 Mini Beta (x-ai): 21.70%

MiniMax (1 model)
  MiniMax M1 (minimax): 18.50%

Mistral (1 model)
  Mistral Small 3.2 (mistral): 12.10%

Qwen (1 model)
  Qwen3 235B A22B Instruct 2507 (qwen): 54.30%
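
The summary figures at the top of the page (total models, average score, score range) can be checked against the listed scores. The following is a minimal sketch, assuming the average is a plain unweighted mean of the 14 percentages rounded to two decimals; the variable names are illustrative and not part of the site.

```python
from statistics import mean

# Scores in percent, copied from the model list above in listed order.
scores = [
    54.00, 52.90, 50.80, 29.90, 29.70, 26.90, 13.00,  # Google
    35.30, 31.00,                                      # Moonshot
    43.60, 21.70,                                      # xAI
    18.50,                                             # MiniMax
    12.10,                                             # Mistral
    54.30,                                             # Qwen
]

total_models = len(scores)
# Assumption: unweighted mean, rounded to two decimal places.
average_score = round(mean(scores), 2)
score_range = (min(scores), max(scores))

print(total_models)    # 14
print(average_score)   # 33.84
print(score_range)     # (12.1, 54.3)
```

Under that assumption, the computed values agree with the page's reported total of 14, average of 33.84, and range of 12.10 - 54.30.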