AI Stats
NYT Connections
Total Models: 45
Average Score: 32.68
Score Range: 2.20 - 92.40
Max Score Achievable: 1
Top 10 Model Performance (top 10 of 45)
Models Using This Benchmark (45)

OpenAI (14 models)
o3 Pro (openai): 87.30%
o1 pro (openai): 82.50%
o3 (openai): 79.50%
o4 Mini (openai): 74.70%
o1 (openai): 70.80%
o3-mini (openai): 61.40%
GPT-4.5 (openai): 34.20%
o1 mini (openai): 26.90%
GPT-4.1 (openai): 23.60%
GPT-4o (openai): 18.70%
GPT-4o (openai): 17.80%
GPT-4.1 Mini (openai): 15.10%
GPT-4o-mini (openai): 9.90%
GPT-4.1 Nano (openai): 8.60%

Anthropic (7 models)
Claude Opus 4 (anthropic): 52.70%
Claude Sonnet 4 (anthropic): 41.40%
Claude 3.7 Sonnet (anthropic): 33.60%
Claude 3 Opus (anthropic): 19.20%
Claude 3.5 Sonnet (anthropic): 17.70%
Claude 3.5 Haiku (anthropic): 10.30%
Claude 3 Haiku (anthropic): 2.20%

Google (5 models)
Gemini 2.5 Pro Preview (google): 58.70%
Gemini 2.5 Pro Experimental (google): 54.10%
Gemini 2.5 Pro Preview (google): 42.50%
Gemma 2 27B (google): 12.20%
Gemma 3 27B (google): 11.80%

DeepSeek (4 models)
R1 (deepseek): 49.80%
R1 (deepseek): 38.60%
DeepSeek-V3 0324 (deepseek): 17.40%
DeepSeek-V3 (deepseek): 15.10%

Meta (4 models)
Llama 4 Maverick (meta): 19.10%
Llama 4 Scout (meta): 17.90%
Llama 3.1 405B (base) (meta): 16.20%
Llama 3.3 70B Instruct (meta): 15.10%

xAI (4 models)
Grok 4 (x-ai): 92.40%
Grok 3 Mini Beta (x-ai): 30.90%
Grok 3 Beta (x-ai): 20.30%
Grok 2 (x-ai): 19.20%

Qwen (3 models)
Qwen3 235B A22B (qwen): 55.60%
Qwen3 30B A3B (qwen): 38.00%
Qwen2.5 72B Instruct (qwen): 11.10%

Amazon (1 model)
Nova Pro 1.0 (amazon): 10.10%

Cohere (1 model)
Command A (cohere): 13.60%

Microsoft (1 model)
Phi 4 (microsoft): 10.20%

Mistral (1 model)
Mistral Large 2 (mistral): 12.60%