Loading...
Loading...
AI Stats
Home
Comparisons
Providers
Models
Benchmarks
Prices
Open menu
Online Judgement Benchmark
Twitter
2
Total Models
29.80
Average Score
27.10 - 32.50
Score Range
1
Max Score Achievable
Top 10 Model Performance
2 models
Models Using This Benchmark
(2)
Moonshot
(1 model)
Kimi K2 Instruct
moonshotai
27.10%
Qwen
(1 model)
Qwen3 235B A22B Thinking 2507
qwen
32.50%