AI Stats
LiveBench
About this Benchmark
Unfortunately there isn't a description for this benchmark yet.
Model Performance
[Chart: LiveBench scores, 16 models]
Models Using This Benchmark (16)
Anthropic (6 models)
Claude Opus 4: 72.9%
Claude Sonnet 4: 72.1%
Claude 3.7 Sonnet ...: 67.4%
Claude 3.7 Sonnet: 58.5%
Claude 3.5 Sonnet: 51.8%
Claude 3.5 Haiku: 39.5%
OpenAI (6 models)
DeepSeek (2 models)
xAI (2 models)