Loading...
Loading...
AI Stats
Leaderboard
Comparisons
Providers
Models
Benchmarks
Prices
Open menu
This website is not yet fully optimised for mobile viewing. Some features may not display or function as intended.
Humanity's Last Exam
Twitter
About this Benchmark
Unfortunately there isn't a description for this benchmark yet.
Model Performance
Humanity's Last Exam
5 models
Models Using This Benchmark
(5)
Google
(2 models)
Gemini 2.5 Pro
18.8%
Gemini 2.5 Flash
12.1%
OpenAI
(2 models)
DeepSeek
(1 model)