Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
MultiChallenge (o3-mini grader) - Benchmark Leaderboard & Model Performance | AI Stats
MultiChallenge (o3-mini grader)
Overview
Overview
Type: percentage
Language
Recorded Results
3
Average Score
46.73
Score Range
39.90 - 50.20
Leading Model
50.20 - o3 mini
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
OpenAI
o3 mini
30 Jan 2025
50.20
LLM Stats (ZeroEval)
Yes
Source
OpenAI
GPT 4.5
27 Feb 2025
50.10
LLM Stats (ZeroEval)
Yes
Source
OpenAI
GPT 4o (2024-08-06)
06 Aug 2024
39.90
LLM Stats (ZeroEval)
Yes
Source