Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
ScienceQA Visual - Benchmark Leaderboard & Model Performance | AI Stats
ScienceQA Visual
Overview
Overview
Type: percentage
Multimodal
Recorded Results
1
Average Score
97.50
Score Range
97.50 - 97.50
Leading Model
97.50 - Phi 4 multimodal instruct
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
Microsoft
Phi 4 multimodal instruct
01 Feb 2025
97.50
-
Yes
Source