Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
SWE-Bench Multimodal - Benchmark Leaderboard & Model Performance | AI Stats
SWE-Bench Multimodal
Overview
Overview
Type: percentage
Code
View benchmark source
Recorded Results
1
Average Score
34.50%
Score Range
34.50% - 34.50%
Leading Model
34.50% - Claude Opus 4.7
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
Anthropic
Claude Opus 4.7
16 Apr 2026
34.50%
-
Yes
Source