Unified model, provider, and gateway data for teams building with AI APIs.


© 2025 • AI Stats

SWE-Bench Multimodal - Benchmark Leaderboard & Model Performance | AI Stats

SWE-Bench Multimodal

Type: percentage

Code


Recorded Results: 1

Average Score: 34.50%

Score Range: 34.50% - 34.50%

Leading Model: Claude Opus 4.7 (34.50%)

[Figure: Scores Over Time. Individual benchmark scores plotted by date.]

Models Using This Benchmark

Organisation	Model	Reported	Top Score	Self Reported
Anthropic	Claude Opus 4.7	16 Apr 2026	34.50%	Yes