Capabilities, modalities, and lifecycle fields pulled from the model database.
Comparative results across benchmarks shared by the selected models.
| GPQA Diamond | 66.3% |
| Graphwalks bfs <128k | 61.7% |
| LMArena Text | 140200.0% |
| EQ-Bench 3 | 123480.0% |
| Graphwalks parents <128k | 58.0% |
| SWE-Lancer | 35.1% |
| LiveBench | 55.9% |
| ARC-AGI-1 | 5.5% |
| ARC-AGI-2 | 0.4% |
| LMArena WebDev | 125652.0% |
| SWE-Bench | 54.6% |
| LongFact-Concepts hallucination rate | 0.7% |
| OpenAI-MRCR: 2 needle 256k | 56.2% |
| BrowseComp Long Context 128k | 85.9% |
| BrowseComp Long Context 256k | 75.5% |
| FActScore hallucination rate | 6.7% |
| LongFact-Objects hallucination rate | 1.1% |
| NYT Connections | 23.6% |
| Ai2 SciArena | 103726.0% |
| AidanBench | 137900.0% |
| VideoMME | 78.7% |
| Aider-Polyglot | 52.4% |
| AIME 2024 | 48.1% |
| OpenAI-MRCR: 2 needle 128k | 57.2% |
| SimpleBench | 27.0% |
Observed provider pricing per million tokens.
All unique meters observed across the selected models.
| Meter | GPT 4.1 |
|---|---|
| Input Text Tokens | $2.00 |
Providers that expose each model based on observed pricing data.
Plans that include each selected model, grouped by organisation.
2 plans
Maximum input and output token capacity.
Usage and distribution terms.
Model release chronology.
Most recent training data date (when available).
A deeper field-by-field view (including benchmarks, pricing, and links).
| General Information | |
| Context Window | Input: 1,047,576 Output: 32,768 |
| Modalities | In: Vision, Text Out: Text |
| Reasoning | - |
| Web access | - |
| Parameters | - |
| Training Tokens | - |
| License | Proprietary |
| Knowledge Cutoff | May 2024 |
| Status | Available |
| Release | Apr 2025 |
| Announced | Apr 2025 |
| Deprecation | - |
| Retirement | - |
| Links | - |
| Operational Metrics | |
| Cost per 1M Tokens | Input: $2.00 Output: $8.00 |
| Latency | - |
| Throughput | - |
| Benchmarks | |
| AIME 2024 | |
| ARC-AGI-1 | |
| ARC-AGI-2 | |
| Ai2 SciArena | |
| AidanBench | |
| Aider-Polyglot | |
| BrowseComp Long Context 128k | |
| BrowseComp Long Context 256k | |
| EQ-Bench 3 | |
| FActScore hallucination rate | |
| GPQA Diamond | |
| Graphwalks bfs <128k | |
| Graphwalks parents <128k | |
| LMArena Text | |
| LMArena WebDev | |
| LiveBench | |
| LongFact-Concepts hallucination rate | |
| LongFact-Objects hallucination rate | |
| NYT Connections | |
| OpenAI-MRCR: 2 needle 128k | |
| OpenAI-MRCR: 2 needle 256k | |
| SWE-Bench | |
| SWE-Lancer | |
| SimpleBench | |
| VideoMME | |