Capabilities, modalities, and lifecycle fields pulled from the model database.
Comparative results across benchmarks shared by the selected models.
| Benchmark | Score |
| --- | --- |
| AIME 2024 | 79.8% |
| LMArena Text | 1393 |
| Ai2 SciArena | 1018.12 |
| Elimination Game | 5.362 |
| AidanBench | 1505 |
| EQ-Bench 3 | 1270.1 |
| SimpleBench | 30.9% |
| LMArena WebDev | 1198.32 |
| Thematic Generalisation | 1.8 |
| NYT Connections | 38.6% |
| Aider-Polyglot | 56.9% |
| ARC-AGI-1 | 15.8% |
| ARC-AGI-2 | 1.3% |
| LiveBench | 65.1% |
| Confabulations | 12.65 |
| GPQA Diamond | 71.5% |
Observed provider pricing per million tokens.
No providers found yet.
A deeper field-by-field view (including benchmarks, pricing, and links).
| Field | Value |
| --- | --- |
| General Information | |
| Context Window | Input: 128,000 Output: 131,072 |
| Modalities | In: Text Out: Text |
| Reasoning | - |
| Web access | - |
| Parameters | 671.0B |
| Training Tokens | 14.8T |
| License | MIT |
| Knowledge Cutoff | - |
| Status | Available |
| Release | Jan 2025 |
| Announced | Jan 2025 |
| Deprecation | - |
| Retirement | - |
| Links | |
| Operational Metrics | |
| Cost per 1M Tokens | Input: - Output: - |
| Latency | - |
| Throughput | - |
| Benchmarks | |
| AIME 2024 | |
| ARC-AGI-1 | |
| ARC-AGI-2 | |
| Ai2 SciArena | |
| AidanBench | |
| Aider-Polyglot | |
| Confabulations | |
| EQ-Bench 3 | |
| Elimination Game | |
| GPQA Diamond | |
| LMArena Text | |
| LMArena WebDev | |
| LiveBench | |
| NYT Connections | |
| SimpleBench | |
| Thematic Generalisation | |