Capabilities, modalities, and lifecycle fields pulled from the model database.
Comparative results across benchmarks shared by the selected models.
| Terminal Bench | 32.5% |
| MMLU-Pro | 80.9% |
| AIME 2025 | 91.0% |
| IF-Bench | 70.8% |
| Tau 2 Telecom | 76.0% |
| Tau 2 Airline | 64.8% |
| BFCL Overall FC V4 | 60.3% |
| LiveCodeBench Coding | 71.0% |
| GPQA Diamond | 79.6% |
| MMMU Pro | 61.8% |
| SWE-Bench | 64.5% |
| OCRBench V2 | 56.1% |
| Realkie | 62.1% |
| MMLU Pro | 76.6% |
| QVHighlights | 77.2% |
| Tau 2 Retail | 76.5% |
| Scale MCP Atlas | 24.6% |
| ScreenSpot | 83.3% |
| LongCodeBench 1M | 84.0% |
Observed provider pricing per million tokens.
No providers found yet.
Maximum input and output token capacity.
Usage and distribution terms.
Model release chronology.
A deeper field-by-field view (including benchmarks, pricing, and links).
| General Information | |
| Context Window | Input: 1,000,000 Output: - |
| Modalities | In: Text, Vision, Video Out: Text |
| Reasoning | - |
| Web access | - |
| Parameters | - |
| Training Tokens | - |
| License | Proprietary |
| Knowledge Cutoff | - |
| Status | Available |
| Release | Dec 2025 |
| Announced | Dec 2025 |
| Deprecation | - |
| Retirement | - |
| Links | |
| Operational Metrics | |
| Cost per 1M Tokens | Input: - Output: - |
| Latency | - |
| Throughput | - |
| Benchmarks | |
| AIME 2025 | |
| BFCL Overall FC V4 | |
| GPQA Diamond | |
| IF-Bench | |
| LiveCodeBench Coding | |
| LongCodeBench 1M | |
| MMLU Pro | |
| MMLU-Pro | |
| MMMU Pro | |
| OCRBench V2 | |
| QVHighlights | |
| Realkie | |
| SWE-Bench | |
| Scale MCP Atlas | |
| ScreenSpot | |
| Tau 2 Airline | |
| Tau 2 Retail | |
| Tau 2 Telecom | |
| Terminal Bench | |