This website is not yet fully optimised for mobile viewing. Some features may not display or function as intended.
DeepSeek-V3
DeepSeekOverview
No description provided. Want to help? Contribute on GitHub or click here!
Quick Links
Key Metrics
Max Input
131.072K
Tokens
Max Output
131.072K
Tokens
Input Price
-
Per 1M Tokens
Output Price
-
Per 1M Tokens
Throughput
-
tok/s
Latency
-
ms
Model Information
Release Details
Released
25 Dec 2024
Knowledge Cutoff
-
License
MIT + Model License (Commercial use allowed)
Model Architecture
Parameters
-
Training Data
-
Context Window
Input Context Length
131,072 tokens
Output Context Length
131,072 tokens
Key Features
Web Access
No
Real-time access to current web information
Multimodal
No
Ability to process multiple data types (text, images, etc.)
Reasoning
Unknown
Advanced logical and deductive reasoning capabilities
Fine-Tunable
Unknown
Can be customized for specific use cases
Model Release & Updates
25 Dec 2024
Model Released
Model first made available to the public
Benchmarks & Performance Comparison
GPQA
59.10%
LisanBench
-
AidanBench
-
Aider-Polyglot
-
LiveBench
-
AIME 2024
39.20%
ARC-AGI-1
-
ARC-AGI-2
-
AIME 2025
-
Humanity's Last Exam
-