This website is not yet fully optimised for mobile viewing. Some features may not display or function as intended.
DeepSeek VL2 Small
DeepSeekOverview
No description provided. Want to help? Contribute on GitHub or click here!
Quick Links
Key Metrics
Max Input
129.28K
Tokens
Max Output
129.28K
Tokens
Input Price
-
Per 1M Tokens
Output Price
-
Per 1M Tokens
Throughput
-
tok/s
Latency
-
ms
Model Information
Release Details
Released
13 Dec 2024
Knowledge Cutoff
-
License
deepseek
Model Architecture
Parameters
-
Training Data
-
Context Window
Input Context Length
129,280 tokens
Output Context Length
129,280 tokens
Key Features
Web Access
No
Real-time access to current web information
Multimodal
Yes
Ability to process multiple data types (text, images, etc.)
Reasoning
Unknown
Advanced logical and deductive reasoning capabilities
Fine-Tunable
Unknown
Can be customized for specific use cases
Model Release & Updates
13 Dec 2024
Model Released
Model first made available to the public
Benchmarks & Performance Comparison
GPQA
-
LisanBench
-
AidanBench
-
Aider-Polyglot
-
LiveBench
-
AIME 2024
-
ARC-AGI-1
-
ARC-AGI-2
-
AIME 2025
-
Humanity's Last Exam
-