This website is not yet fully optimised for mobile viewing. Some features may not display or function as intended.
CN
DeepSeek

DeepSeek VL2 Small

DeepSeek

Overview

No description provided. Want to help? Contribute on GitHub or click here!

Quick Links

Key Metrics
Max Input
129.28K
Tokens
Max Output
129.28K
Tokens
Input Price
-
Per 1M Tokens
Output Price
-
Per 1M Tokens
Throughput
-
tok/s
Latency
-
ms
Model Information
Release Details

Released

13 Dec 2024

Knowledge Cutoff

-

License

deepseek

Model Architecture

Parameters

-

Training Data

-

Context Window

Input Context Length

129,280 tokens

Output Context Length

129,280 tokens

Key Features
Web Access
No

Real-time access to current web information

Multimodal
Yes

Ability to process multiple data types (text, images, etc.)

Reasoning
Unknown

Advanced logical and deductive reasoning capabilities

Fine-Tunable
Unknown

Can be customized for specific use cases

Model Release & Updates
13 Dec 2024
Model Released
Model first made available to the public
Benchmarks & Performance Comparison
GPQA
-
LisanBench
-
AidanBench
-
Aider-Polyglot
-
LiveBench
-
AIME 2024
-
ARC-AGI-1
-
ARC-AGI-2
-
AIME 2025
-
Humanity's Last Exam
-