Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
DeepInfra Models - Ordered by Date Added | AI Stats
DeepInfra
Overview
Models
Models
Filter Parameters
68 models
DeepInfra: DeepSeek OCR
deepseek/deepseek-ocr
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.03
/ 1M tokens
Output Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Deepseek R1 (2025-05-28)
deepseek/deepseek-r1-0528
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.5
/ 1M tokens
Output Text Tokens:
$2.15
/ 1M tokens
Cached Read Text Tokens:
$0.35
/ 1M tokens
DeepInfra: DeepSeek V3 (2025-03-24)
deepseek/deepseek-v3-0324
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.2
/ 1M tokens
Output Text Tokens:
$0.77
/ 1M tokens
Cached Read Text Tokens:
$0.135
/ 1M tokens
DeepInfra: DeepSeek V3.1
deepseek/deepseek-v3.1
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.21
/ 1M tokens
Output Text Tokens:
$0.79
/ 1M tokens
Cached Read Text Tokens:
$0.13
/ 1M tokens
DeepInfra: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.21
/ 1M tokens
Output Text Tokens:
$0.79
/ 1M tokens
Cached Read Text Tokens:
$0.13
/ 1M tokens
DeepInfra: DeepSeek V3.2
deepseek/deepseek-v3.2
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.26
/ 1M tokens
Output Text Tokens:
$0.38
/ 1M tokens
Cached Read Text Tokens:
$0.13
/ 1M tokens
DeepInfra: Gemma 3 12B
google/gemma-3-12b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.04
/ 1M tokens
Output Text Tokens:
$0.13
/ 1M tokens
DeepInfra: Gemma 3 27B
google/gemma-3-27b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.08
/ 1M tokens
Output Text Tokens:
$0.16
/ 1M tokens
DeepInfra: Gemma 3 4B
google/gemma-3-4b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.04
/ 1M tokens
Output Text Tokens:
$0.08
/ 1M tokens
DeepInfra: GLM 4.6
z-ai/glm-4.6
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.43
/ 1M tokens
Output Text Tokens:
$1.74
/ 1M tokens
Cached Read Text Tokens:
$0.08
/ 1M tokens
DeepInfra: GLM 4.6V
z-ai/glm-4.6v
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.3
/ 1M tokens
Output Text Tokens:
$0.9
/ 1M tokens
DeepInfra: GLM 4.7
z-ai/glm-4.7
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.4
/ 1M tokens
Output Text Tokens:
$1.75
/ 1M tokens
Cached Read Text Tokens:
$0.08
/ 1M tokens
DeepInfra: GLM 4.7 Flash
z-ai/glm-4.7-flash
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.06
/ 1M tokens
Output Text Tokens:
$0.4
/ 1M tokens
Cached Read Text Tokens:
$0.01
/ 1M tokens
DeepInfra: GLM 5
z-ai/glm-5
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.8
/ 1M tokens
Output Text Tokens:
$2.56
/ 1M tokens
Cached Read Text Tokens:
$0.16
/ 1M tokens
DeepInfra: google/embeddinggemma-300m
google/embeddinggemma-300m
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.002
/ 1M tokens
DeepInfra: GPT OSS 120b
openai/gpt-oss-120b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.039
/ 1M tokens
Output Text Tokens:
$0.19
/ 1M tokens
DeepInfra: GPT OSS 20b
openai/gpt-oss-20b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.03
/ 1M tokens
Output Text Tokens:
$0.14
/ 1M tokens
DeepInfra: Hermes 3 Llama 3.1 405B
nous/hermes-3-llama-3.1-405b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Total Tokens:
$1
/ 1M tokens
DeepInfra: Hermes 3 Llama 3.1 70B
nousresearch/hermes-3-llama-3.1-70b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Kimi K2 (2025-09-05)
moonshotai/kimi-k2-0905
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.4
/ 1M tokens
Output Text Tokens:
$2
/ 1M tokens
Cached Read Text Tokens:
$0.15
/ 1M tokens
DeepInfra: Kimi K2 Thinking
moonshotai/kimi-k2-thinking
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.47
/ 1M tokens
Output Text Tokens:
$2
/ 1M tokens
Cached Read Text Tokens:
$0.141
/ 1M tokens
DeepInfra: Kimi K2.5
moonshotai/kimi-k2.5
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.45
/ 1M tokens
Output Text Tokens:
$2.25
/ 1M tokens
Cached Read Text Tokens:
$0.07
/ 1M tokens
DeepInfra: Llama 3 8B Instruct
meta/llama-3-8b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.03
/ 1M tokens
Output Text Tokens:
$0.04
/ 1M tokens
DeepInfra: Llama 3.1 70B Instruct
meta/llama-3.1-70b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Total Tokens:
$0.4
/ 1M tokens
DeepInfra: Llama 3.1 8B Instruct
meta/llama-3.1-8b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.02
/ 1M tokens
Output Text Tokens:
$0.05
/ 1M tokens
DeepInfra: Llama 3.2 3B Instruct
meta/llama-3.2-3b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Llama 4 Maverick
meta/llama-4-maverick
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.15
/ 1M tokens
Output Text Tokens:
$0.6
/ 1M tokens
DeepInfra: Llama 4 Scout
meta/llama-4-scout
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.08
/ 1M tokens
Output Text Tokens:
$0.3
/ 1M tokens
DeepInfra: meta-llama/Llama-Guard-4-12B
meta/llama-guard-4-12b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Total Tokens:
$0.18
/ 1M tokens
DeepInfra: MiniMax M2.1
minimax/minimax-m2.1
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.27
/ 1M tokens
Output Text Tokens:
$0.95
/ 1M tokens
Cached Read Text Tokens:
$0.029
/ 1M tokens
DeepInfra: MiniMax M2.5
minimax/minimax-m2.5
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.27
/ 1M tokens
Output Text Tokens:
$0.95
/ 1M tokens
Cached Read Text Tokens:
$0.03
/ 1M tokens
DeepInfra: Mistral Small 3.2
mistral/mistral-small-3.2
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.075
/ 1M tokens
Output Text Tokens:
$0.2
/ 1M tokens
DeepInfra: mistralai/Mistral-Nemo-Instruct-2407
mistral/mistral-nemo-2407
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: mistralai/Mistral-Small-24B-Instruct-2501
mistral/mistral-small-24b-2501
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Mixtral 8x7b
mistral/mixtral-8x7b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Total Tokens:
$0.54
/ 1M tokens
DeepInfra: Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.1
/ 1M tokens
Output Text Tokens:
$0.5
/ 1M tokens
Cached Read Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Nemotron Nano 3 30B A3B
nvidia/nemotron-3-nano-30b-a3b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.05
/ 1M tokens
Output Text Tokens:
$0.2
/ 1M tokens
DeepInfra: Nvidia Nemotron Nano 9B V2
nvidia/nvidia-nemotron-nano-9b-v2
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL
nvidia/nvidia-nemotron-nano-12b-v2-vl
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Olmo 3.1 32B Instruct
allenai/olmo-3.1-32b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.2
/ 1M tokens
Output Text Tokens:
$0.6
/ 1M tokens
DeepInfra: Phi 4
microsoft/phi-4
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.07
/ 1M tokens
Output Text Tokens:
$0.14
/ 1M tokens
DeepInfra: Qwen 2.5 72B
qwen/qwen2.5-72b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 2.5 VL 32B Instruct
qwen/qwen2.5-vl-32b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 14B
qwen/qwen3-14b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 30B A3B
qwen/qwen3-30b-a3b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 32B
qwen/qwen3-32b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 A235 A22B Instruct 2507
qwen/qwen3-235b-a22b-2507
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.071
/ 1M tokens
Output Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Qwen 3 Coder 480B A35B Instruct
qwen/qwen3-coder-480b-a35b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 Embedding 0.6B
qwen/qwen3-embedding-0.6b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 Embedding 4B
qwen/qwen3-embedding-4b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 Max Thinking
qwen/qwen3-max-thinking
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$1.2
/ 1M tokens
Output Text Tokens:
$6
/ 1M tokens
Cached Read Text Tokens:
$0.24
/ 1M tokens
DeepInfra: Qwen 3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instruct
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instruct
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instruct
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
-
DeepInfra: Qwen 3.5 0.8B
qwen/qwen3.5-0.8b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.01
/ 1M tokens
Output Text Tokens:
$0.05
/ 1M tokens
DeepInfra: Qwen 3.5 122B A10B
qwen/qwen3.5-122b-a10b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.29
/ 1M tokens
Output Text Tokens:
$2.9
/ 1M tokens
DeepInfra: Qwen 3.5 27B
qwen/qwen3.5-27b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.26
/ 1M tokens
Output Text Tokens:
$2.6
/ 1M tokens
DeepInfra: Qwen 3.5 2B
qwen/qwen3.5-2b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.02
/ 1M tokens
Output Text Tokens:
$0.1
/ 1M tokens
DeepInfra: Qwen 3.5 35B A3B
qwen/qwen3.5-35b-a3b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.22
/ 1M tokens
Output Text Tokens:
$2.2
/ 1M tokens
DeepInfra: Qwen 3.5 397B A17B
qwen/qwen3.5-397b-a17b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.54
/ 1M tokens
Output Text Tokens:
$3.4
/ 1M tokens
DeepInfra: Qwen 3.5 4B
qwen/qwen3.5-4b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.03
/ 1M tokens
Output Text Tokens:
$0.15
/ 1M tokens
DeepInfra: Qwen 3.5 9B
qwen/qwen3.5-9b
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.04
/ 1M tokens
Output Text Tokens:
$0.2
/ 1M tokens
DeepInfra: Qwen/Qwen3-Max
qwen/qwen3-max
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$1.2
/ 1M tokens
Output Text Tokens:
$6
/ 1M tokens
Cached Read Text Tokens:
$0.24
/ 1M tokens
DeepInfra: Seed 1.8
bytedance/seed-1.8
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.25
/ 1M tokens
Output Text Tokens:
$2
/ 1M tokens
Cached Read Text Tokens:
$0.05
/ 1M tokens
DeepInfra: Seed 2.0 Mini
bytedance/seed-2.0-mini
Modalities
Input
-
Output
-
Supported Parameters
-
Pricing
Input Text Tokens:
$0.1
/ 1M tokens
Output Text Tokens:
$0.4
/ 1M tokens
Cached Read Text Tokens:
$0.02
/ 1M tokens
DeepInfra: Step 3.5 Flash
stepfun/step-3.5-flash
Modalities
Input
Text
Output
Text
Supported Parameters
-
Pricing
Input Text Tokens:
$0.1
/ 1M tokens
Output Text Tokens:
$0.3
/ 1M tokens
Cached Read Text Tokens:
$0.02
/ 1M tokens