Quickstart

Start calling this model with endpoint-specific examples.

Benchmarks

Headline benchmark standings and comparison context.

About

Key dates, capabilities, and model metadata.

Qwen: Qwen 3 32B

Qwen 3 32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue.

Overview Playground Providers Pricing Performance Apps Activity Quickstart Benchmarks Family Timeline

Pricing

Effective pricing across providers over the past hour and 30-day pricing history by meter.

Effective Pricing

Based on the last hour of usage: we total provider spend, divide by total token volume, and express it as USD per 1M tokens. Providers with more traffic naturally carry more weight.

Weighted Avg Input Price

per 1M tokens (past hour)

Weighted Avg Output Price

per 1M tokens (past hour)

Provider	Input $/Million	Output $/Million	Cache Token %
Alibaba Cloud	$0.160	$0.640	--
Amazon Bedrock	$0.155	$0.618	--
AtlasCloud	$0.100	$1.200	--
DeepInfra	$0.080	$0.280	--
DigitalOcean	$0.250	$0.550	--
GMICloud	$0.100	$0.600	--
Groq	$0.290	$0.590	--
Nebius Token Factory	$0.100	$0.300	--
Nebius Token Factory (Fast)	$0.200	$0.600	--
NovitaAI	$0.100	$0.450	--
OVHcloud	$0.080	$0.230	--
SiliconFlow	$0.140	$0.570	--

Pricing History

Meter

No 30-day pricing history is available for the selected filters.