Search...
Ctrl K
Models
Providers
Rankings
Chat
Models
Providers
Rankings
Chat
Search...
Ctrl K
Sign In
Sign In
Llama 3.1 8B Instruct Providers - Availability & Pricing Details | AI Stats
Llama 3.1 8B Instruct
Meta
Chat
Compare
Overview
Timeline
Benchmarks
Providers
Quickstart
Performance
Providers
Providers
API
Subscriptions
Standard Tier
All Quantizations
Default
Cerebras
FP16
Total Context
32.0K
Max output
8K
Latency
0.44s
Throughput
571
tps
Uptime
Input
$0.1
Output
$0.1
NovitaAI
FP8
Total Context
16.4K
Max output
16K
Latency
0.72s
Throughput
85.2
tps
Uptime
Input
$0.02
Output
$0.05
Inactive Providers
DeepInfra
Not Available
BF16
Latency
--
Throughput
--
Uptime
Input
$0.02
Output
$0.05
Nebius Token Factory
Not Available
Latency
--
Throughput
--
Uptime
Input
$0.03
Output
$0.09
Weights & Biases
Not Available
Latency
--
Throughput
--
Uptime
Input
$0.22
Output
$0.22