Llama 3.1 8B Instruct Providers - Availability & Pricing Details

Cerebras

Total Context: 32.0K
Max Output: 8K
Latency: 0.00s
Throughput: 0.0tps
Pricing (per 1M tokens): Input $0.1, Output $0.1

DeepInfra

Total Context: 131.1K
Max Output: 131K
Latency: 0.00s
Throughput: 0.0tps
Pricing (per 1M tokens): Input $0.02, Output $0.03

Groq

Total Context: 131.1K
Max Output: 131K
Latency: 0.00s
Throughput: 0.0tps
Pricing (per 1M tokens): Input $0.05, Output $0.08, Cache Reads $0.025

NovitaAI

Total Context: 16.4K
Max Output: 16K
Latency: 0.00s
Throughput: 0.0tps
Pricing (per 1M tokens): Input $0.02, Output $0.05

Weights & Biases

Latency: 0.00s
Throughput: 0.0tps
Pricing (per 1M tokens): Input $0.22, Output $0.22

Cloudflare

Latency: 0.00s
Throughput: 0.0tps
Pricing (per 1M tokens): Input $0.123, Output $0.266
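
Because every price in the listing is quoted per 1M tokens, the cost of a single request follows from simple arithmetic. The sketch below is illustrative only: the `PRICES` table copies the figures listed above, while `estimate_cost`, `cheapest_provider`, and the `cached_tokens` parameter are hypothetical names introduced here. It assumes cached tokens are a subset of the input tokens and are billed at the cache-read rate where one is listed (only Groq in this listing), which may not match how any given provider actually bills.

```python
# Per-1M-token prices in USD, copied from the provider listing above.
PRICES = {
    "Cerebras": {"input": 0.1, "output": 0.1},
    "DeepInfra": {"input": 0.02, "output": 0.03},
    "Groq": {"input": 0.05, "output": 0.08, "cache_read": 0.025},
    "NovitaAI": {"input": 0.02, "output": 0.05},
    "Weights & Biases": {"input": 0.22, "output": 0.22},
    "Cloudflare": {"input": 0.123, "output": 0.266},
}

TOKENS_PER_UNIT = 1_000_000  # prices are quoted per 1M tokens


def estimate_cost(provider, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens are part of input_tokens and are billed at
    the provider's cache-read rate if listed, otherwise at the input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    price = PRICES[provider]
    cache_rate = price.get("cache_read", price["input"])
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * price["input"]
        + cached_tokens * cache_rate
        + output_tokens * price["output"]
    )
    return total / TOKENS_PER_UNIT


def cheapest_provider(input_tokens, output_tokens):
    """Return the provider name with the lowest estimated cost."""
    return min(PRICES, key=lambda p: estimate_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 200_000, 50_000):.4f}")
```

Price is only one axis of comparison: the context and output limits listed per provider may rule out the cheapest option for long prompts.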
