GPT-OSS 20B vs Llama 4 Scout

Side-by-side API pricing comparison · both models served by Groq

🏆 GPT-OSS 20B is 3.2% cheaper on blended cost ($0.5375 vs $0.555/Mtok)

GPT-OSS 20B

by Groq

Current fast Open weights

Input

$0.075/Mtok

Output

$1.00/Mtok

✓ Cheaper

Blended avg	$0.537/Mtok
Context	128K tokens
Modality	text
Parameters	20B
Released	Jan 1, 2026

Llama 4 Scout

by Groq

Current fast Open weights

Input

$0.110/Mtok

Output

$1.00/Mtok

Blended avg	$0.555/Mtok
Context	10M tokens
Modality	text, image
Parameters	17B active (16 experts)
Released	Apr 6, 2025

Cost at scale (50/50 input/output split)

Volume	GPT-OSS 20B	Llama 4 Scout	Savings
1M tokens	$0.54	$0.56	$0.02 (3.2%)
10M tokens	$5.38	$5.55	$0.18 (3.2%)
100M tokens	$53.75	$55.50	$1.75 (3.2%)
1,000M tokens	$537.50	$555.00	$17.50 (3.2%)
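
The figures in this table follow from a 50/50 blend of the listed input and output prices. Below is a minimal sketch of that arithmetic, assuming the savings percentage is measured relative to the more expensive model. The function names are illustrative and not part of any provider API.

```python
def blended_price(input_price, output_price, input_share=0.5):
    """Blended $/Mtok, where input_share is the fraction of tokens that are input."""
    return input_share * input_price + (1.0 - input_share) * output_price


def cost_at_volume(price_per_mtok, tokens):
    """Dollar cost of processing `tokens` tokens at a $/Mtok price."""
    return price_per_mtok * tokens / 1_000_000


# Listed prices in $/Mtok, taken from the comparison above.
gpt_oss_20b = blended_price(0.075, 1.00)
llama_4_scout = blended_price(0.110, 1.00)

# Savings relative to the more expensive model (an assumption about the page's method).
savings_pct = (llama_4_scout - gpt_oss_20b) / llama_4_scout * 100

for tokens in (1_000_000, 10_000_000, 100_000_000, 1_000_000_000):
    a = cost_at_volume(gpt_oss_20b, tokens)
    b = cost_at_volume(llama_4_scout, tokens)
    print(f"{tokens // 1_000_000}M tokens: ${a:,.2f} vs ${b:,.2f}, "
          f"saves ${b - a:,.2f} ({savings_pct:.1f}%)")
```

The percentage is the same at every volume because both costs scale linearly with token count. Any variation between rows can only come from rounding the dollar amounts before dividing.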

Summary

GPT-OSS 20B by Groq costs $0.075/Mtok input and $1.00/Mtok output, with a 128K-token context window. It supports text input.

Llama 4 Scout by Groq costs $0.110/Mtok input and $1.00/Mtok output, with a 10M-token context window. It supports text and image input.

On a blended cost basis (50/50 input/output), GPT-OSS 20B is 3.2% cheaper than Llama 4 Scout.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
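
Both models list the same output price, so the gap between them depends entirely on how input-heavy a workload is. The sketch below illustrates this using only the listed prices. It ignores prompt caching and batch discounts, and `savings_pct` is a hypothetical helper name.

```python
# Listed prices in $/Mtok as (input, output), taken from the comparison above.
GPT_OSS_20B = (0.075, 1.00)
LLAMA_4_SCOUT = (0.110, 1.00)


def mix_price(prices, input_share):
    """$/Mtok for a workload where `input_share` of all tokens are input tokens."""
    input_price, output_price = prices
    return input_share * input_price + (1.0 - input_share) * output_price


def savings_pct(input_share):
    """Percent saved by GPT-OSS 20B relative to Llama 4 Scout at a given mix."""
    cheap = mix_price(GPT_OSS_20B, input_share)
    pricey = mix_price(LLAMA_4_SCOUT, input_share)
    return (pricey - cheap) / pricey * 100


# Compare an output-only workload, the page's 50/50 mix, and an input-heavy one.
for share in (0.0, 0.5, 0.9):
    print(f"{share:.0%} input: GPT-OSS 20B saves {savings_pct(share):.1f}%")
```

An output-only workload shows no difference at all, while an input-heavy one (long prompts, short completions) widens the gap well beyond the 50/50 figure.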