LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / GPT-OSS 20B vs Llama 4 Scout

GPT-OSS 20B vs Llama 4 Scout

Side-by-side API pricing comparison · Groq vs Groq

🏆 GPT-OSS 20B is 3.3% cheaper on blended cost ($0.537 vs $0.555/Mtok)

GPT-OSS 20B

by Groq

Current fast Open weights
Input
$0.075/Mtok
Output
$1.00/Mtok
✓ Cheaper
Blended avg$0.537/Mtok
Context128K tokens
Modalitytext
Parameters20B
ReleasedJan 1, 2026
Full details →

Llama 4 Scout

by Groq

Current fast Open weights
Input
$0.110/Mtok
Output
$1.00/Mtok
Blended avg$0.555/Mtok
Context10M tokens
Modalitytext, image
Parameters17B (16 experts)
ReleasedApr 6, 2025
Full details →

Cost at scale — 1M tokens (50/50 input/output)

VolumeGPT-OSS 20BLlama 4 ScoutSavings
1M tokens $0.54 $0.56 $0.02 (3.6%)
10M tokens $5.38 $5.55 $0.18 (3.2%)
100M tokens $53.75 $55.5 $1.75 (3.2%)
1000M tokens $537.5 $555 $17.5 (3.2%)

Summary

GPT-OSS 20B by Groq costs $0.075/Mtok input and $1.00/Mtok output, with a 128K-token context window. It supports text input.

Llama 4 Scout by Groq costs $0.110/Mtok input and $1.00/Mtok output, with a 10M-token context window. It supports text, image input.

On a blended cost basis, GPT-OSS 20B is 3.3% cheaper than Llama 4 Scout.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.