LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / DeepSeek V4 Flash vs GPT OSS 120B

DeepSeek V4 Flash vs GPT OSS 120B

Side-by-side API pricing comparison · Fireworks vs Fireworks

🏆 DeepSeek V4 Flash is 78.6% cheaper on blended cost ($0.210 vs $0.375/Mtok)

DeepSeek V4 Flash

by Fireworks

Current budget
Input
$0.140/Mtok
Output
$0.280/Mtok
✓ Cheaper
Blended avg$0.210/Mtok
Cached input$0.028/Mtok
Context1M tokens
Modalitytext
ParametersProprietary
ReleasedDec 1, 2025
Full details →

GPT OSS 120B

by Fireworks

Current budget Open weights
Input
$0.150/Mtok
Output
$0.600/Mtok
Blended avg$0.375/Mtok
Cached input$0.015/Mtok
Context128K tokens
Modalitytext
Parameters120B
ReleasedJan 1, 2026
Full details →

Cost at scale — 1M tokens (50/50 input/output)

VolumeDeepSeek V4 FlashGPT OSS 120BSavings
1M tokens $0.21 $0.38 $0.16 (42.7%)
10M tokens $2.1 $3.75 $1.65 (44%)
100M tokens $21 $37.5 $16.5 (44%)
1000M tokens $210 $375 $165 (44%)

Summary

DeepSeek V4 Flash by Fireworks costs $0.140/Mtok input and $0.280/Mtok output, with a 1M-token context window. It supports text input.

GPT OSS 120B by Fireworks costs $0.150/Mtok input and $0.600/Mtok output, with a 128K-token context window. It supports text input.

On a blended cost basis, DeepSeek V4 Flash is 78.6% cheaper than GPT OSS 120B. It also has a larger context window.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.