DeepSeek V4 Flash vs GPT OSS 120B
Side-by-side API pricing comparison · Fireworks vs Fireworks
🏆 DeepSeek V4 Flash is 44% cheaper on blended cost ($0.210 vs $0.375/Mtok)
DeepSeek V4 Flash
by Fireworks
Current budget
Input
$0.140/Mtok
Output
$0.280/Mtok
✓ Cheaper
| Blended avg | $0.210/Mtok |
|---|---|
| Cached input | $0.028/Mtok |
| Context | 1M tokens |
| Modality | text |
| Parameters | Proprietary |
| Released | Dec 1, 2025 |
GPT OSS 120B
by Fireworks
Current budget Open weights
Input
$0.150/Mtok
Output
$0.600/Mtok
| Blended avg | $0.375/Mtok |
|---|---|
| Cached input | $0.015/Mtok |
| Context | 128K tokens |
| Modality | text |
| Parameters | 120B |
| Released | Jan 1, 2026 |
Cost at scale (50/50 input/output split)
| Volume | DeepSeek V4 Flash | GPT OSS 120B | Savings |
|---|---|---|---|
| 1M tokens | $0.21 | $0.375 | $0.165 (44%) |
| 10M tokens | $2.10 | $3.75 | $1.65 (44%) |
| 100M tokens | $21.00 | $37.50 | $16.50 (44%) |
| 1000M tokens | $210.00 | $375.00 | $165.00 (44%) |
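The cost-at-scale figures follow from a weighted average of the listed input and output prices. A minimal sketch of that arithmetic is below; the function names and the `input_share` parameter are illustrative assumptions, not part of any provider API, and the prices are copied from the listing above.

```python
# Prices in USD per million tokens (Mtok), copied from the listing above.
PRICES = {
    "DeepSeek V4 Flash": {"input": 0.140, "output": 0.280},
    "GPT OSS 120B": {"input": 0.150, "output": 0.600},
}


def blended_price(model: str, input_share: float = 0.5) -> float:
    """Return the blended USD/Mtok price for a given input share (0..1)."""
    p = PRICES[model]
    return input_share * p["input"] + (1.0 - input_share) * p["output"]


def cost(model: str, total_mtok: float, input_share: float = 0.5) -> float:
    """Return the USD cost of total_mtok million tokens at the given mix."""
    return total_mtok * blended_price(model, input_share)


if __name__ == "__main__":
    for mtok in (1, 10, 100, 1000):
        a = cost("DeepSeek V4 Flash", mtok)
        b = cost("GPT OSS 120B", mtok)
        print(f"{mtok:>5}M tokens: ${a:,.2f} vs ${b:,.2f}, saving ${b - a:,.2f}")
```

Changing `input_share` shows how sensitive the comparison is to the workload: because the input prices are close and the output prices are not, the gap narrows for input-heavy traffic and widens for output-heavy traffic.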
Summary
DeepSeek V4 Flash by Fireworks costs $0.140/Mtok input and $0.280/Mtok output, with a 1M-token context window. It supports text input.
GPT OSS 120B by Fireworks costs $0.150/Mtok input and $0.600/Mtok output, with a 128K-token context window. It supports text input.
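On a blended cost basis, DeepSeek V4 Flash is 44% cheaper than GPT OSS 120B ($0.210 vs $0.375/Mtok); put the other way, GPT OSS 120B costs 78.6% more. DeepSeek V4 Flash also has the larger context window (1M vs 128K tokens).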
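A percentage difference between two prices depends on which price is used as the baseline. The sketch below computes both readings from the two blended prices in the listing; the helper names are hypothetical and chosen only for illustration.

```python
def pct_cheaper(low: float, high: float) -> float:
    """Percent by which `low` undercuts `high` (baseline: the higher price)."""
    return (high - low) / high * 100.0


def pct_more_expensive(low: float, high: float) -> float:
    """Percent by which `high` exceeds `low` (baseline: the lower price)."""
    return (high - low) / low * 100.0


deepseek_blended = 0.210  # USD/Mtok, from the listing
gpt_oss_blended = 0.375   # USD/Mtok, from the listing

print(f"{pct_cheaper(deepseek_blended, gpt_oss_blended):.1f}% cheaper")
print(f"{pct_more_expensive(deepseek_blended, gpt_oss_blended):.1f}% more expensive")
```

"Cheaper" is measured against the more expensive option, so a figure above 50% would require the cheaper model to cost less than half as much.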
Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.
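To see how prompt caching could shift the comparison, here is a rough sketch that mixes the cached and uncached input prices from the listing. It assumes that a fixed fraction of input tokens (`cache_hit_rate`, a hypothetical parameter) is billed at the cached rate and the rest at the standard input rate; actual billing rules may differ, so treat the numbers as illustrative only.

```python
def effective_input_price(input_price: float, cached_price: float,
                          cache_hit_rate: float) -> float:
    """Average USD/Mtok input price when a share of input tokens hits the cache."""
    return cache_hit_rate * cached_price + (1.0 - cache_hit_rate) * input_price


def blended_with_cache(input_price: float, cached_price: float,
                       output_price: float, cache_hit_rate: float,
                       input_share: float = 0.5) -> float:
    """Blended USD/Mtok price with caching applied to the input side only."""
    eff_in = effective_input_price(input_price, cached_price, cache_hit_rate)
    return input_share * eff_in + (1.0 - input_share) * output_price


if __name__ == "__main__":
    hit_rate = 0.8  # assumed for illustration: 80% of input tokens are cached
    deepseek = blended_with_cache(0.140, 0.028, 0.280, hit_rate)
    gpt_oss = blended_with_cache(0.150, 0.015, 0.600, hit_rate)
    print(f"DeepSeek V4 Flash: ${deepseek:.4f}/Mtok")
    print(f"GPT OSS 120B:      ${gpt_oss:.4f}/Mtok")
```

Under these assumptions GPT OSS 120B has the lower effective input price at high cache hit rates, but with a 50/50 mix its higher output price still dominates the blended figure.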