LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Jun 25, 2026
Jun 25, 2026
ModelPriceWatch$/Mtok
Pricing / Compare / text-embedding-004 vs Gemini 2.5 Flash

text-embedding-004 vs Gemini 2.5 Flash

Side-by-side API pricing comparison · Google vs Google

🏆 text-embedding-004 is 650% cheaper on blended cost ($0.025 vs $0.188/Mtok)

text-embedding-004

by Google

Current embedding
Input
$0.025/Mtok
Output
$0.025/Mtok
✓ Cheaper
Blended avg$0.025/Mtok
Context2K tokens
Modalitytext
ParametersProprietary
ReleasedOct 1, 2024
Full details →

Gemini 2.5 Flash

by Google

Current budget
Input
$0.075/Mtok
Output
$0.300/Mtok
Blended avg$0.188/Mtok
Context1M tokens
Modalitytext, image, audio, video
ParametersProprietary
ReleasedJun 17, 2025
Full details →

Cost at scale — 1M tokens (50/50 input/output)

Volumetext-embedding-004Gemini 2.5 FlashSavings
1M tokens $0.03 $0.19 $0.16 (85.3%)
10M tokens $0.25 $1.88 $1.63 (86.9%)
100M tokens $2.5 $18.75 $16.25 (86.7%)
1000M tokens $25 $187.5 $162.5 (86.7%)

Summary

text-embedding-004 by Google costs $0.025/Mtok input and $0.025/Mtok output, with a 2K-token context window. It supports text input.

Gemini 2.5 Flash by Google costs $0.075/Mtok input and $0.300/Mtok output, with a 1M-token context window. It supports text, image, audio, video input.

On a blended cost basis, text-embedding-004 is 650% cheaper than Gemini 2.5 Flash.

Note: Pricing is per million tokens. Actual costs vary with usage patterns, prompt caching, and batch discounts. Always verify against official provider pricing pages.