What is the best LLM API for embedding?

Based on our verified pricing data, the cheapest model that qualifies is Rerank 3.5 by Cohere at $0.020/Mtok input. See the full ranking above for more options.

How often are prices updated?

Prices are verified against official provider pricing pages 3 times daily (8am, 2pm, 8pm UTC) by our automated scraper pipeline.

Pricing / Best For / Best Embedding Model APIs

Best Embedding Model APIs

Compare embedding model APIs for RAG pipelines and semantic search. Find the cheapest and most capable embedding models for vector databases.

14 models qualify Showing top 14 Sorted by blended cost

Rerank 3.5

Cohere

$0.020 in $0.020 out

$0.020/Mtok blended

— ctx

text-embedding-3-small

OpenAI

$0.020 in $0.020 out

$0.020/Mtok blended

8K ctx

rerank-2.5-lite

Voyage AI

$0.020 in $0.020 out

$0.020/Mtok blended

— ctx

Cost calculator for this use case

Tokens per day

Input/output ratio: 70/30

Days per month

🥇 Rerank 3.5 $—

🥈 text-embedding-3-small $—

🥉 rerank-2.5-lite $—

Full ranking — top 14 models

#	Model	Provider	Input $/Mtok	Output $/Mtok	Blended	Context
1	Rerank 3.5	Cohere	$0.020	$0.020	$0.020	—	→
2	text-embedding-3-small	OpenAI	$0.020	$0.020	$0.020	8K	→
3	rerank-2.5-lite	Voyage AI	$0.020	$0.020	$0.020	—	→
4	voyage-4-lite	Voyage AI	$0.020	$0.020	$0.020	—	→
5	text-embedding-004	Google	$0.025	$0.025	$0.025	2K	→
6	rerank-2.5	Voyage AI	$0.050	$0.050	$0.050	—	→
7	voyage-4	Voyage AI	$0.060	$0.060	$0.060	—	→
8	Granite Embedding 278M Multilingual	IBM	$0.106	$0.106	$0.106	—	→
9	Embed 4	Cohere	$0.120	$0.120	$0.120	—	→
10	voyage-4-large	Voyage AI	$0.120	$0.120	$0.120	—	→
11	voyage-multimodal-3.5	Voyage AI	$0.120	$0.120	$0.120	—	→
12	text-embedding-3-large	OpenAI	$0.130	$0.130	$0.130	8K	→
13	voyage-code-3	Voyage AI	$0.180	$0.180	$0.180	32K	→
14	voyage-context-3	Voyage AI	$0.180	$0.180	$0.180	32K	→

How models are selected

Embedding-category models, sorted by input price per million tokens.

Prices are per million tokens (Mtok), sourced directly from official provider pricing pages and verified by our automated scraper pipeline that runs 3x daily. "Blended cost" is the average of input and output pricing — a quick proxy for typical 50/50 usage patterns.

Best Embedding Model APIs

Rerank 3.5

text-embedding-3-small

rerank-2.5-lite

Cost calculator for this use case

Full ranking — top 14 models

How models are selected

Other use case rankings