Best LLM API for Chatbots
Find the best LLM API for chatbots and conversational AI. Compare pricing for models optimized for dialogue, customer support, and interactive assistants.
Cost calculator for this use case
🥇 Granite 4 H Medium
$—
🥈 Llama 3.1 8B Instant
$—
🥉 GPT-OSS 20B
$—
Full ranking — top 15 models
| # | Model | Provider | Input $/Mtok | Output $/Mtok | Blended | Context | |
|---|---|---|---|---|---|---|---|
| 1 | Granite 4 H Medium | IBM | $0.150 | $0.600 | $0.375 | 128K | → |
| 2 | Llama 3.1 8B Instant | Groq | $0.050 | $1.00 | $0.525 | 128K | → |
| 3 | GPT-OSS 20B | Groq | $0.075 | $1.00 | $0.537 | 128K | → |
| 4 | Llama 4 Scout | Groq | $0.110 | $1.00 | $0.555 | 10M | → |
| 5 | GPT-OSS 120B | Groq | $0.150 | $1.00 | $0.575 | 128K | → |
| 6 | Qwen3 32B | Groq | $0.290 | $1.00 | $0.645 | 128K | → |
| 7 | DeepSeek V4 Pro | DeepSeek | $0.435 | $0.870 | $0.652 | 1M | → |
| 8 | Granite 4 H Large | IBM | $0.300 | $1.20 | $0.750 | 128K | → |
| 9 | Llama 3.3 70B Versatile | Groq | $0.590 | $1.00 | $0.795 | 128K | → |
| 10 | Qwen-Plus | Alibaba | $0.400 | $1.20 | $0.800 | 131K | → |
| 11 | Qwen 3.6 27B | Groq | $0.600 | $1.00 | $0.800 | 128K | → |
| 12 | Mistral Large 3 | Mistral | $0.500 | $1.50 | $1.00 | 128K | → |
| 13 | Sonar | Perplexity | $1.00 | $1.00 | $1.00 | 200K | → |
| 14 | Gemini 3.1 Flash | $0.300 | $2.50 | $1.40 | 1M | → | |
| 15 | Reka Flash | Reka | $0.800 | $2.00 | $1.40 | 128K | → |
How models are selected
Mid-tier and flagship models suitable for conversational use, sorted by blended cost.
Prices are per million tokens (Mtok), sourced directly from official provider pricing pages and verified by our automated scraper pipeline that runs 3x daily. "Blended cost" is the average of input and output pricing — a quick proxy for typical 50/50 usage patterns.