Where does this ranking come from?

Speed is the median of anonymous community benchmark runs; prices are maintained by hand from official provider pages.
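As a rough sketch of how a per-model median could be derived from raw benchmark runs: the sample values, the tokens-per-second unit, and the `median_speed` helper below are illustrative assumptions, not the site's actual pipeline.

```python
from statistics import median


def median_speed(samples_by_model):
    """Return the median speed per model, skipping models with no samples."""
    return {
        model: median(samples)
        for model, samples in samples_by_model.items()
        if samples
    }


# Hypothetical benchmark runs (assumed unit: tokens per second).
runs = {
    "model-a": [82.0, 95.5, 88.1, 240.0],  # includes one unusually fast run
    "model-b": [40.2, 41.0, 39.8],
}

speeds = median_speed(runs)
for model, speed in speeds.items():
    print(f"{model}: {speed:.1f} tokens/s (median)")
```

A median is a reasonable choice for anonymous submissions because a single unusually fast or slow run barely moves it, whereas it would shift a mean noticeably.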

How often is it updated?

Speed refreshes roughly every 5 minutes; prices update as providers change them.

Cheapest LLM APIs (output price ranking)

As of 2026-06-21, the cheapest model is Qwen-Turbo ($0.20 per 1M output tokens). The full ranking is below.


| Rank | Model | Provider | Cache-hit price (per 1M tokens) | Output price (per 1M tokens) |
|---|---|---|---|---|
| 1 | Qwen-Turbo | Qwen (Tongyi Qianwen) | not listed | $0.20 |
| 2 | MiMo V2.5 | Xiaomi MiMo | $0.003 | $0.28 |
| 3 | DeepSeek V4 Flash | DeepSeek | $0.003 | $0.28 |
| 4 | GPT-4o mini | OpenAI | $0.075 | $0.60 |
| 5 | MiMo V2.5 Pro | Xiaomi MiMo | $0.003 | $0.83 |
| 6 | DeepSeek V4 Pro | DeepSeek | $0.004 | $0.87 |
| 7 | Doubao Seed 1.6 | ByteDance Doubao | not listed | $1.1 |
| 8 | Step 3.7 Flash | StepFun | $0.038 | $1.1 |
| 9 | MiniMax-M3 | MiniMax | $0.058 | $1.2 |
| 10 | GPT-4.1 mini | OpenAI | $0.10 | $1.6 |
| 11 | Qwen-Plus | Qwen (Tongyi Qianwen) | not listed | $2.4 |
| 12 | Grok 4.3 | xAI | $0.20 | $2.5 |
| 13 | GLM-5.1 | Zhipu | $0.18 | $3.3 |
| 14 | Kimi K2.6 | Moonshot Kimi | $0.15 | $3.8 |
| 15 | Kimi K2.7 Code | Moonshot Kimi | $0.18 | $3.8 |
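To make the per-1M pricing concrete, here is a minimal sketch that ranks a few of the listed models by output price and estimates the output cost of a workload. The prices are copied from the ranking above; the helper names and the 5M-token workload are illustrative assumptions, and input or cache-hit charges are deliberately ignored.

```python
# (model, output price in USD per 1M tokens), taken from the ranking above.
OUTPUT_PRICES = [
    ("Kimi K2.6", 3.8),
    ("GPT-4o mini", 0.60),
    ("Qwen-Turbo", 0.20),
    ("DeepSeek V4 Flash", 0.28),
    ("Grok 4.3", 2.5),
]


def rank_cheapest(prices):
    """Sort (model, price) pairs by ascending output price."""
    return sorted(prices, key=lambda row: row[1])


def output_cost_usd(price_per_million, output_tokens):
    """Estimate output cost only; input and cache-hit tokens are not counted."""
    return price_per_million * output_tokens / 1_000_000


ranked = rank_cheapest(OUTPUT_PRICES)

# Hypothetical workload: 5 million output tokens.
for model, price in ranked:
    print(f"{model}: ${output_cost_usd(price, 5_000_000):.2f}")
```

Because this counts output tokens only, a real bill would also depend on input volume and on how often requests hit the provider's cache.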


📋 Embed this ranking on your site

Copy the code below (live data, auto-updating):

<iframe src="https://www.tokrace.com/embed/best/cheapest?lang=en" width="400" height="360" style="border:0;border-radius:12px" loading="lazy" title="Cheapest LLM APIs (output price ranking)"></iframe>
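If the snippet needs to be generated with a different size, a small helper along these lines could build it. This is a sketch: the URL, `lang=en`, style, and title come from the snippet above, while the `embed_snippet` function is hypothetical and it is an assumption that the embed renders acceptably at other dimensions or accepts other `lang` values.

```python
from html import escape
from urllib.parse import urlencode

# Embed URL as given in the snippet above.
EMBED_BASE = "https://www.tokrace.com/embed/best/cheapest"
EMBED_TITLE = "Cheapest LLM APIs (output price ranking)"


def embed_snippet(width=400, height=360, lang="en"):
    """Build the iframe markup; width and height are in CSS pixels."""
    src = f"{EMBED_BASE}?{urlencode({'lang': lang})}"
    return (
        f'<iframe src="{escape(src, quote=True)}" '
        f'width="{int(width)}" height="{int(height)}" '
        'style="border:0;border-radius:12px" loading="lazy" '
        f'title="{escape(EMBED_TITLE, quote=True)}"></iframe>'
    )


print(embed_snippet(width=600, height=480))
```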

Note: Speed is the median of anonymous benchmarks. Prices follow the official provider pages, and prices published in other currencies are converted to USD at a fixed exchange rate. For reference only.
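A fixed-rate conversion of this kind might look like the sketch below. The currency (CNY), the rate of 7.2, and the `to_usd` helper are all hypothetical; the page does not state which rate or currencies it uses.

```python
# Hypothetical fixed rate: units of local currency per 1 USD.
# This is NOT the site's actual value, which is not published here.
FIXED_CNY_PER_USD = 7.2


def to_usd(price_local, local_per_usd=FIXED_CNY_PER_USD):
    """Convert a per-1M-token price from local currency to USD."""
    return round(price_local / local_per_usd, 4)


# Example: a provider lists 2.00 CNY per 1M output tokens.
print(f"${to_usd(2.0)} per 1M output tokens")
```

A fixed rate keeps the ranking stable from day to day, at the cost of drifting from the true exchange rate over time, which is one reason the figures are for reference only.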