Real-world LLM speed leaderboard

Anonymous real-world speed results from TOKRACE, annotated by sample confidence. Current 8 models · 335 runs

ModelMedian tok/sTTFTPeak
api.stepfun.com· 40 runsUsable signal
164
avg 176
1.50s
528
api.deepseek.com· 56 runsStable sample
142
avg 138
0.71s
346
api.kimi.com· 74 runsStable sample
104
avg 170
1.41s
565
api.xiaomimimo.com· 25 runsUsable signal
86
avg 86
6.30s
514
api.deepseek.com· 33 runsUsable signal
81
avg 87
1.41s
293
open.bigmodel.cn· 36 runsUsable signal
53
avg 64
4.80s
594
open.bigmodel.cn· 50 runsStable sample
50
avg 60
3.96s
475
api.xiaomimimo.com· 21 runsUsable signal
35
avg 38
2.07s
131

· Data comes from voluntary anonymous sharing and contains speed metrics only, not prompts or API keys · Updates every 5 minutes

· Speed is affected by network, time of day and provider load; different endpoints for the same model are tracked separately