Fastest LLM APIs (real-world speed ranking)

As of 2026-06-21, the top model is step-3.7-flash (162 tok/s). See the full ranking below — click a model for details or run your own test.

⚡ Fastest💰 Cheapest⭐ Best value
  1. 🥇
    step-3.7-flashapi.stepfun.com
    TTFT 1.47s · peak 528 · 44 runs
    162 tok/s
  2. 🥈
    deepseek-v4-flashapi.deepseek.com
    TTFT 0.71s · peak 346 · 60 runs
    142 tok/s
  3. 🥉
    kimi-for-codingapi.kimi.com
    TTFT 1.41s · peak 900 · 83 runs
    115 tok/s
  4. 4
    mimo-v2.5api.xiaomimimo.com
    TTFT 6.09s · peak 514 · 26 runs
    85 tok/s
  5. 5
    deepseek-v4-proapi.deepseek.com
    TTFT 1.37s · peak 293 · 35 runs
    81 tok/s
  6. 6
    glm-5.1open.bigmodel.cn
    TTFT 4.70s · peak 594 · 41 runs
    53 tok/s
  7. 7
    glm-5.2open.bigmodel.cn
    TTFT 3.97s · peak 475 · 55 runs
    49 tok/s
  8. 8
    mimo-v2.5-proapi.xiaomimimo.com
    TTFT 2.02s · peak 131 · 22 runs
    37 tok/s
Full pricingSpeed leaderboardRun your own test
📋 Embed this ranking on your site

Copy the code below (live data, auto-updating):

<iframe src="https://www.tokrace.com/embed/best/fastest?lang=en" width="400" height="360" style="border:0;border-radius:12px" loading="lazy" title="Fastest LLM APIs (real-world speed ranking)"></iframe>

· Speed is the median of anonymous benchmarks; prices follow official pages and cross-currency uses a fixed USD rate. Reference only.