MiMo-V2
MiMo-V2-Flash Pricing Guide

MiMo-V2-Flash Pricing: API Cost, 256K Context & Low-Latency Fit

MiMo-V2-Flash is the long-tail answer for users searching Xiaomi's low-latency model pricing, API cost, and deployment fit for high-frequency AI workloads.

Pricing Snapshot

For many searchers, MiMo-V2-Flash pricing is the primary decision point.

ModelInput / 1M TokensOutput / 1M TokensPositioning
MiMo-V2-Flash$0.10$0.30Designed for high-frequency, low-latency production scenarios.

Who MiMo-V2-Flash Is For

Low-Latency Deployments

MiMo-V2-Flash is positioned for teams that care about throughput, responsiveness, and budget-aware inference at scale.

Long-Context Efficiency

With 256K context, MiMo-V2-Flash can still support larger prompts and richer sessions without moving up to the most expensive model tier.