Model and Rate Limit

This page lists all the models currently supported by the Xiaomi MiMo API Open Platform, including model capabilities, length limits, and rate-limiting quotas, to help you select the appropriate model based on your usage scenario.

Rate Limiting Instructions

The platform sets a model concurrency limit for each account. When the server load is high, response delays or 429 error may occur. We recommend that you reasonably plan your request frequency and implement request retry and backoff strategies in high-concurrency scenarios to avoid triggering rate limits.

RPM (Requests Per Minute): The maximum number of requests initiated per minute. The calculation scope is the sum of the total number of requests from all API Keys under a single account when calling the same model.
TPM (Tokens Per Minute): The maximum number of Tokens that can be interacted with per minute. The calculation scope is the sum of the total number of requested Tokens for all API Keys under a single account when calling the same model.

Text Generation Model

Model Series	Model ID (Model ID)	Capability Support	Length Limit (token)	Rate Limiting
Pro Series	`mimo-v2.5-pro`	Text Generation Deep Thinking Streaming Output Function Call Structured Output Web Search	Context Window: 1M Maximum Output: 128K	Maximum RPM: 100 Maximum TPM: 10M
Pro Series	`mimo-v2-pro`		Context Window: 1M Maximum Output: 128K
Omni Series	`mimo-v2.5`	Text Generation Full-modal Understanding Deep Thinking Streaming Output Function Call Structured Output Web Search	Context Window: 1M Maximum Output: 128K
Omni Series	`mimo-v2-omni`		Context Window: 256K Maximum Output: 128K
Flash Series	`mimo-v2-flash`	Text Generation Deep Thinking Streaming Output Function Call Structured Output Web Search	Context Window: 256K Maximum Output: 64K

Text-to-Speech (TTS) Model

Model ID (Model ID)	Capability Support	Length Limit (token)	Rate Limiting
`mimo-v2.5-tts`	Speech Synthesis	Context Window: 8K Maximum Output: 8K	Maximum RPM: 100 Maximum TPM: 10M
`mimo-v2.5-tts-voiceclone`	Speech Synthesis Timbre Cloning
`mimo-v2.5-tts-voicedesign`	Speech Synthesis Timbre Design
`mimo-v2-tts`	Speech Synthesis

Quick Selection Guide

Requirement Scenario	Recommendation Model
Complex reasoning, in-depth analysis, long document processing	`mimo-v2.5-pro`
Understanding of image, audio, and video content	`mimo-v2.5` or `mimo-v2-omni`
High concurrency, low cost, and fast response	`out-of-v2-flash`
Text-to-Speech (Standard Preset Voice)	`mimo-v2.5-tts`
Voice Cloning (Upload Audio Sample)	`mimo-v2.5-tts-voiceclone`
Customized Tone Design	`mimo-v2.5-tts-voicedesign`

Update Time May 22, 2026