﻿# Pricing and Rate Limits

The platform sets a model concurrency limit for accounts. When server load is high, response delays or 429 errors may occur. For details on the RPM and TPM limits of each model, please refer to the following table. We recommend that you plan your request frequency reasonably. 

> RPM: Requests Per Minute, which refers to the maximum number of requests you can initiate to us within one minute, and is the sum of the number of requests from all API Keys of a single account when invoking a certain model
>
> TPM: Tokens Per Minute, which refers to the maximum number of Tokens you can interact with us within one minute, and is the sum of the number of requested Tokens from all API Keys of a single account when invoking a certain model

## Pricing 

### Domestic Pricing of the Model

<table>
<colgroup>
<col style="width: 259px" />
<col style="width: 152px" />
<col style="width: 152px" />
<col style="width: 152px" />
<col style="width: 168px" />
<col style="width: 154px" />
<col />
</colgroup>
<thead>
<tr>
<th></th>
<th colspan="3">Input ≤ 256K</th>
<th colspan="3">Input 256K - 1M</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>Input (Cache Miss)</td>
<td>Input (Cache Hit)</td>
<td>Output</td>
<td>Input (Cache Miss)</td>
<td>Input (Cache Hit)</td>
<td>Output</td>
</tr>
<tr>
<td>`mimo-v2.5-pro`<br />`mimo-v2-pro`</td>
<td>¥7.00</td>
<td>¥1.40</td>
<td>¥21.00</td>
<td>¥14.00</td>
<td>¥2.80</td>
<td>¥42.00</td>
</tr>
<tr>
<td>`mimo-v2.5`</td>
<td>¥2.80</td>
<td>¥0.56</td>
<td>¥14.00</td>
<td>¥5.60</td>
<td>¥1.12</td>
<td>¥28.00</td>
</tr>
<tr>
<td>`mimo-v2-omni`</td>
<td>¥2.80</td>
<td>¥0.56</td>
<td>¥14.00</td>
<td>—</td>
<td>—</td>
<td>—</td>
</tr>
<tr>
<td>`mimo-v2-flash`</td>
<td>¥0.70</td>
<td>¥0.07</td>
<td>¥2.10</td>
<td>—</td>
<td>—</td>
<td>—</td>
</tr>
<tr>
<td>`mimo-v2.5-tts`<br />`mimo-v2.5-tts-voiceclone`<br />`mimo-v2.5-tts-voicedesign`<br />`mimo-v2-tts`</td>
<td>Limited-time free</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

> Note: Cache writing is currently free of charge for a limited time; — indicates that the context limit of this model is 256K, and this range does not apply. Unit: yuan / 1M tokens.

### Overseas Pricing of the Model 

<table>
<colgroup>
<col style="width: 272px" />
<col style="width: 145px" />
<col style="width: 152px" />
<col style="width: 152px" />
<col style="width: 172px" />
<col style="width: 150px" />
<col />
</colgroup>
<thead>
<tr>
<th></th>
<th colspan="3">Input ≤ 256K</th>
<th colspan="3">Input 256K - 1M</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>Input (Cache Miss)</td>
<td>Input (Cache Hit)</td>
<td>Output</td>
<td>Input (Cache Miss)</td>
<td>Input (Cache Hit)</td>
<td>Output</td>
</tr>
<tr>
<td>`mimo-v2.5-pro`<br /> `mimo-v2-pro`</td>
<td>$1.00</td>
<td>$0.20</td>
<td>$3.00</td>
<td>$2.00</td>
<td>$0.40</td>
<td>$6.00</td>
</tr>
<tr>
<td>`mimo-v2.5`</td>
<td>$0.40</td>
<td>$0.08</td>
<td>$2.00</td>
<td>$0.80</td>
<td>$0.16</td>
<td>$4.00</td>
</tr>
<tr>
<td>`mimo-v2-omni`</td>
<td>$0.40</td>
<td>$0.08</td>
<td>$2.00</td>
<td>—</td>
<td>—</td>
<td>—</td>
</tr>
<tr>
<td>`mimo-v2-flash`</td>
<td>$0.10</td>
<td>$0.01</td>
<td>$0.30</td>
<td>—</td>
<td>—</td>
<td>—</td>
</tr>
<tr>
<td>`mimo-v2.5-tts`<br />`mimo-v2.5-tts-voiceclone`<br />`mimo-v2.5-tts-voicedesign`<br />`mimo-v2-tts`</td>
<td>Limited-time free</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

> Note: Cache writing is currently free of charge for a limited time; — indicates that the context limit of this model is 256K, and this range does not apply. Unit: $ / 1M tokens.

### Pricing for Network Service Plugins 

<table>
<colgroup>
<col />
<col />
<col style="width: 444px" />
</colgroup>
<thead>
<tr>
<th>Service Item</th>
<th>Price</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Domestic Internet Connectivity Service</td>
<td>¥25 / 1000 times</td>
<td>Includes web search and web parsing, used for domestic regional networked search of relevant content</td>
</tr>
<tr>
<td>Overseas Internet Connectivity Service</td>
<td>$5 / 1000 times</td>
<td>Includes web search and web parsing, used for networked search of relevant content in overseas regions</td>
</tr>
</tbody>
</table>

## Model Details

### Pro Series

<table>
<colgroup>
<col />
<col style="width: 612px" />
</colgroup>
<thead>
<tr>
<th>**Model Name**</th>
<th>`mimo-v2.5-pro`, `mimo-v2-pro`</th>
</tr>
</thead>
<tbody>
<tr>
<td>**Category**</td>
<td>Text Generation - General Large Language Model</td>
</tr>
<tr>
<td>**Context Length**</td>
<td>1 M</td>
</tr>
<tr>
<td>**Maximum Output Length**</td>
<td>128 K</td>
</tr>
<tr>
<td>**Model Capability**</td>
<td>Text generation, deep thinking, streaming output, function call, structured output, internet search</td>
</tr>
<tr>
<td>**Flow Control**</td>
<td>RPM: 100<br />TPM: 10 M</td>
</tr>
</tbody>
</table>

### Omni Series

<table>
<colgroup>
<col />
<col style="width: 296px" />
<col style="width: 323px" />
</colgroup>
<thead>
<tr>
<th>**Model Name**</th>
<th>`mimo-v2.5`</th>
<th>`mimo-v2-omni`</th>
</tr>
</thead>
<tbody>
<tr>
<td>**Category**</td>
<td>Text Generation - Full Modal Understanding Model</td>
<td>Text Generation - Full Modal Understanding Model</td>
</tr>
<tr>
<td>**Context Length**</td>
<td>1 M</td>
<td>256 K</td>
</tr>
<tr>
<td>**Maximum Output Length**</td>
<td>128 K</td>
<td>128 K</td>
</tr>
<tr>
<td>**Model Capability**</td>
<td colspan="2">Full-modal understanding, in-depth thinking, streaming output, function call, structured output, and internet search</td>
</tr>
<tr>
<td>**Flow Control**</td>
<td colspan="2">RPM: 100<br />TPM: 10 M</td>
</tr>
</tbody>
</table>

### TTS Series

<table>
<colgroup>
<col />
<col style="width: 207px" />
<col style="width: 236px" />
<col style="width: 261px" />
<col style="width: 237px" />
</colgroup>
<thead>
<tr>
<th>**Model Name**</th>
<th>`mimo-v2.5-tts`</th>
<th>`mimo-v2.5-tts-voiceclone`</th>
<th>`mimo-v2.5-tts-voicedesign`</th>
<th>`mimo-v2-tts`</th>
</tr>
</thead>
<tbody>
<tr>
<td>**Category**</td>
<td>Speech Synthesis Model</td>
<td>Speech Synthesis Model</td>
<td>Speech Synthesis Model</td>
<td>Speech Synthesis Model</td>
</tr>
<tr>
<td>**Context Length**</td>
<td>8 K</td>
<td>8 K</td>
<td>8 K</td>
<td>8 K</td>
</tr>
<tr>
<td>**Maximum Output Length**</td>
<td>8 K</td>
<td>8 K</td>
<td>8 K</td>
<td>8 K</td>
</tr>
<tr>
<td>**Model Capability**</td>
<td>Speech Synthesis</td>
<td>Timbre Cloning</td>
<td>Timbre Design</td>
<td>Speech Synthesis</td>
</tr>
<tr>
<td>**Flow Control**</td>
<td colspan="4">RPM: 100<br />TPM: 10 M</td>
</tr>
</tbody>
</table>

### MiMo-V2-Flash

<table>
<colgroup>
<col />
<col style="width: 659px" />
</colgroup>
<thead>
<tr>
<th>**Model Name**</th>
<th>`mimo-v2-flash`</th>
</tr>
</thead>
<tbody>
<tr>
<td>**Category**</td>
<td>Text Generation - General Large Language Model</td>
</tr>
<tr>
<td>**Context Length**</td>
<td>256 K</td>
</tr>
<tr>
<td>**Maximum Output Length**</td>
<td>64 K</td>
</tr>
<tr>
<td>**Model Capability**</td>
<td>Text generation, deep thinking, streaming output, function call, structured output, internet search</td>
</tr>
<tr>
<td>**Flow Control**</td>
<td>RPM: 100<br />TPM: 10 M</td>
</tr>
</tbody>
</table>
