minimax-m2.5
Capability: 128K context · tool use · streaming · CJK-native
Pricing: per-token, mid tier (live rate)
MiniMax M2.5 was MiniMax’s prior production model. It earned a
reputation for creative writing in Chinese — character voice
consistency, dialogue, narrative flow — at a price tier where most
competitors flatten tone. Still production-capable for that niche.
For new general-purpose work, prefer
MiniMax M2.7.
Request
Body parameters
| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
model | string | yes | — | minimax-m2.5 |
messages | array | yes | — | CJK accepted natively. |
max_tokens | integer | no | model max | Max: 8192. |
temperature | number | no | 1.0 | Creative writing benefits from 1.0–1.3. |
tools | array | no | — | Function calling supported. |
stream | boolean | no | false | SSE streaming. |
Response
Code examples
Streaming + caching
"stream": true for SSE. Automatic prompt caching.
Errors
| Code | Trigger | Billed? |
|---|---|---|
| 400 / 401 / 402 / 422 / 429 | Standard | No |
| 5xx | Upstream | No (auto-retry) |
When to use
- Existing creative-writing pipelines validated against this version.
- For new work, prefer MiniMax M2.7.
- For Chinese general-purpose flagship, see Kimi K2.6 or GLM-5.1.
Limits
| Limit | Value |
|---|---|
| Context window | 128K tokens |
| Max output | 8192 tokens |
| Supports tool use | Yes |
| Supports vision | No |
| Supports streaming | Yes |
| Supports prompt caching | Automatic |