kimi-k2-6 is Moonshot’s long-context flagship — strong on Chinese tasks with a 128K-token context window.
Pricing
| Input | Cache read | Output |
|---|---|---|
| $0.95 / 1M | $0.16 / 1M | $4.00 / 1M |
Protocols
| Protocol | Path |
|---|---|
| OpenAI Chat Completions | POST https://llm.bytespike.ai/v1/chat/completions |
Quickstart
Capabilities
| Capability | Supported |
|---|---|
| Chat Completions | ✅ |
| Streaming (SSE) | ✅ |
| Tools / function calling | ✅ |
| JSON mode | ✅ |
| Vision (image input) | — |
| Long context | ✅ 128K tokens |
| Context window | 128K tokens |
When to use
- Long-document Chinese-language QA — 128K context for large-document review.
- CN-region multi-doc synthesis — Kimi’s strongest dimension.
- Long-context workloads — throw long prompts at it.
Next
- claude-opus-4-8 — Anthropic flagship alternative
- gemini-3-1-pro — 1M-context alternative