Claude family
Anthropic’s reasoning-and-vision lineup, available via both the Anthropic Messages protocol and the OpenAI Chat Completions API. Cache control, tool use, and thinking blocks pass through transparently.GPT family
OpenAI’s GPT lineup — Chat Completions for ubiquitous SDK code, Responses for Codex-style clients. Reasoning, structured outputs, web search, and vision all pass through.Gemini family
Google’s Gemini lineup, exposed through the OpenAI Chat Completions API. The current flagship isgemini-3-1-pro (1M context); gemini-3-5-flash and gemini-3-flash cover the fast multimodal tiers.
DeepSeek family
The DeepSeek lineup via DeepSeek’s own HTTP API. V4 is dual-protocol (Chat Completions + Anthropic Messages); the older V3.2 is Chat-Completions-only. Vision not exposed on the HTTP API today.字节跳动 / 豆包 (Doubao) family
ByteDance’s Doubao lineup — strong on Chinese-language tasks, vision-capable. Served over the OpenAI Chat Completions API.| Model | Tier | Page |
|---|---|---|
doubao-seed-2.0-pro | flagship | → |
doubao-seed-1.5-pro | mid | (soon) |
doubao-lite | small | (soon) |
Zhipu / 智谱 (GLM) family
Zhipu’s GLM lineup — Chinese-LLM general chat with vision support. Served over the OpenAI Chat Completions API.| Model | Tier | Page |
|---|---|---|
glm-5-1 | flagship | → |
glm-5-air | mid | (soon) |
Moonshot / Kimi family
Moonshot’s Kimi lineup — long-context Chinese-LLM. Served over the OpenAI Chat Completions API.| Model | Tier | Context | Page |
|---|---|---|---|
kimi-k2-6 | flagship | 128K | → |
MiniMax family
MiniMax’s lineup — competitive Chinese-LLM general chat. Served over the OpenAI Chat Completions API.| Model | Tier | Context | Page |
|---|---|---|---|
minimax-m2-7 | flagship | 128K | → |
Image and video
Image and video models live under their own protocol surfaces (/v1/images/generations for image, /v1/tasks/submit for async video). Per-model pages with current per-call rates live in /api-reference/image and /api-reference/video today; they’ll migrate into /models/ in a follow-up.
DOSIA recommended paths
If you’re calling ByteSpike from DOSIA, two protocol surfaces matter — and each one has a recommended primary model plus cheaper fallbacks.| DOSIA mode | Protocol | Primary | Cost-optimized fallback | Region-of-China primary |
|---|---|---|---|---|
| Agent (tool use, thinking, cache_control) | Anthropic Messages | claude-sonnet-4-6 | claude-haiku-4-5 | deepseek-v4-pro (anthropic-compat) |
| Chat (general Q&A, drafting) | OpenAI Chat Completions | gpt-5-4 | gpt-5-4-mini | deepseek-v4-flash or doubao-seed-2.0-pro |
- Global edition preset → unlock
claude-sonnet-4-6,claude-opus-4-8,gpt-5-4,gpt-5-4-mini,gpt-5-5,gemini-2-5-flash,gemini-3-1-pro. Default to Anthropic for Agent, OpenAI for Chat. - China edition preset → unlock
deepseek-v4-pro,deepseek-v4-flash,doubao-seed-2.0-pro,glm-5-1,kimi-k2-6,minimax-m2-7,deepseek-v3-2. Default todeepseek-v4-profor Agent (anthropic-compat),doubao-seed-2.0-profor Chat.
How to read a model page
Each page is the same shape:- Quickstart — a runnable request against the model via its primary protocol.
- Capabilities — what the model itself supports (vision, tools, reasoning, JSON mode, streaming).
- When to use — opinionated guidance.
- Next — links to related models.
Next
- Endpoint types — the eight request shapes ByteSpike accepts
- Pricing — the per-model public rate card