Models - ByteSpike

This is the model-first index for ByteSpike. Pick the model you want; the per-model page tells you what it costs, what protocol it speaks, and how to make your first call.

Claude family

Anthropic’s reasoning-and-vision lineup, available via both the Anthropic Messages protocol and the OpenAI Chat Completions API. Cache control, tool use, and thinking blocks pass through transparently.

Model	Tier	Context	Price	Page
`claude-opus-4-8`	flagship	200K	$5/$ 25	→
`claude-opus-4-7`	large	200K	$5/$ 25	→
`claude-opus-4-6`	large	200K	$5/$ 25	legacy
`claude-opus-4-5`	large	200K	$5/$ 25	legacy
`claude-sonnet-4-6`	mid	200K	$3/$ 15	→
`claude-sonnet-4-5`	mid	200K	$3/$ 15	legacy
`claude-haiku-4-5`	small	200K	$1/$ 5	→

GPT family

OpenAI’s GPT lineup — Chat Completions for ubiquitous SDK code, Responses for Codex-style clients. Reasoning, structured outputs, web search, and vision all pass through.

Model	Tier	Context	Price	Page
`gpt-5-5`	flagship	128K	$5/$ 30	→
`gpt-5-4`	mid	128K	$2.50/$ 15	→
`gpt-5-4-mini`	small-mid	128K	$0.75/$ 4.50	→
`gpt-5-4-nano`	small	128K	$0.20/$ 1.25	→ (soon)
`gpt-5-2`	mid	128K	$1.75/$ 14	→

Gemini family

Google’s Gemini lineup, exposed through the OpenAI Chat Completions API. The current flagship is gemini-3-1-pro (1M context); gemini-3-5-flash and gemini-3-flash cover the fast multimodal tiers.

Model	Tier	Context	Price	Page
`gemini-3-1-pro`	flagship	1M	$2/$ 12	→
`gemini-3-5-flash`	mid	1M	$1.50/$ 9	→
`gemini-3-flash`	small-mid	200K	$0.50/$ 3	→
`gemini-2-5-flash`	small-mid	1M	$0.50/$ 3	→

DeepSeek family

The DeepSeek lineup via DeepSeek’s own HTTP API. V4 is dual-protocol (Chat Completions + Anthropic Messages); the older V3.2 is Chat-Completions-only. Vision not exposed on the HTTP API today.

Model	Tier	Context	Price	Page
`deepseek-v4-flash`	small-mid	64K	$0.14/$ 0.28	→
`deepseek-v4-pro`	flagship	64K	$0.435/$ 0.87	→
`deepseek-v3-2`	older V3	64K	$0.14/$ 0.28	→

字节跳动 / 豆包 (Doubao) family

ByteDance’s Doubao lineup — strong on Chinese-language tasks, vision-capable. Served over the OpenAI Chat Completions API.

Model	Tier	Page
`doubao-seed-2.0-pro`	flagship	→
`doubao-seed-1.5-pro`	mid	(soon)
`doubao-lite`	small	(soon)

Zhipu / 智谱 (GLM) family

Zhipu’s GLM lineup — Chinese-LLM general chat with vision support. Served over the OpenAI Chat Completions API.

Model	Tier	Page
`glm-5-1`	flagship	→
`glm-5-air`	mid	(soon)

Moonshot / Kimi family

Moonshot’s Kimi lineup — long-context Chinese-LLM. Served over the OpenAI Chat Completions API.

Model	Tier	Context	Page
`kimi-k2-6`	flagship	128K	→

MiniMax family

MiniMax’s lineup — competitive Chinese-LLM general chat. Served over the OpenAI Chat Completions API.

Model	Tier	Context	Page
`minimax-m2-7`	flagship	128K	→

Image and video

Image and video models live under their own protocol surfaces (/v1/images/generations for image, /v1/tasks/submit for async video). Per-model pages with current per-call rates live in /api-reference/image and /api-reference/video today; they’ll migrate into /models/ in a follow-up.

DOSIA recommended paths

If you’re calling ByteSpike from DOSIA, two protocol surfaces matter — and each one has a recommended primary model plus cheaper fallbacks.

DOSIA mode	Protocol	Primary	Cost-optimized fallback	Region-of-China primary
Agent (tool use, thinking, cache_control)	Anthropic Messages	`claude-sonnet-4-6`	`claude-haiku-4-5`	`deepseek-v4-pro` (anthropic-compat)
Chat (general Q&A, drafting)	OpenAI Chat Completions	`gpt-5-4`	`gpt-5-4-mini`	`deepseek-v4-flash` or `doubao-seed-2.0-pro`

For DOSIA Cloud Enterprise admins building permission templates:

Global edition preset → unlock claude-sonnet-4-6, claude-opus-4-8, gpt-5-4, gpt-5-4-mini, gpt-5-5, gemini-2-5-flash, gemini-3-1-pro. Default to Anthropic for Agent, OpenAI for Chat.
China edition preset → unlock deepseek-v4-pro, deepseek-v4-flash, doubao-seed-2.0-pro, glm-5-1, kimi-k2-6, minimax-m2-7, deepseek-v3-2. Default to deepseek-v4-pro for Agent (anthropic-compat), doubao-seed-2.0-pro for Chat.

Both presets are admin-defined in the DOSIA Cloud Console; members never have to pick a model unless they want to override.

How to read a model page

Each page is the same shape:

Quickstart — a runnable request against the model via its primary protocol.
Capabilities — what the model itself supports (vision, tools, reasoning, JSON mode, streaming).
When to use — opinionated guidance.
Next — links to related models.

Same shape across families so you can scan side-by-side.

Endpoint types — the eight request shapes ByteSpike accepts
Pricing — the per-model public rate card

​Claude family

​GPT family

​Gemini family

​DeepSeek family

​字节跳动 / 豆包 (Doubao) family

​Zhipu / 智谱 (GLM) family

​Moonshot / Kimi family

​MiniMax family

​Image and video

​DOSIA recommended paths

​How to read a model page

​Next