deepseek-v4-pro - ByteSpike

deepseek-v4-pro is DeepSeek’s reasoning flagship and the most cost-effective high-capability model on ByteSpike. A fraction of the cost of gpt-5-4 at comparable quality on most benchmarks, with the same dual-protocol surface that DeepSeek exposes natively. Pricing:

0.435 / 1M input,

0.87 / 1M output, $0.004 / 1M cache read — see the rate card. DeepSeek V4 is a distinct series from the older V3.2, at different price points. For V3.2 see deepseek-v3-2.

Protocols

Protocol	Path
Anthropic Messages	`POST https://llm.bytespike.ai/v1/messages`
OpenAI Chat Completions	`POST https://llm.bytespike.ai/v1/chat/completions`

Both protocols are first-class. The same ByteSpike key works on either; pick whichever shape your client already speaks.

Quickstart

curl https://llm.bytespike.ai/v1/chat/completions \
  -H "Authorization: Bearer $BYTESPIKE_API_KEY" \
  -H "content-type: application/json" \
  -d '{
    "model": "deepseek-v4-pro",
    "messages": [
      { "role": "user", "content": "Hello, ByteSpike." }
    ]
  }'

Capabilities

Capability	Supported
Chat Completions	✅
Anthropic Messages	✅
Streaming (SSE)	✅
Tool use (function calling)	✅ parallel
JSON mode	✅
Reasoning chain	✅ (`reasoning_content` field / `thinking` block)
Vision (HTTP API)	❌
Context window	64K tokens

The reasoning chain (“thinking output”) appears as:

On the OpenAI endpoint: reasoning_content field on the choice.
On the Anthropic endpoint: a thinking block ahead of the text block in content[].

Reasoning tokens count toward output billing.

When to use

Cost-sensitive reasoning — when you’d reach for gpt-5-4 but want a fraction of the cost.
Agents on a budget — tool_use blocks pass through transparently via the Anthropic endpoint; DOSIA Agent uses this model heavily.
DOSIA Agent mode — must use the Anthropic Messages endpoint.

When not to use:

Vision-required tasks — not on HTTP API today.
Web search — DeepSeek does not expose a grounding tool.
Long context — 64K is the ceiling here; go to claude-opus-4-8 (200K) or gemini-3-1-pro (1M).

deepseek-v4-flash — small-mid
deepseek-v3.2 — older V3 series

deepseek-v4-flash deepseek-r1

​Protocols

​Quickstart

​Capabilities

​When to use

​Next

Protocols

Quickstart

Capabilities

When to use

Next