claude-opus-4-7 - ByteSpike

claude-opus-4-7 is the prior-generation Opus on ByteSpike, now superseded by claude-opus-4-8 as the Claude flagship. It remains a strong choice for deep reasoning, long-document QA, and multi-document synthesis within its 200K-token context window. Pricing:

5.00 / 1M input,

25.00 / 1M output, $0.50 / 1M cache read — see the rate card.

Protocols

Protocol	Path
Anthropic Messages	`POST https://llm.bytespike.ai/v1/messages`
OpenAI Chat Completions	`POST https://llm.bytespike.ai/v1/chat/completions`

Quickstart

curl https://llm.bytespike.ai/v1/messages \
  -H "x-api-key: $BYTESPIKE_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "content-type: application/json" \
  -d '{
    "model": "claude-opus-4-7",
    "max_tokens": 2048,
    "thinking": { "type": "enabled", "budget_tokens": 1024 },
    "messages": [
      { "role": "user", "content": "Untangle this contract." }
    ]
  }'

Capabilities

Capability	Supported
Chat completions	✅
Streaming (SSE)	✅
Vision (image input)	✅
Tool use (function calling)	✅ parallel
Prompt caching (cache_control)	✅
Extended thinking	✅
Web search (web_search tool)	✅
JSON / structured output	✅
Context window	200K tokens

When to use

Deep reasoning — pair with thinking: { enabled, budget_tokens: 1024+ } for the extended chain-of-thought.
Long-document QA — legal/regulatory text and multi-doc synthesis within 200K.
Web search — research-style queries that need fresh facts.

When not to use:

New flagship work — claude-opus-4-8 is the current Opus and supersedes this model.
Default chat — claude-sonnet-4-6 is one-fifth the cost at the same 200K context.
High-volume classification — claude-haiku-4-5 is twenty times cheaper.

claude-opus-4-8 — current Opus flagship
claude-sonnet-4-6 — production mid-tier
claude-haiku-4-5 — small + cheap

claude-opus-4-8 claude-sonnet-4-6

​Protocols

​Quickstart

​Capabilities

​When to use

​Next

Protocols

Quickstart

Capabilities

When to use

Next