Skip to main content
claude-opus-4-7 is the prior-generation Opus on ByteSpike, now superseded by claude-opus-4-8 as the Claude flagship. It remains a strong choice for deep reasoning, long-document QA, and multi-document synthesis within its 200K-token context window. Pricing: 5.00/1Minput,5.00 / 1M input, 25.00 / 1M output, $0.50 / 1M cache read — see the rate card.

Protocols

ProtocolPath
Anthropic MessagesPOST https://llm.bytespike.ai/v1/messages
OpenAI Chat CompletionsPOST https://llm.bytespike.ai/v1/chat/completions

Quickstart

curl https://llm.bytespike.ai/v1/messages \
  -H "x-api-key: $BYTESPIKE_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "content-type: application/json" \
  -d '{
    "model": "claude-opus-4-7",
    "max_tokens": 2048,
    "thinking": { "type": "enabled", "budget_tokens": 1024 },
    "messages": [
      { "role": "user", "content": "Untangle this contract." }
    ]
  }'

Capabilities

CapabilitySupported
Chat completions
Streaming (SSE)
Vision (image input)
Tool use (function calling)✅ parallel
Prompt caching (cache_control)
Extended thinking
Web search (web_search tool)
JSON / structured output
Context window200K tokens

When to use

  • Deep reasoning — pair with thinking: { enabled, budget_tokens: 1024+ } for the extended chain-of-thought.
  • Long-document QA — legal/regulatory text and multi-doc synthesis within 200K.
  • Web search — research-style queries that need fresh facts.
When not to use:
  • New flagship work — claude-opus-4-8 is the current Opus and supersedes this model.
  • Default chat — claude-sonnet-4-6 is one-fifth the cost at the same 200K context.
  • High-volume classification — claude-haiku-4-5 is twenty times cheaper.

Next