claude-opus-4-8 is the flagship of the Claude family on ByteSpike and Anthropic’s most capable model, released 2026-05-29. It is built for the hardest work: codebase-scale reasoning, multi-document synthesis, long-document review, and agentic tool use. The context window is 200K tokens. Pricing: $5.00 / 1M input tokens, $25.00 / 1M output tokens, $0.50 / 1M cache-read tokens (see the rate card).
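
To make the rates concrete, here is a minimal cost-estimate sketch. It assumes cache-read tokens are counted separately from regular input tokens and leaves out cache writes, which this page does not price; the function name is illustrative only, and the rate card remains the authority.

```python
# Per-million-token rates in USD, copied from the pricing line above.
RATES_PER_MILLION = {
    "input": 5.00,
    "output": 25.00,
    "cache_read": 0.50,
}


def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      cache_read_tokens: int = 0) -> float:
    """Estimate the USD cost of one request at the listed rates.

    Assumption: cache-read tokens are billed at their own rate and are
    not also included in input_tokens.
    """
    total = (
        input_tokens * RATES_PER_MILLION["input"]
        + output_tokens * RATES_PER_MILLION["output"]
        + cache_read_tokens * RATES_PER_MILLION["cache_read"]
    )
    return total / 1_000_000


# Example: 150K fresh input tokens, 4K output tokens, no cache reads.
print(f"${estimate_cost_usd(150_000, 4_000):.2f}")  # prints $0.85
```

At these rates output tokens cost five times as much as input tokens, so a tight max_tokens matters more than trimming the prompt.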

Protocols

Protocol                 Path
Anthropic Messages       POST https://llm.bytespike.ai/v1/messages
OpenAI Chat Completions  POST https://llm.bytespike.ai/v1/chat/completions
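
The Quickstart covers the Anthropic Messages path. A request to the OpenAI Chat Completions path might look like the sketch below. It only builds the request and does not send it. The bearer-token Authorization header is an assumption borrowed from the OpenAI API and is not confirmed on this page, and the helper name is hypothetical.

```python
import json
import urllib.request

CHAT_COMPLETIONS_URL = "https://llm.bytespike.ai/v1/chat/completions"


def build_chat_completions_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completions request."""
    body = {
        "model": "claude-opus-4-8",
        "max_tokens": 1024,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        CHAT_COMPLETIONS_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Assumption: bearer auth as in the OpenAI API; check the
            # ByteSpike authentication docs before relying on this.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_chat_completions_request("Untangle this contract.", "test-key")
print(request.get_method(), request.full_url)
```

To send it, pass the request to urllib.request.urlopen and decode the JSON body. In the OpenAI response format the reply text is at choices[0].message.content.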

Quickstart

curl https://llm.bytespike.ai/v1/messages \
  -H "x-api-key: $BYTESPIKE_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "content-type: application/json" \
  -d '{
    "model": "claude-opus-4-8",
    "max_tokens": 2048,
    "thinking": { "type": "enabled", "budget_tokens": 1024 },
    "messages": [
      { "role": "user", "content": "Untangle this contract." }
    ]
  }'
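
With thinking enabled, the Anthropic Messages format returns content as a list of typed blocks, with thinking blocks separate from text blocks. The sketch below pulls the two apart. It assumes ByteSpike returns the standard Anthropic response shape; the sample response is hand-written for illustration and is not real model output.

```python
def split_thinking_and_text(message: dict) -> tuple[str, str]:
    """Return (thinking, answer) from a Messages-style response dict."""
    thinking_parts = []
    text_parts = []
    for block in message.get("content", []):
        if block.get("type") == "thinking":
            thinking_parts.append(block.get("thinking", ""))
        elif block.get("type") == "text":
            text_parts.append(block.get("text", ""))
        # Other block types (for example tool_use) are ignored here.
    return "\n".join(thinking_parts), "".join(text_parts)


# Hypothetical response, written by hand to show the assumed shape.
sample_response = {
    "type": "message",
    "role": "assistant",
    "content": [
        {"type": "thinking", "thinking": "Identify the parties first."},
        {"type": "text", "text": "The contract has three obligations."},
    ],
    "stop_reason": "end_turn",
}

thinking, answer = split_thinking_and_text(sample_response)
print(answer)  # prints The contract has three obligations.
```

Note that budget_tokens counts toward max_tokens, so the Quickstart request leaves roughly half of its 2048-token limit for the visible answer.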

Capabilities

Capability                       Supported
Chat completions
Streaming (SSE)
Vision (image input)
Tool use (function calling)      ✅ parallel
Prompt caching (cache_control)
Extended thinking
Web search (web_search tool)
JSON / structured output
Context window                   200K tokens
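
As an illustration of the cache_control row, the sketch below builds a Messages payload that marks a long reference document as cacheable, so later requests with the same prefix can be billed at the cache-read rate. The field layout follows the Anthropic Messages API. That ByteSpike passes it through unchanged is an assumption, and the helper name is hypothetical.

```python
import json


def build_cached_payload(reference_text: str, question: str) -> dict:
    """Build a Messages payload whose system block is marked for caching."""
    return {
        "model": "claude-opus-4-8",
        "max_tokens": 1024,
        "system": [
            {
                "type": "text",
                "text": reference_text,
                # Everything up to and including this block is eligible
                # for caching; only the user question changes per call.
                "cache_control": {"type": "ephemeral"},
            }
        ],
        "messages": [{"role": "user", "content": question}],
    }


payload = build_cached_payload("<long contract text>", "List the termination clauses.")
print(json.dumps(payload, indent=2))
```

Keep the cached text byte-for-byte identical across calls and put the varying question last, because a cache hit requires the prefix to match exactly.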

When to use

  • Codebase-scale reasoning — feed a repo into the 200K window, ask for an audit or a multi-file refactor plan.
  • Hard agentic work — tool use plus extended thinking for multi-step tasks that need a real plan.
  • Multi-document synthesis — long-document QA and cross-document reasoning where Sonnet falls short.
  • Web search + reasoning — research-style queries that need both fresh facts and deep analysis.

When not to use

  • Default chat at scale — claude-sonnet-4-6 is one-fifth the cost at the same context window.
  • High-volume classification — claude-haiku-4-5 is far cheaper for simple, repetitive calls.
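
The guidance above can be sketched as a small routing table. The task categories are hypothetical labels invented for this example; only the three model names come from this page.

```python
from enum import Enum


class Task(Enum):
    # Hypothetical task labels for illustration only.
    CODEBASE_REASONING = "codebase_reasoning"
    AGENTIC = "agentic"
    MULTI_DOCUMENT = "multi_document"
    RESEARCH = "research"
    DEFAULT_CHAT = "default_chat"
    CLASSIFICATION = "classification"


MODEL_FOR_TASK = {
    Task.CODEBASE_REASONING: "claude-opus-4-8",
    Task.AGENTIC: "claude-opus-4-8",
    Task.MULTI_DOCUMENT: "claude-opus-4-8",
    Task.RESEARCH: "claude-opus-4-8",
    Task.DEFAULT_CHAT: "claude-sonnet-4-6",
    Task.CLASSIFICATION: "claude-haiku-4-5",
}


def pick_model(task: Task) -> str:
    """Return the model this page suggests for the given task category."""
    return MODEL_FOR_TASK[task]


print(pick_model(Task.CLASSIFICATION))  # prints claude-haiku-4-5
```

A static table keeps the choice auditable. A production router would more likely decide from measured quality and cost than from a fixed label.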
