claude-opus-4-7 is the prior-generation Opus on ByteSpike, now superseded by claude-opus-4-8 as the Claude flagship. It remains a strong choice for deep reasoning, long-document QA, and multi-document synthesis within its 200K-token context window.
Pricing: 25.00 / 1M output, $0.50 / 1M cache read — see the rate card.
Protocols
| Protocol | Path |
|---|---|
| Anthropic Messages | POST https://llm.bytespike.ai/v1/messages |
| OpenAI Chat Completions | POST https://llm.bytespike.ai/v1/chat/completions |
Quickstart
Capabilities
| Capability | Supported |
|---|---|
| Chat completions | ✅ |
| Streaming (SSE) | ✅ |
| Vision (image input) | ✅ |
| Tool use (function calling) | ✅ parallel |
| Prompt caching (cache_control) | ✅ |
| Extended thinking | ✅ |
| Web search (web_search tool) | ✅ |
| JSON / structured output | ✅ |
| Context window | 200K tokens |
When to use
- Deep reasoning — pair with
thinking: { enabled, budget_tokens: 1024+ }for the extended chain-of-thought. - Long-document QA — legal/regulatory text and multi-doc synthesis within 200K.
- Web search — research-style queries that need fresh facts.
- New flagship work —
claude-opus-4-8is the current Opus and supersedes this model. - Default chat —
claude-sonnet-4-6is one-fifth the cost at the same 200K context. - High-volume classification —
claude-haiku-4-5is twenty times cheaper.
Next
- claude-opus-4-8 — current Opus flagship
- claude-sonnet-4-6 — production mid-tier
- claude-haiku-4-5 — small + cheap