claude-opus-4-8 is the flagship of the Claude family on ByteSpike — Anthropic’s most capable model, released 2026-05-29. It is built for the hardest work: codebase-scale reasoning, multi-document synthesis, long-document review, and agentic tool use. Context window is 200K tokens.
Pricing: 25.00 / 1M output, $0.50 / 1M cache read — see the rate card.
Protocols
| Protocol | Path |
|---|---|
| Anthropic Messages | POST https://llm.bytespike.ai/v1/messages |
| OpenAI Chat Completions | POST https://llm.bytespike.ai/v1/chat/completions |
Quickstart
Capabilities
| Capability | Supported |
|---|---|
| Chat completions | ✅ |
| Streaming (SSE) | ✅ |
| Vision (image input) | ✅ |
| Tool use (function calling) | ✅ parallel |
| Prompt caching (cache_control) | ✅ |
| Extended thinking | ✅ |
| Web search (web_search tool) | ✅ |
| JSON / structured output | ✅ |
| Context window | 200K tokens |
When to use
- Codebase-scale reasoning — feed a repo into the 200K window, ask for an audit or a multi-file refactor plan.
- Hard agentic work — tool use plus extended thinking for multi-step tasks that need a real plan.
- Multi-document synthesis — long-document QA and cross-document reasoning where Sonnet falls short.
- Web search + reasoning — research-style queries that need both fresh facts and deep analysis.
- Default chat at scale —
claude-sonnet-4-6is one-fifth the cost at the same context window. - High-volume classification —
claude-haiku-4-5is far cheaper for simple, repetitive calls.
Next
- claude-opus-4-7 — prior-generation Opus
- claude-sonnet-4-6 — production mid-tier
- claude-haiku-4-5 — small + cheap