Claude Opus 4.5 - ByteSpike

厂商： Anthropic Model ID： claude-opus-4-5 能力： 200K context · tool use · vision · prompt caching · streaming · extended thinking 价格： 按 token，Opus 档（实时价格） Opus 4.5 是 4 系列 Opus 家族里的第一款，在 200K token context 上以 Opus 品质交付深度长程推理。对新工作，建议选 Opus 4.8 —— 当前旗舰，输出可量化地更紧致、工具调用精度更高。4.5 保留是给已经针对它验证过的团队继续使用。

请求

curl https://llm.bytespike.ai/v1/messages \
  -H "x-api-key: $BYTESPIKE_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "content-type: application/json" \
  -d '{
    "model": "claude-opus-4-5",
    "max_tokens": 16384,
    "messages": [
      {"role": "user", "content": "Review this 150-page deposition for inconsistencies in the dates."}
    ]
  }'

Body 参数

字段	类型	是否必填	默认	说明
`model`	string	是	—	`claude-opus-4-5`
`messages`	array	是	—	对话历史。最多 200K token 输入。
`max_tokens`	integer	是	—	硬上限。本模型最大：32768。
`system`	string \| array	否	—	array 形式支持 `cache_control`。
`temperature`	number	否	1.0	范围 0.0–1.0。
`top_p`	number	否	1.0	Nucleus sampling。
`tools`	array	否	—	支持。
`tool_choice`	object	否	`{"type":"auto"}`	`auto` / `any` / `tool`（指定名）。
`thinking`	object	否	—	Extended-thinking 预算。见 Anthropic thinking 文档。
`stream`	boolean	否	false	SSE 流式。

响应

{
  "id": "msg_opus_…",
  "type": "message",
  "role": "assistant",
  "model": "claude-opus-4-5",
  "content": [
    {"type": "thinking", "thinking": "<extended reasoning trace>"},
    {"type": "text", "text": "Three date inconsistencies on pages 42, 87, and 131..."}
  ],
  "stop_reason": "end_turn",
  "usage": {
    "input_tokens": 168250,
    "output_tokens": 1872
  }
}

代码示例

curl https://llm.bytespike.ai/v1/messages \
  -H "x-api-key: $BYTESPIKE_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "content-type: application/json" \
  -d '{
    "model": "claude-opus-4-5",
    "max_tokens": 16384,
    "messages": [{"role": "user", "content": "Review this deposition for date inconsistencies."}]
  }'

Cache control

Cache control 是 4 系列中对成本最具杠杆效应的设置。大块 200K token context 下，缓存命中可以让重复 agent 轮次的成本下降 10 倍。缓存价见 pricing table。

{
  "model": "claude-opus-4-5",
  "system": [
    {
      "type": "text",
      "text": "<the 100K-token corpus you keep referring to>",
      "cache_control": {"type": "ephemeral"}
    }
  ],
  "messages": [...]
}

错误

Code	触发	是否计费
400	Body 校验失败（`max_tokens` 过大等）	否
401	key 缺失 / 已撤销	否
402	钱包用尽（Opus 比 Sonnet 触发得更快）	否
413	输入超过 200K token	否
429	速率限制	否
5xx	上游 provider 问题	否（自动重试信封）

何时使用

200K 窗口内的长程推理（法律评审、代码库审计、多文档综合）。
Sonnet 开始漏步骤的多步 prompt。
对新工作，建议选 Opus 4.8 —— 当前旗舰，输出更紧致。
中端成本 / 延迟，见 Sonnet 4.6。

限制

限制	值
Context window	200K tokens
Max output	32768 tokens
支持 tool use	是
支持 vision	是
支持 streaming	是
支持 prompt caching	是
支持 extended thinking	是

​请求

​Body 参数

​响应

​代码示例

​Cache control

​错误

​何时使用

​限制

请求

Body 参数

响应

代码示例

Cache control

错误

何时使用

限制