gpt-5-5 - ByteSpike

gpt-5-5 is the current flagship of OpenAI’s GPT-5 family. Vision-capable on image input, reasoning depth tunable via reasoning_effort, broad tool / function support, JSON mode and structured outputs. On the gateway, it’s the default landing spot when DOSIA’s text-writing-tools.chat_with is asked to “use a different LLM” — so a user prompt like “have GPT rewrite this paragraph” routes here. Pricing:

5.00 / 1M input,

30.00 / 1M output, $0.50 / 1M cache read — see the rate card. Failures don’t bill.

Protocols

Protocol	Path
OpenAI Chat Completions	`POST https://llm.bytespike.ai/v1/chat/completions`

Quickstart

curl https://llm.bytespike.ai/v1/chat/completions \
  -H "Authorization: Bearer $BYTESPIKE_API_KEY" \
  -H "content-type: application/json" \
  -d '{
    "model": "gpt-5-5",
    "messages": [
      { "role": "user", "content": "Untangle this contract paragraph." }
    ],
    "reasoning_effort": "medium"
  }'

Capabilities

Capability	Supported
Chat Completions	✅
Streaming (SSE)	✅
Vision (image input)	✅
Tools / function calling	✅ parallel
JSON mode	✅
Structured outputs	✅
`reasoning_effort` low / medium / high	✅
Context window	128K tokens
Modality	chat + vision
Capability bucket	`vision` + `external_chat`

When to use

Default GPT-5 family pick — when “use GPT-5” is the brief, this is the alias.
DOSIA chat_with — main brain calls this when the user asks for a non-Claude voice (“have GPT rewrite this”).
Reasoning-heavy chat — pair with reasoning_effort: "high" for harder analysis.

When not to use:

High-volume / budget-bound — drop to gpt-5-4-mini or gpt-5-4-nano.
You want Claude’s tool_use / cache_control block model — go to claude-sonnet-4-6.

gpt-5-4 — previous GPT-5 family flagship
claude-opus-4-8 — Anthropic alternative for hard reasoning
DOSIA MCP integration — chat_with tool surface

gpt-5-4 gemini-2-5-flash

​Protocols

​Quickstart

​Capabilities

​When to use

​Next

Protocols

Quickstart

Capabilities

When to use

Next