gpt-5-5 is the current flagship of OpenAI’s GPT-5 family. Vision-capable on image input, reasoning depth tunable via reasoning_effort, broad tool / function support, JSON mode and structured outputs. On the gateway, it’s the default landing spot when DOSIA’s text-writing-tools.chat_with is asked to “use a different LLM” — so a user prompt like “have GPT rewrite this paragraph” routes here.
Pricing: 30.00 / 1M output, $0.50 / 1M cache read — see the rate card. Failures don’t bill.
Protocols
| Protocol | Path |
|---|---|
| OpenAI Chat Completions | POST https://llm.bytespike.ai/v1/chat/completions |
Quickstart
Capabilities
| Capability | Supported |
|---|---|
| Chat Completions | ✅ |
| Streaming (SSE) | ✅ |
| Vision (image input) | ✅ |
| Tools / function calling | ✅ parallel |
| JSON mode | ✅ |
| Structured outputs | ✅ |
reasoning_effort low / medium / high | ✅ |
| Context window | 128K tokens |
| Modality | chat + vision |
| Capability bucket | vision + external_chat |
When to use
- Default GPT-5 family pick — when “use GPT-5” is the brief, this is the alias.
- DOSIA
chat_with— main brain calls this when the user asks for a non-Claude voice (“have GPT rewrite this”). - Reasoning-heavy chat — pair with
reasoning_effort: "high"for harder analysis.
- High-volume / budget-bound — drop to
gpt-5-4-miniorgpt-5-4-nano. - You want Claude’s
tool_use/cache_controlblock model — go toclaude-sonnet-4-6.
Next
gpt-5-4— previous GPT-5 family flagshipclaude-opus-4-8— Anthropic alternative for hard reasoning- DOSIA MCP integration —
chat_withtool surface