gpt-5-4 is the GPT-5 production workhorse — mid-tier on the price curve, full feature coverage. Reach for it when you want the GPT-5 family’s reasoning capability and tighter structured-output validation at a mid-tier rate.
Pricing: 15.00 / 1M output, $0.25 / 1M cache read — see the rate card.
Protocols
| Protocol | Path |
|---|---|
| OpenAI Chat Completions | POST https://llm.bytespike.ai/v1/chat/completions |
| OpenAI Responses | POST https://llm.bytespike.ai/v1/responses |
Quickstart
Capabilities
| Capability | Supported |
|---|---|
| Chat Completions | ✅ |
| Responses API | ✅ |
| Streaming (SSE) | ✅ |
| Vision (image input) | ✅ |
| Tool use (function calling) | ✅ parallel |
| JSON mode | ✅ |
| Structured outputs (json_schema) | ✅ |
| Reasoning effort (low/medium/high) | ✅ |
| Web search | ✅ |
| Context window | 128K tokens |
When to use
- Production workhorse — when you need the GPT-5 quality envelope with vision + tools + reasoning, at a mid-tier price.
- Structured outputs.
response_format: { type: "json_schema" }returns schema-validated JSON; ByteSpike does not modify the schema. - Codex-style clients. Hit the Responses API instead of Chat Completions for the structured-reasoning shape.
- Fresh-fact queries. Add
tools: [{ "type": "web_search" }].
- Flagship reasoning — use
gpt-5-5, the current GPT-5 family flagship. - High-volume classification —
gpt-5-4-miniis cheaper.
Next
- gpt-5-4-mini — small-mid of the GPT-5.4 family
- gpt-5-4-nano — smallest GPT-5.4 tier
- gpt-5-5 — current GPT-5 flagship