Skip to main content
Vendor: DeepSeek Model ID: deepseek-v3-2 Capability: 64K context · tool use · streaming · structured output Pricing: per-token, mid tier (live rate) DeepSeek V3.2 was the last 3-series model and it earned a reputation for code generation that punched above its price tier. Still a fine production model for code-heavy workflows that have already validated against this version. For new work, prefer DeepSeek V4 Pro or the faster V4 Flash.

Request

curl https://llm.bytespike.ai/v1/chat/completions \
  -H "Authorization: Bearer $BYTESPIKE_API_KEY" \
  -H "content-type: application/json" \
  -d '{
    "model": "deepseek-v3-2",
    "messages": [{"role": "user", "content": "Write a Python implementation of trie."}]
  }'

Body parameters

FieldTypeRequiredDefaultNotes
modelstringyesdeepseek-v3-2
messagesarrayyes
max_tokensintegernomodel maxMax: 8192.
temperaturenumberno1.0
toolsarraynoFunction calling supported.
response_formatobjectnoJSON mode.
streambooleannofalseSSE streaming.

Response

{
  "id": "chatcmpl-…",
  "object": "chat.completion",
  "model": "deepseek-v3-2",
  "choices": [{"index": 0, "message": {"role": "assistant", "content": "class Trie..."}, "finish_reason": "stop"}],
  "usage": {"prompt_tokens": 28, "completion_tokens": 412, "total_tokens": 440}
}

Code examples

curl https://llm.bytespike.ai/v1/chat/completions \
  -H "Authorization: Bearer $BYTESPIKE_API_KEY" \
  -H "content-type: application/json" \
  -d '{"model": "deepseek-v3-2", "messages": [{"role": "user", "content": "Implement trie in Python."}]}'

Streaming + caching

"stream": true for SSE. Automatic prompt caching on stable prefixes.

Errors

CodeTriggerBilled?
400 / 401 / 402 / 422 / 429StandardNo
5xxUpstreamNo (auto-retry)

When to use

Limits

LimitValue
Context window64K tokens
Max output8192 tokens
Supports tool useYes
Supports visionNo
Supports streamingYes
Supports prompt cachingAutomatic