veo3.1-fast is the Fast tier of veo3.1. Trade-offs versus the full tier: no native audio option, slightly looser fidelity on hard motion, and a meaningfully lower per-second cost. It is the right default for programmatic or high-volume video work where Sora-grade quality isn't required.
Pricing: 0.05 / second.
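As a rough illustration of the per-second pricing, the sketch below estimates the cost of a clip or a batch of clips. The function name `estimate_cost` is hypothetical, and the source does not state the currency, so the result is in whatever unit the 0.05 figure uses.

```python
from decimal import Decimal

# Per-second price quoted above; the currency unit is not stated in the source.
PRICE_PER_SECOND = Decimal("0.05")


def estimate_cost(duration_seconds, clips=1):
    """Return the estimated cost of `clips` videos of `duration_seconds` each."""
    return PRICE_PER_SECOND * duration_seconds * clips


print(estimate_cost(5))              # one 5-second clip
print(estimate_cost(10, clips=200))  # a batch of 200 ten-second clips
```

`Decimal` is used here only to avoid binary floating-point rounding in the estimate. Actual billing may round or meter differently.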
Protocols
| Protocol | Path | Purpose |
|---|---|---|
| OpenAI Video — submit | POST https://llm.bytespike.ai/v1/videos/generations | enqueues; returns task_id |
| OpenAI Video — poll | GET https://llm.bytespike.ai/v1/videos/tasks/{task_id} | returns status and result_url |
Quickstart
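A minimal sketch of the submit-then-poll flow from the Protocols table, using only the Python standard library. Several details are assumptions not confirmed by this page: Bearer-token authentication, the `model` and `prompt` body fields, the `"1280x720"` spelling of the size, the failure status values, and the `BYTESPIKE_API_KEY` environment variable. The `task_id`, `status`, and `result_url` fields and both URLs come from the table.

```python
import json
import os
import time
import urllib.request

BASE_URL = "https://llm.bytespike.ai/v1"


def build_submit_request(prompt, api_key, duration_seconds=5, size="1280x720"):
    """Build the POST request that enqueues a generation task."""
    # Assumed body shape: "model" and "prompt" are not documented on this page.
    body = {
        "model": "veo3.1-fast",
        "prompt": prompt,
        "duration_seconds": duration_seconds,
        "size": size,
    }
    return urllib.request.Request(
        f"{BASE_URL}/videos/generations",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_json(request):
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def fetch_task(task_id, api_key):
    """GET the current state of a task."""
    request = urllib.request.Request(
        f"{BASE_URL}/videos/tasks/{task_id}",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )
    return send_json(request)


def wait_for_result(task_id, fetch, interval=5.0, max_polls=60,
                    failed_statuses=("failed", "error")):
    """Poll with `fetch(task_id)` until a result_url appears.

    The values in `failed_statuses` are guesses; adjust them to the
    statuses the service actually returns.
    """
    for _ in range(max_polls):
        task = fetch(task_id)
        if task.get("result_url"):
            return task["result_url"]
        if task.get("status") in failed_statuses:
            raise RuntimeError(f"task {task_id} ended with status {task.get('status')!r}")
        time.sleep(interval)
    raise TimeoutError(f"task {task_id} not finished after {max_polls} polls")


def main():
    api_key = os.environ["BYTESPIKE_API_KEY"]  # hypothetical variable name
    task = send_json(build_submit_request("A paper boat drifting down a rainy street", api_key))
    print(wait_for_result(task["task_id"], lambda tid: fetch_task(tid, api_key)))
```

Calling `main()` submits one 5-second clip and prints the result URL once the task finishes. Passing `fetch` as a parameter keeps the polling loop testable without network access.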
Capabilities
| Capability | Supported |
|---|---|
| Text-to-video | ✅ |
| Image-to-video (with source_image) | ✅ |
| Audio generation | — (use veo3.1 for audio) |
| duration_seconds 5 / 10 | ✅ |
| size 1280×720 | ✅ |
| Modality | video |
| Capability bucket | video_generate |
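The limits in the table can be checked client-side before a request is sent. The sketch below is one way to do that. `build_payload` is a hypothetical helper, the `model` and `prompt` field names and the `"1280x720"` wire spelling are assumptions, and the expected format of `source_image` (URL, base64, or something else) is not specified on this page.

```python
ALLOWED_DURATIONS = (5, 10)        # seconds, from the capability table
ALLOWED_SIZES = ("1280x720",)      # assumed wire spelling of 1280×720


def build_payload(prompt, duration_seconds=5, size="1280x720", source_image=None):
    """Return a request body, rejecting values the table does not list."""
    if duration_seconds not in ALLOWED_DURATIONS:
        raise ValueError(f"duration_seconds must be one of {ALLOWED_DURATIONS}")
    if size not in ALLOWED_SIZES:
        raise ValueError(f"size must be one of {ALLOWED_SIZES}")
    payload = {
        "model": "veo3.1-fast",
        "prompt": prompt,
        "duration_seconds": duration_seconds,
        "size": size,
    }
    # Including source_image switches the request from text-to-video to image-to-video.
    if source_image is not None:
        payload["source_image"] = source_image
    return payload
```

For example, `build_payload("city timelapse", duration_seconds=10)` returns a text-to-video body, while `build_payload("city timelapse", duration_seconds=7)` raises `ValueError`.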
When to use
- Volume / programmatic video — short social-card clips, A/B variants, demo placeholders.
- No audio needed — explainer-style clips that will be dubbed or scored separately.
- Internal-facing creative — quick visualizations for design review.
- If audio matters in the deliverable, use veo3.1 instead.
- For hero / final marketing creative, use sora2 or sora2-pro instead.
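For pipelines that choose a model per job, the guidance above can be encoded as a small routing function. This is only a sketch: `pick_video_model` is a hypothetical name, checking the hero-creative case before the audio case is an assumption, and the choice between sora2 and sora2-pro is left to the caller.

```python
def pick_video_model(needs_audio=False, hero_creative=False, hero_model="sora2"):
    """Map job requirements to a model name following the list above."""
    if hero_creative:
        return hero_model      # "sora2" or "sora2-pro"
    if needs_audio:
        return "veo3.1"        # full tier with native audio
    return "veo3.1-fast"       # default for volume / programmatic work


print(pick_video_model())                  # veo3.1-fast
print(pick_video_model(needs_audio=True))  # veo3.1
```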
Next
- veo3.1: full tier with audio
- seedance2-fast: ByteDance fast tier
- Multimodal endpoints: overview