task_id, then poll tasks/query
until the result lands. Typical latency 60–120 seconds for a 5-second clip.
Slug: sora-2 · Capability: 1080p text-to-video, up to 10s · Pricing: per-second
(live rate)
Submit
Body
| Field | Type | Required | Notes |
|---|---|---|---|
model | string | yes | sora-2 |
prompt | string | yes | Text prompt. |
duration_seconds | integer | yes | 1–10 |
resolution | string | no | 720p (default), 1080p |
aspect_ratio | string | no | 16:9 (default), 9:16, 1:1 |
seed | integer | no | Reproducibility. |
image_init | string | no | URL of init image for image-to-video. |
Response
tasks/cancel before the status
flips to running for a free abort.
Poll
When to use
- Need 1080p output → Sora 2 or Sora 2 Pro
- Need vertical (9:16) for short-form social → Sora 2 supports it natively
- Latency-sensitive → consider Veo 3.1 Fast or Seedance Fast for ~30s turn-around at lower fidelity