Submit Video Task
Video Series
Submit Video Task
POST
Submit Video Task
Introduction
The submit video task API is used to create a new video generation task. Upon successful submission, it returns a task ID that you can use to query the task status. Important Note: Video generation is an asynchronous task. You need to first submit a task to get a task ID, then poll the task status until it succeeds.Authentication
Bearer Token, e.g.
Bearer sk-xxxxxxxxxxRequest Parameters
Model identifier, supported models and features:Sora 2 Series:
sora-2- Supports text-to-video, image-to-video, video-to-video (Remix mode)
veo-3.0-fast-generate-001- Text-to-video (first frame mode)veo-3.1-fast-generate-preview- Text-to-video (first frame mode, first/last frame mode)
wan2.5-t2v-preview- Text-to-videowan2.5-i2v-preview- Image-to-video (first frame mode)
doubao-seedance-1-0-lite-t2v-250428- Text-to-videodoubao-seedance-1-0-lite-i2v-250428- Image-to-video (first frame mode, first/last frame mode, reference image mode)doubao-seedance-1-0-pro-250528- Text-to-video (first frame mode)doubao-seedance-1-5-pro-251215- Text-to-video, image-to-video (first frame mode, first/last frame mode), supports audio generationdoubao-seedance-1-5-pro-251215-noAudio- Text-to-video, image-to-video (first frame mode, first/last frame mode), no audio generation
Video generation prompt, describing scene actions and settings. Note: Doubao Seedance series models do not require this field, the prompt should be written directly in the
text field of the metadata.content arrayReference image for image-to-video (supports Base64 or URL format)
Video duration (seconds), different models support different durations
Video resolution:
480p, 720p, 1080p, 4kAspect ratio:
16:9, 9:16, 1:1, 4:3, 3:4, 21:9, adaptive (adaptive, only supported by some models)Model-Specific Parameters
Different models support different specific parameters. Below are detailed descriptions by model series:- Sora 2
- Veo
- Ali Wanxiang
- Doubao Seedance
Video duration (seconds), supports:
4, 8, 12Video resolution, supports:
720x1280 (portrait), 1280x720 (landscape)Reference image (supports URL or Base64 format), for image-to-video
Remix mode: Regenerate based on existing video ID (must start with
video_)Usage Examples
- Sora 2
- Veo
- Ali Wanxiang
- Doubao Seedance
1. Text-to-Video (Basic Example)2. Text-to-Video (Landscape, 8 seconds)3. Image-to-Video (First Frame Mode)4. Remix Mode (Video-to-Video)
- The
contentarray must be placed in themetadataobject - All parameters are passed through special markers in the prompt text (e.g.
--ratio 16:9) - Images must be placed in the
contentarray using theimage_urltype - First/last frame mode requires two images, marked with
role: "first_frame"androle: "last_frame"respectively - Reference image mode requires using markers like
[图1],[图2]in the prompt to reference images, with images markedrole: "reference_image" doubao-seedance-1-0-lite-t2v-250428does not support image input andadaptiveaspect ratiodoubao-seedance-1-0-pro-250528only supports first frame modedoubao-seedance-1-5-pro-251215automatically generates audio, suitable for scenes requiring background musicdoubao-seedance-1-5-pro-251215-noAudiodoes not generate audio, faster rendering, suitable for scenes requiring post-production audio- 1.5 pro series supports text-to-video and image-to-video (first frame mode, first/last frame mode), does not support reference image mode
- 1.5 pro series resolution limit: Only supports
480pand720p(does not support 1080p) - 1.5 pro series duration range: Supports any integer between 4-12 seconds
- First/last frame mode: Requires providing two images, marked with
role: "first_frame"androle: "last_frame"respectively
Response Example
Related APIs
Query Video Task
Query video generation task status and results
Download Video
Download completed video files (Sora 2 only)
