Vidu Q3(pro/turbo)
Vidu Q3(pro/turbo) Video Generation
- Async processing mode, returns task ID for subsequent queries
- Supports text-to-video, image-to-video, first-last frame video generation
- Supports 540p / 720p / 1080p resolution
- Duration range 1-16 seconds, audio enabled by default
POST
Documentation Index
Fetch the complete documentation index at: https://docs.apimart.ai/llms.txt
Use this file to discover all available pages before exploring further.
Authorization
All API endpoints require Bearer Token authenticationGet your API Key:Visit the API Key Management Page to get your API KeyAdd it to the request header:
Request Parameters
Video generation model nameSupported models:
viduq3-pro- Vidu Q3 Providuq3-turbo- Vidu Q3 Turbo
Text prompt, maximum 2000 charactersRequired for text-to-video. Optional for image-to-video and first-last frame modes.Example:
"A cat playing piano, camera slowly zooms in"Video duration (seconds)Range:
1 to 16Default: 5Video resolutionOptions:
540p- Standard definition720p- HD (default)1080p- Full HD
720pVideo aspect ratio (only for text-to-video mode)Options:
16:9- Landscape9:16- Portrait4:3- Traditional3:4- Portrait traditional1:1- Square
Image URL array for image-to-video generationThe system automatically determines the generation mode based on the number of images:
- 0 images (not provided): Text-to-video mode
- 1 image: Image-to-video mode (image used as starting frame)
- 2 images: First-last frame mode (first image = first frame, second image = last frame)
["https://example.com/photo.jpg"]Whether to generate audio (dialogue, sound effects)Default:
trueSet to false if you need a silent video.Random seed for reproducible resultsUsing the same seed with identical parameters will produce the same video output.
Auto Routing
The system automatically determines the generation mode based on the number of images inimage_urls:
| Images Count | Mode | Description |
|---|---|---|
| 0 (not provided) | Text-to-Video | Generate from text description only |
| 1 | Image-to-Video | Use the image as starting frame |
| 2 | First-Last Frame | First image = first frame, second image = last frame |
Parameter Support Matrix
| Parameter | Text-to-Video | Image-to-Video | First-Last Frame |
|---|---|---|---|
model | ✅ Required | ✅ Required | ✅ Required |
prompt | ✅ Required | Optional | Optional |
image_urls | - | ✅ 1 image | ✅ 2 images |
duration | ✅ 1-16s | ✅ 1-16s | ✅ 1-16s |
resolution | ✅ | ✅ | ✅ |
aspect_ratio | ✅ | - | - |
audio | ✅ | ✅ | ✅ |
seed | ✅ | ✅ | ✅ |
Response
Response status code, 200 on success
Response data array
Use Cases
Case 1: Text-to-Video
Case 2: Image-to-Video (Single Image)
Case 3: First-Last Frame Video
Case 4: Silent Video (Audio Disabled)
Query Task ResultsVideo generation is an async task that returns a
task_id upon submission. Use the Get Task Status endpoint to query generation progress and results.