Kling v3
Kling v3 Video Generation
- Async processing mode, returns task ID for subsequent queries
- Supports text-to-video, image-to-video (first frame/first-last frame control)
- Supports standard mode (720P), professional mode (1080P), and 4K mode
- Text-to-video supports 15-second duration
- Supports generating videos with audio
POST
Documentation Index
Fetch the complete documentation index at: https://docs.apimart.ai/llms.txt
Use this file to discover all available pages before exploring further.
Authorization
All API endpoints require Bearer Token authenticationGet your API Key:Visit the API Key Management Page to get your API KeyAdd it to the request header:
Request Parameters
Video generation model nameSupported models:
kling-v3- Kling v3 (recommended)
Text promptDescribe scenes, actions, styles in detail for better generation results. English prompts are recommended.Example:
"a golden retriever running on the beach, sunset, cinematic"Generation modeOptions:
std- Standard mode (720P)pro- Professional mode (1080P)4k- 4K mode
stdDefault:
5
Video duration (seconds)Range: 3-15 (minimum 3 seconds, maximum 15 seconds)⚠️ Note: Must be a plain number (e.g. 6), do not add quotes, otherwise an error will occurVideo aspect ratioOptions:
16:9- Landscape9:16- Portrait1:1- Square
16:9Negative prompt to exclude unwanted contentExample:
"blurry, low quality, distorted"Image URL array for image-to-video generation
- Pass 1 image: used as first frame
- Pass 2 images: automatically assigned as first frame + last frame
["https://example.com/first.jpg"]Whether to add watermark
Whether to generate video with audio
Whether to enable multi-shot mode.
truefalse
Shot split method:
customize / intelligence.Required when multi_shot=true.Per-shot information, such as prompt and duration.Define shot order, prompt, and duration via Required when
index, prompt, and duration.- Supports 1 to 6 shots
- Maximum content length per shot is 512
- Each shot duration must be >= 1 and cannot exceed total task duration
- Sum of all shot durations must equal top-level
duration
multi_shot=true and shot_type=customize.Reference subject list, up to 3 subjects.Notes:
- Create on the fly via
name,description,element_input_urls
name,description, andelement_input_urlsare required for on-the-fly creationelement_input_urls: 2-4 images per subject (first as frontal image, rest as references)- Reference elements in
promptwith@name, e.g."@element_dog chasing @element_cat on grass"
Parameter Constraints
mode=4kis supported forkling-v3image_urlssupports up to 2 images (1 first frame, 2 first+last frames)- Last-frame-only input is invalid (must include first frame)
- When
multi_shot=true, top-levelpromptcan be omitted multi_promptsupports up to 6 shots, andindexmust start from 1 and be continuous
Feature Support Matrix
| Type | Feature | std 5s | std 10s | std 15s | pro 5s | pro 10s |
|---|---|---|---|---|---|---|
| Text-to-Video | Generation | ✅ | ✅ | ✅ | ✅ | ✅ |
| Image-to-Video | Generation | ✅ | ✅ | ✅ | ✅ | ✅ |
| Image-to-Video | First Frame | ✅ | ✅ | ✅ | ✅ | ✅ |
| Image-to-Video | Last Frame | ✅ | ✅ | ✅ | ✅ | ✅ |
Text-to-Video vs Image-to-Video
The system automatically determines the mode based on whetherimage_urls is provided: no images means text-to-video, with images means image-to-video.
| Parameter | Text-to-Video | Image-to-Video |
|---|---|---|
prompt | ✅ Required | ✅ Required |
image_urls | ❌ Not used | ✅ Required (1-2 images) |
negative_prompt | ✅ Optional | ✅ Optional |
mode | ✅ Optional | ✅ Optional |
duration | ✅ Optional (3-15) | ✅ Optional (3-15) |
aspect_ratio | ✅ Optional | ⚠️ May be overridden by image ratio |
watermark | ✅ Optional | ✅ Optional |
audio | ✅ Optional | ✅ Optional |
Response
Response status code, 200 on success
Response data array
Use Cases
Case 1: Text-to-Video (Standard Mode)
Case 2: Text-to-Video (Pro Mode + Negative Prompt)
Case 3: Text-to-Video (15 seconds)
Case 4: Image-to-Video (First Frame)
Case 5: Image-to-Video (First + Last Frame Control)
Case 6: Generate Video with Audio
Case 7: Multi-Shot Storyboard (customize, 15 seconds, portrait with audio)
Query Task ResultsVideo generation is an async task that returns a
task_id upon submission. Use the Get Task Status endpoint to query generation progress and results.