Kling AI Model Catalog
Overview
Kling AI offers multiple model versions across video generation, image generation, lip sync, virtual try-on, and effects. Each version trades off quality, speed, and cost. This skill is the reference for choosing the right model.
Video Generation Models
| Model ID |
Supports |
Max Duration |
Resolution |
Speed |
Quality |
kling-v1 |
T2V, I2V |
10s |
720p |
Fast |
Good |
kling-v1-5 |
I2V only |
10s |
1080p |
Fast |
Better |
kling-v1-6 |
T2V, I2V |
10s |
1080p |
Medium |
Better+ |
kling-v2-master |
T2V, I2V |
10s |
1080p |
Medium |
High |
kling-v2-1 |
I2V only |
10s |
1080p |
Medium |
High |
kling-v2-1-master |
T2V, I2V |
10s |
1080p |
Medium |
High |
kling-v2-5-turbo |
T2V, I2V |
10s |
1080p 30fps |
Fast |
High |
kling-v2-6 |
T2V, I2V |
10s |
1080p 30-48fps |
Medium |
Highest |
T2V = text-to-video, I2V = image-to-video
Kling v2.5 Turbo (Recommended for Speed)
- 40% faster than v2.0
- Up to 1080p at 30 FPS
- Best cost/quality ratio for production pipelines
Kling v2.6 (Recommended for Quality)
- Native audio generation (voice, SFX, ambient in one pass)
- 1080p at 30-48 FPS
- Set
motion_has_audio: true for synchronized audio
Image Generation Models (Kolors)
| Model ID |
Purpose |
Resolution |
kolors-v1-5 |
Face/subject reference |
Up to 2048x2048 |
kolors-v2-0 |
Image restyle |
Up to 2048x2048 |
kolors-v2-1 |
Text-to-image |
Up to 2048x2048 |
Specialty Models
| Feature |
Endpoint |
Model Versions |
| Lip Sync |
/v1/videos/lip-sync |
v1.6+ |
| Virtual Try-On |
/v1/images/kolors-virtual-try-on |
v1.5 |
| Video Extension |
/v1/videos/video-extend |
All video models |
| Effects |
/v1/videos/effects |
v1.6+ |
| Motion Control |
T2V/I2V with camera_control |
v1.6+ |
Mode Selection
Every video generation accepts a mode parameter:
| Mode |
Credits (5s) |
Credits (10s) |
Use Case |
standard |
10 |
20 |
Drafts, previews, iteration |
professional |
35 |
70 |
Final output, client delivery |
Model Selection Decision Tree
Need fastest generation?
→ kling-v2-5-turbo + standard mode
Need highest quality?
→ kling-v2-6 + professional mode
Need audio in the video?
→ kling-v2-6 with motion_has_audio: true
Image-to-video only?
→ kling-v2-1 (optimized for I2V)
Budget-conscious production?
→ kling-v2-5-turbo + standard mode (10 credits/5s)
Legacy compatibility?
→ kling-v1-6 (stable, well-documented)
API Usage
# Specify model in any video generation request
response = requests.post(f"{BASE}/videos/text2video", headers=headers, json={
"model_name": "kling-v2-6", # model version
"mode": "professional", # standard or professional
"prompt": "A futuristic city at sunset with flying cars",
"duration": "5",
"aspect_ratio": "16:9",
})
Aspect Ratios (All Models)
| Ratio |
Use Case |
16:9 |
Landscape, YouTube, presentations |
9:16 |
Vertical, TikTok, Reels, Stories |
1:1 |
Square, Instagram, thumbnails |
4:3 |
Classic TV, presentations |
3:4 |
Portrait photos |
3:2 |
Standard photography |
2:3 |
Tall portrait |
21:9 |
Ultra-wide, cinematic |
Resources