Kling AI Text-to-Video Generation

v20260423

klingai-text-to-video

Utilize the Kling AI API to generate high-quality, cinematic videos directly from descriptive text prompts. This advanced tool supports professional modes, camera movement control (pan, tilt, zoom), negative prompting, and synchronized native audio, making it ideal for professionals building robust text-to-video pipelines and creative content generation.

Kling AI Text-to-Video Video Generation API AI Media

Get Skill

359 downloads

Overview

Kling AI Text-to-Video

Overview

Generate videos from text prompts using the /v1/videos/text2video endpoint. Supports models v1 through v2.6, standard/professional modes, camera control, negative prompts, and native audio (v2.6+).

Endpoint: POST https://api.klingai.com/v1/videos/text2video

Request Parameters

Parameter	Type	Required	Description
`model_name`	string	Yes	Model version (see model catalog)
`prompt`	string	Yes	Video description, max 2500 chars
`negative_prompt`	string	No	What to exclude from generation
`duration`	string	Yes	`"5"` or `"10"` seconds
`aspect_ratio`	string	No	`"16:9"` (default), `"9:16"`, `"1:1"`, etc.
`mode`	string	No	`"standard"` (default) or `"professional"`
`cfg_scale`	float	No	Prompt adherence (0.0-1.0, default 0.5)
`camera_control`	object	No	Camera movement config
`callback_url`	string	No	Webhook URL for completion notification

Complete Example — Python

import jwt, time, os, requests

BASE = "https://api.klingai.com/v1"

def get_headers():
    ak, sk = os.environ["KLING_ACCESS_KEY"], os.environ["KLING_SECRET_KEY"]
    token = jwt.encode(
        {"iss": ak, "exp": int(time.time()) + 1800, "nbf": int(time.time()) - 5},
        sk, algorithm="HS256", headers={"alg": "HS256", "typ": "JWT"}
    )
    return {"Authorization": f"Bearer {token}", "Content-Type": "application/json"}

# Create text-to-video task
response = requests.post(f"{BASE}/videos/text2video", headers=get_headers(), json={
    "model_name": "kling-v2-6",
    "prompt": "Aerial drone shot of a coral reef at golden hour, "
              "tropical fish swimming through crystal clear water, "
              "sun rays penetrating the surface, cinematic 4K",
    "negative_prompt": "blurry, low quality, distorted, watermark",
    "duration": "5",
    "aspect_ratio": "16:9",
    "mode": "professional",
    "cfg_scale": 0.5,
})

task = response.json()
task_id = task["data"]["task_id"]

# Poll for completion
while True:
    time.sleep(15)
    result = requests.get(
        f"{BASE}/videos/text2video/{task_id}", headers=get_headers()
    ).json()

    status = result["data"]["task_status"]
    if status == "succeed":
        video = result["data"]["task_result"]["videos"][0]
        print(f"Video URL: {video['url']}")
        print(f"Duration: {video['duration']}s")
        break
    elif status == "failed":
        raise RuntimeError(result["data"]["task_status_msg"])
    # else: submitted/processing — keep polling

With Camera Control

# Camera movement types: pan, tilt, zoom, roll
response = requests.post(f"{BASE}/videos/text2video", headers=get_headers(), json={
    "model_name": "kling-v2-6",
    "prompt": "A medieval castle on a cliff at sunrise, fog in the valley",
    "duration": "5",
    "mode": "standard",
    "camera_control": {
        "type": "simple",
        "config": {
            "horizontal": 5,    # pan right (negative = left), range -10 to 10
            "vertical": 0,      # tilt (negative = down, positive = up)
            "zoom": 3,          # zoom in (positive) or out (negative)
            "roll": 0,          # rotation
            "pan": 0,           # dolly left/right
            "tilt": -2,         # dolly up/down
        }
    },
})

Rule: Only one non-zero field in config for type: "simple".

With Native Audio (v2.6 only)

response = requests.post(f"{BASE}/videos/text2video", headers=get_headers(), json={
    "model_name": "kling-v2-6",
    "prompt": "A jazz band performing in a dimly lit club, saxophone solo, "
              "audience clapping, warm amber lighting",
    "duration": "10",
    "mode": "professional",
    "motion_has_audio": True,  # generates synchronized audio
})

Prompt Engineering Tips

Technique	Example
Scene + action + style	"A samurai walking through cherry blossoms, cinematic slow motion"
Lighting cues	"golden hour", "neon-lit", "overcast diffused light"
Camera language	"close-up", "wide establishing shot", "tracking shot"
Negative prompt	"blurry, watermark, text overlay, distorted faces"
Material/texture	"brushed steel", "hand-painted watercolor", "photorealistic"

Cost Reference

Duration	Standard	Professional
5 seconds	10 credits	35 credits
10 seconds	20 credits	70 credits

Error Handling

Error	Cause	Fix
`400` invalid prompt	Empty or >2500 chars	Check prompt length
`400` invalid model	Unsupported `model_name`	Use valid model ID from catalog
`402` insufficient credits	Not enough credits	Top up account
`task_status: failed`	Content policy violation or complexity	Simplify prompt, remove restricted content

Resources

Info

Category Artificial Intelligence

Name klingai-text-to-video

Version v20260423

Size 6.75KB

Source jeremylongshore/claude-code-plugins-plus-skills

Updated At 2026-07-13