Kling 2.1 Pro image-to-video model. Higher quality video generation from images with text prompts. Requires an input image.
Added May 29, 2025
Approx. Price
$0.220 per video
Model Type
image-to-video
Preview Examples
2
Generation controls available for this model.
Resolution/Aspect Options
3
Default Duration
5
2 duration options
Tunable Settings
4
Aspect Ratio
Default
16:9
Options (3)
Landscape (16:9), Portrait (9:16), Square (1:1)
Choose between landscape (16:9), portrait (9:16), or square (1:1) orientation
CFG Scale
Default
0.5
Controls how closely the generation follows the prompt (0.0-1.0)
Duration
Default
5
Options (2)
5 seconds, 10 seconds (double price)
Length of the generated video in seconds
Negative Prompt
Default
blur, distort, and low quality
What to avoid in the video (default: blur, distort, and low quality)
Human preference benchmarks sourced from Artificial Analysis.
Image to Video
#49 / 76
ELO
1189.0
Appearances
2,804
95% CI
-10/10
Release Date 2025-05 · Matched as Kling 2.1 Pro
Artificial Analysis APIVideo editing via conversational natural language commands
High quality cinematic video generation