Native 4K Kling V3 generation for text-to-video and image-to-video. Supports first/last-frame image control, 3-15 second durations, 16:9/9:16/1:1 output, and optional native audio.
Added Apr 23, 2026
Approx. Price
$2.10 per video
Model Type
both
Preview Examples
2
Generation controls available for this model.
Resolution/Aspect Options
3
Default Duration
5
13 duration options
Tunable Settings
3
Aspect Ratio
Default
16:9
Options (3)
Landscape (16:9), Portrait (9:16), Square (1:1)
Output aspect ratio
Duration
Default
5
Options (13)
3 seconds, 4 seconds, 5 seconds, 6 seconds +9 more
Video duration in seconds
Generate Audio
Default
Yes
Enable native audio generation
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Artificial Analysis APICinematic drone shot over a snow-capped mountain range at golden hour with volumetric clouds and smooth forward camera motion.
text-to-video | https://fal.ai/models/fal-ai/kling-video/v3/4k/text-to-video
Animate a still frame into a moody nighttime scene with subtle character movement and gentle handheld camera drift.
image-to-video | https://fal.ai/models/fal-ai/kling-video/v3/4k/image-to-video