Private AI
Kuaishou's unified multi-modal video model (Standard tier) optimized for cost efficiency. Supports text-only input for text-to-video, image input for image-to-video, reference images/video for reference-based generation, or video-only input for natural language video editing.
Added Dec 17, 2025
Approx. Price: $0.420 per video
Model Type: both
Generation controls available for this model.
Output Format: N/A
Default Duration: 5 seconds (2 duration options)
Duration
  Default: 5
  Options (2): 5 seconds, 10 seconds
  Video duration in seconds.

Keep Original Sound
  Default: true
  Options (2): Yes, No
  Preserve the original audio when using video input.

Mode
  Default: auto
  Options (3): Auto-detect, Edit Video, Reference to Video
  How to process video input: Edit modifies the video; Reference uses it as style guidance for a new generation.
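
As an illustration of how the three controls above fit together, here is a minimal Python sketch that checks a set of control values against the listed defaults and options before a request is built. The field names (duration, keep_original_sound, mode), the mode identifiers ("auto", "edit", "reference"), and the payload shape are assumptions made for this example; the page does not give the real API parameter names.

```python
from dataclasses import dataclass

# Allowed values copied from the controls listed on this page.
ALLOWED_DURATIONS = (5, 10)  # seconds
# Hypothetical identifiers for Auto-detect, Edit Video, Reference to Video.
ALLOWED_MODES = ("auto", "edit", "reference")


@dataclass(frozen=True)
class VideoControls:
    """Generation controls with the defaults listed on this page.

    Field names are illustrative assumptions, not a documented API.
    """
    duration: int = 5
    keep_original_sound: bool = True
    mode: str = "auto"

    def __post_init__(self):
        # Reject values outside the documented option sets.
        if self.duration not in ALLOWED_DURATIONS:
            raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}")
        if not isinstance(self.keep_original_sound, bool):
            raise ValueError("keep_original_sound must be True or False")
        if self.mode not in ALLOWED_MODES:
            raise ValueError(f"mode must be one of {ALLOWED_MODES}")

    def to_payload(self) -> dict:
        # Hypothetical request fragment; the real field names may differ.
        return {
            "duration": self.duration,
            "keep_original_sound": self.keep_original_sound,
            "mode": self.mode,
        }
```

For example, VideoControls(duration=10, mode="edit").to_payload() builds a fragment for a 10-second edit of an input video, while VideoControls(duration=7) raises ValueError because 7 is not one of the two listed durations.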
No benchmark data is available yet for this model.