Private AI
Kuaishou's unified multi-modal video model (Standard tier) optimized for cost efficiency. Supports text-only input for text-to-video, image input for image-to-video, reference images/video for reference-based generation, or video-only input for natural language video editing.
Added Dec 17, 2025
Approx. Price
$0.420 per video
Model Type
both
Generation controls available for this model.
Output Format
N/A
Default Duration
5
2 duration options
Duration
Default
5
Options (2)
5 seconds, 10 seconds
Video duration in seconds
Keep Original Sound
Default
true
Options (2)
Yes, No
Preserve original audio when using video input
Mode
Default
auto
Options (3)
Auto-detect, Edit Video, Reference to Video
How to process video input: Edit modifies the video, Reference uses it as style guidance for new generation
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Artificial Analysis APILoading examples…