Private AI
Vidu Q3 text-to-video and image-to-video with high visual fidelity, multiple styles, 540p/720p/1080p output, 1-16s duration, and optional audio plus background music.
Added Jan 31, 2026
Approx. Price
$0.350 per video
Model Type
Text-to-video and image-to-video
Generation controls available for this model.
Output Format
Default Duration
5 seconds
16 duration options
Aspect Ratio (T2V only)
Default
4:3
Options (5)
Landscape (16:9), Standard (4:3), Square (1:1), Portrait (3:4) +1 more
Applies to text-to-video
Background Music
Default
Yes
Add background music
Duration
Default
5 seconds
Options (16)
1 second, 2 seconds, 3 seconds, 4 seconds +12 more
Video length in seconds (1-16)
Generate Audio
Default
Yes
Generate synchronized audio
Motion
Default
auto
Options (4)
Auto, Small, Medium, Large
Movement intensity
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Resolution
Default
720p
Options (3)
540p, 720p, 1080p
Output resolution
Style (T2V only)
Default
general
Options (2)
General, Anime
Visual style for text-to-video
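The controls listed above can be checked before a request is submitted. The following is a minimal sketch in Python, not the platform's actual API: the field names (duration, resolution, motion, generate_audio, background_music, aspect_ratio, style), the mode strings "t2v" and "i2v", and the validate helper are all hypothetical and chosen only for illustration. The allowed values and defaults are copied from the listing. The listing names only four of the five aspect ratio options, so the fifth is left out rather than guessed.

```python
from dataclasses import dataclass

# Allowed values copied from the control listing. All names here are
# hypothetical illustrations, not the platform's real request fields.
RESOLUTIONS = ("540p", "720p", "1080p")
MOTIONS = ("auto", "small", "medium", "large")
STYLES = ("general", "anime")
# The listing shows four of five aspect ratios; the fifth is not guessed.
ASPECT_RATIOS = ("16:9", "4:3", "1:1", "3:4")


@dataclass
class VideoSettings:
    """Generation controls with the defaults shown in the listing."""
    duration: int = 5              # seconds, 1 to 16
    resolution: str = "720p"
    motion: str = "auto"
    generate_audio: bool = True
    background_music: bool = True
    aspect_ratio: str = "4:3"      # text-to-video only
    style: str = "general"         # text-to-video only


def validate(settings: VideoSettings, mode: str = "t2v") -> list[str]:
    """Return a list of problems; an empty list means the settings pass."""
    problems = []
    if mode not in ("t2v", "i2v"):
        problems.append(f"unknown mode: {mode}")
    if not 1 <= settings.duration <= 16:
        problems.append(f"duration out of range: {settings.duration}")
    if settings.resolution not in RESOLUTIONS:
        problems.append(f"unsupported resolution: {settings.resolution}")
    if settings.motion not in MOTIONS:
        problems.append(f"unsupported motion: {settings.motion}")
    # Aspect ratio and style apply to text-to-video only, so they are
    # checked only in that mode and ignored otherwise.
    if mode == "t2v":
        if settings.aspect_ratio not in ASPECT_RATIOS:
            problems.append(f"unlisted aspect ratio: {settings.aspect_ratio}")
        if settings.style not in STYLES:
            problems.append(f"unsupported style: {settings.style}")
    return problems
```

Calling validate(VideoSettings()) with the defaults returns an empty list. In this sketch an aspect ratio outside the four named options is reported as unlisted, which may wrongly flag the fifth option that the listing does not show.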