Private AI
Text-to-video and image-to-video with optional end frame control. Native audio generation, cinematic realism, and consistent subjects. Supports 4/6/8 seconds at 720p or 1080p.
Added Oct 11, 2025
Approx. Price
$1.60 per video
Model Type
both
Generation controls available for this model.
Output Format
Default Duration
8
3 duration options
Aspect Ratio (T2V)
Default
16:9
Options (2)
Landscape (16:9), Portrait (9:16)
Applies to text-to-video
Duration
Default
8
Options (3)
4 seconds, 6 seconds, 8 seconds
4/6/8 seconds
Generate Audio
Default
Yes
Enable native audio generation
Resolution
Default
720p
Options (2)
720p, 1080p
Output resolution
Human preference benchmarks sourced from Artificial Analysis.
Text to Video
#19 / 81
ELO
1211.0
Appearances
5,446
95% CI
-9/9
Image to Video
#26 / 74
ELO
1257.0
Appearances
5,253
95% CI
-10/10
Release Date 2026-01 · Matched as Veo 3.1
Artificial Analysis APILoading examples…