Private AI
Fast Veo 3.1 generation for text-to-video, image-to-video, and reference-to-video with up to 3 total images. Supports optional end frame control for image-to-video, native audio, and 4/6/8 seconds at 720p or 1080p.
Added May 14, 2026
Approx. Price
$0.640 per video
Model Type
both
Generation controls available for this model.
Output Format
Default Duration
8
3 duration options
Aspect Ratio (T2V)
Default
16:9
Options (2)
Landscape (16:9), Portrait (9:16)
Applies to text-to-video
Duration
Default
8
Options (3)
4 seconds, 6 seconds, 8 seconds
4/6/8 seconds
Generate Audio
Default
Yes
Enable native audio generation
Resolution
Default
720p
Options (2)
720p, 1080p
Output resolution
Human preference benchmarks sourced from Artificial Analysis.
Text to Video
#20 / 83
ELO
1212.0
Appearances
5,131
95% CI
-9/9
Image to Video
#17 / 76
ELO
1262.0
Appearances
5,128
95% CI
-10/10
Release Date 2026-01 · Matched as Veo 3.1 Fast
Artificial Analysis APILoading examples…