Vidu Q2 is a high-end text-to-image model with cinematic lighting and clean composition. Supports up to 4K resolution and flexible aspect ratios. Upload reference images to guide generation with subject/composition consistency.
Added Dec 2, 2025
Approx. Price
$0.030 per image
Model Type
both
Preview Examples
5
Generation controls available for this model.
Images Per Run
Up to 1
Resolution Options
3
1080p (Fast preview (1920x1080)), 2K (Higher detail (2560x1440)), 4K (Maximum sharpness (3840x2160))
Tunable Settings
4
Aspect Ratio
Default
1:1
Options (9)
Auto (match references), Square, 16:9 Widescreen, 9:16 Vertical +5 more
Canvas shape for generation. Use "auto" to match reference images.
Number of Images
Default
1
Resolution
Default
1080p
Options (3)
1080p (Fast preview (1920x1080)), 2K (Higher detail (2560x1440)), 4K (Maximum sharpness (3840x2160))
Seed
Default
-1
Control reproducibility (-1 for random).
Human preference benchmarks sourced from Artificial Analysis.
Text to Image
#45 / 129
ELO
1110.0
Appearances
6,004
95% CI
-9/9
Image Editing
#37 / 60
ELO
1131.0
Appearances
7,276
95% CI
-9/9
Release Date 2025-11 · Matched as Vidu Q2
Artificial Analysis API
Commercial food photography, extreme macro close-up. A wooden dipper pulling up thick, golden honey from a jar. The honey is translucent and glowing in the sunlight. A few pollen particles floating in the air. Background is a blurred garden. Mouth-watering texture, warm tone, high fidelity, subsurface scattering.

Hyper-realistic portrait of an elderly tribal elder with weathered, leathery skin deep in a rainforest. Every wrinkle and pore is visible. Raindrops sitting on the skin. Intricate colorful feather headdress. Rim lighting from the side highlighting the fine facial fuzz and floating dust particles. Intense gaze, cinematic lighting, shot on 70mm lens, IMAX quality.

A futuristic cyberpunk megastructure city built vertically inside a massive cave system. Neon lights in teal and orange illuminating the fog. Hundreds of flying vehicles moving on different levels. Bottom-up view looking at the towering skyscrapers disappearing into the mist. Volumetric lighting, atmospheric perspective, high contrast, unreal engine 5 style, intricate details.

High-speed photography of a colorful paint and powder explosion in a dark studio. Swirls of red, blue, and gold liquid mixing in mid-air. Liquid droplets frozen in time. High contrast, sharp focus, subsurface scattering in the liquid, dynamic composition, 4k, wallpaper quality.

Close-up of a mechanical heart mechanism. Intricate steampunk style. Thousands of tiny brass gears, copper pipes, and steel pistons working together. Steam escaping from small vents. A glowing amber crystal energy source in the center. Vintage atmosphere, cinematic depth of field, sharp focus on the gears, metallic texture.