Private AI
Audio-driven two-person avatar generation from a single image and left/right audio tracks. Supports simultaneous or sequential dialogue, 480p/720p output, and up to 30 seconds of audio.
Added May 23, 2026
Approx. Price
$0.150 per video
Model Type
image-to-video
Generation controls available for this model.
Output Format
Default Duration
N/A
Audio Order
Default
meanwhile
Options (3)
Together, Left then right, Right then left
Whether both speakers talk together or one after the other
Left Audio URL
Default
N/A
Public URL for the person on the left
Prompt
Default
N/A
Optional expression/style prompt
Resolution
Default
480p
Options (2)
480p, 720p
Video resolution
Right Audio URL
Default
N/A
Public URL for the person on the right
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Artificial Analysis APILoading examples...