Private AI
Audio-driven talking or singing avatar generation from a single image, with lip-synced motion and consistent identity. Supports 480p or 720p output for videos up to 2 minutes long.
Added Dec 24, 2025
Approx. Price
$0.150 per video
Model Type
image-to-video
Generation controls available for this model.
Output Format
Default Duration: N/A
Prompt (optional expression/style prompt)
  Default: N/A
Resolution (video resolution)
  Default: 480p
  Options (2): 480p, 720p
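The controls listed above can be checked on the client before a job is submitted. The following is a minimal sketch only: the names `AvatarRequest` and `build_request`, the field names, and the assumption that the output length follows the length of the input audio are all hypothetical and are not taken from this model's API. Only the resolution options, the 480p default, the optional prompt, and the 2-minute limit come from the listing.

```python
from dataclasses import dataclass
from typing import Optional

# Values taken from the listing above.
ALLOWED_RESOLUTIONS = ("480p", "720p")
DEFAULT_RESOLUTION = "480p"
MAX_DURATION_SECONDS = 120.0  # "up to 2 minutes"


@dataclass(frozen=True)
class AvatarRequest:
    """Hypothetical container for one generation job (not the real API schema)."""
    image_path: str
    audio_path: str
    prompt: Optional[str] = None          # optional expression/style prompt
    resolution: str = DEFAULT_RESOLUTION  # video resolution


def build_request(
    image_path: str,
    audio_path: str,
    audio_seconds: float,
    prompt: Optional[str] = None,
    resolution: str = DEFAULT_RESOLUTION,
) -> AvatarRequest:
    """Validate the controls and return a request object.

    Assumption: the generated video is as long as the driving audio,
    so the 2-minute limit is applied to `audio_seconds`.
    """
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(
            f"resolution must be one of {ALLOWED_RESOLUTIONS}, got {resolution!r}"
        )
    if not 0 < audio_seconds <= MAX_DURATION_SECONDS:
        raise ValueError(
            f"audio must be longer than 0 s and at most {MAX_DURATION_SECONDS:.0f} s"
        )
    return AvatarRequest(image_path, audio_path, prompt, resolution)
```

For example, `build_request("face.png", "speech.wav", audio_seconds=30.0)` falls back to 480p with no prompt, while passing `resolution="1080p"` or an audio length above 120 seconds raises `ValueError` before any request is sent.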
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.