Baidu's ERNIE Image model for high-quality multilingual text-to-image generation with built-in prompt expansion.
Added Apr 10, 2026
Approx. Price
$0.010 per image
Model Type
text-to-image
Preview Examples
3
Generation controls available for this model.
Images Per Run
Up to 4
Resolution Options
11
512x512 (Square), 1024x1024 (Square HD), 768x1024 (Portrait (3:4)) +8 more
Tunable Settings
9
Enable Safety Checker
Default
No
Guidance Scale
Default
5
How strongly to follow the prompt (1-20).
Inference Steps
Default
50
Number of denoising steps (1-100).
Negative Prompt
Default
N/A
Describe what to avoid in the image.
Number of Images
Default
1
Output Format
Default
jpeg
Options (2)
JPEG, PNG
Image output format.
Prompt Expansion
Human preference benchmarks sourced from Artificial Analysis.
No Artificial Analysis benchmark data is available yet for this model.
Artificial Analysis API
Cinematic portrait of a red fox standing in a snowy pine forest at sunrise, warm rim light, highly detailed fur, shallow depth of field
1024x1024, default settings

Vintage travel poster for Lisbon with clear headline text "LISBON" and subtitle "Summer by the Atlantic", warm orange and teal palette, art deco composition
512x512, default settings

国潮风格海报:一只白鹤掠过青山与云海,金色日出,细节丰富,电影感构图
512x512, default settings
Default
Yes
Enhance the prompt automatically for richer outputs.
Resolution
Default
1024x1024
Options (11)
512x512 (Square), 1024x1024 (Square HD), 768x1024 (Portrait (3:4)), 576x1024 (Portrait (9:16)) +7 more
Seed
Default
N/A
Random seed for reproducible outputs.