AI creators.tools

ERNIE-Image image model

Name: ERNIE-Image
Also Known As: ERNIE-Image, ERNIE-Image Turbo
Licence: Apache License 2.0
Creator: Baidu

ERNIE-Image is a text-to-image model from Baidu, released in April 2026. It uses a single-stream Diffusion Transformer architecture.

There are two main versions. ERNIE-Image is the standard release with 8B parameters, and it usually runs at about 50 inference steps. ERNIE-Image-Turbo is the faster distilled version, built for about 8 steps.
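As a rough illustration of how the two versions differ in practice, here is a minimal sketch that records the step counts mentioned above as presets. The class, dictionary keys, and function names are hypothetical and are not part of any Baidu API. Only the model names and the approximate step counts (50 and 8) come from the description above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VariantPreset:
    """Hypothetical container for the settings described in the text."""
    name: str
    steps: int        # approximate inference steps from the description
    distilled: bool   # True for the distilled (Turbo) version


# Values taken from the description above; the keys are our own labels.
PRESETS = {
    "standard": VariantPreset("ERNIE-Image", steps=50, distilled=False),
    "turbo": VariantPreset("ERNIE-Image-Turbo", steps=8, distilled=True),
}


def pick_preset(prefer_speed: bool) -> VariantPreset:
    """Return the Turbo preset when speed matters, else the standard one."""
    return PRESETS["turbo"] if prefer_speed else PRESETS["standard"]


def step_ratio(slow: VariantPreset, fast: VariantPreset) -> float:
    """Ratio of step counts only; real wall-clock speedup may differ."""
    return slow.steps / fast.steps


if __name__ == "__main__":
    preset = pick_preset(prefer_speed=True)
    print(preset.name, preset.steps)
    print(step_ratio(PRESETS["standard"], PRESETS["turbo"]))
```

The step ratio is only a proxy for speed. Actual generation time also depends on hardware, image size, and whatever runtime is used to serve the model.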

Baidu presents ERNIE-Image as an open 8B text-to-image model built for more exact image making, not just nice-looking art. It is said to do better with text inside images, prompt following, and layout control than many image tools. That makes it a good fit for posters, infographics, comics, and similar work where the words and placement matter a lot.

It supports English, Chinese, and Japanese prompts and comes with a lightweight built-in Prompt Enhancer. That helper can turn short prompts into fuller descriptions before the image is made.

Its main selling point is control. Baidu says the model is meant to handle prompts with many objects, relationships, and text layouts better than many art-first models, which still mess up readable words or organized scenes.

That makes it more useful for real design work, not just AI art play. 

The model also seems to handle different aspect ratios, with suggested sizes such as 1024×1024, 848×1264, and 1264×848.
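If you start from an arbitrary canvas size, one simple way to work with these suggestions is to snap to the suggested size with the closest aspect ratio. The sketch below assumes the three sizes listed above are the only options, which may not be true of the real model. The function name and the log-ratio comparison are our own choices, not something Baidu documents.

```python
import math

# Suggested (width, height) pairs quoted in the description above.
SUGGESTED_SIZES = [(1024, 1024), (848, 1264), (1264, 848)]


def closest_suggested_size(width: int, height: int) -> tuple[int, int]:
    """Return the suggested size whose aspect ratio is nearest to width/height.

    Ratios are compared on a log scale so that portrait and landscape
    shapes are treated symmetrically (2:1 is as far from square as 1:2).
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    target = math.log(width / height)
    return min(
        SUGGESTED_SIZES,
        key=lambda size: abs(math.log(size[0] / size[1]) - target),
    )


if __name__ == "__main__":
    print(closest_suggested_size(1920, 1080))  # a landscape request
    print(closest_suggested_size(1080, 1920))  # a portrait request
    print(closest_suggested_size(500, 500))    # a square request
```

This only picks among the listed sizes. It does not say anything about how the model behaves at other resolutions.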

Baidu is known around the world for search, but it has also worked on the ERNIE model family for years. More recently, it has shared open ERNIE family projects such as ERNIE 4.5 and related work. So ERNIE-Image looks like part of a bigger move to make more of that model family public instead of keeping it behind closed products.


ERNIE-Image Examples

Wow, check that out. ERNIE with 8B parameters has done a great job with these stylized letters. Generated on April 16, 2026.

Realism is not bad here. Generated on April 16, 2026.

Quite expressive with oil painting styles, I'd say. Generated on April 16, 2026.

Mixed media style is solid, but the poor capybara's got an extra hand. Baidu needs to train on capys harder. Generated on April 16, 2026.

Mixed media style from a JSON prompt returned a simple realistic result, with no doodles or similar elements. I do like the realism here and the text handling. Generated on April 16, 2026.

A swerving car action test. Generated on April 16, 2026.

No Will Smith, but the understanding of the typical meme font is great. Generated on April 16, 2026.

I'd say a pretty artistic model. Generated on April 16, 2026.

Where To Find ERNIE-Image

If you'd like to access this model, you can explore the following possibilities: