ComfyUI Ideogram 4.0 Open-Source Model Tutorial

Ideogram 4.0 is the latest text-to-image model from Ideogram, released as an open-source model that runs entirely on your own hardware. It delivers superior photorealistic quality, accurate text rendering, and precise style control. You can use either plain natural language or structured JSON prompts for fine-grained control over layout, colors, and in-image text.

Portable or self deployed users
Desktop or Cloud users

Make sure your ComfyUI is updated.

Workflows in this guide can be found in the Workflow Templates. If you can’t find them in the template, your ComfyUI may be outdated. (Desktop version’s update will delay sometime)If nodes are missing when loading a workflow, possible reasons:

You are not using the latest ComfyUI version (Nightly version)
Some nodes failed to import at startup

Ideogram 4.0 Text-to-Image Workflow

Run in Comfy Cloud

Open in Comfy Cloud

Download Workflow

Download JSON or search “Ideogram v4: Text to Image” in Template Library

Example output from the Ideogram 4.0 model

Prompt Format

The open-source workflow supports two prompt modes:

Natural language — Quick and easy, best for simple ideas
Structured JSON — For precise control over layout, colors, and style

The workflow includes a prompt construction template that you can use with any LLM to generate matching JSON prompts. A note inside the workflow explains:

The model is trained on structured JSON captions (scene summary, style block, background, and optional per-object descriptions with bounding boxes and hex color palettes). Official inference validates prompts against that schema.

Ideogram 4.0 model downloads

You can find all repackaged model files at Comfy-Org/Ideogram-4 on Hugging Face.

ideogram4_fp8_scaled.safetensors

Diffusion model for Ideogram 4.0 (~13.8 GB). Place in models/diffusion_models/

ideogram4_unconditional_fp8_scaled.safetensors

Unconditional diffusion model for Ideogram 4.0 (~13.8 GB). Place in models/diffusion_models/

qwen3vl_8b_fp8_scaled.safetensors

Text encoder for Ideogram 4.0 (~8 GB). Place in models/text_encoders/

gemma4_e4b_it_fp8_scaled.safetensors

Text encoder for Ideogram 4.0 (~2 GB). Place in models/text_encoders/

flux2-vae.safetensors

VAE for Ideogram 4.0 (~335 MB). Place in models/vae/

Model storage location

📂 ComfyUI/
├── 📂 models/
│   ├── 📂 diffusion_models/
│   │   ├── ideogram4_fp8_scaled.safetensors
│   │   └── ideogram4_unconditional_fp8_scaled.safetensors
│   ├── 📂 text_encoders/
│   │   ├── qwen3vl_8b_fp8_scaled.safetensors
│   │   └── gemma4_e4b_it_fp8_scaled.safetensors
│   └── 📂 vae/
│       └── flux2-vae.safetensors

Learn about Subgraph

This workflow uses Subgraph nodes for modular processing. Check the Subgraph documentation to learn how to customize and extend.

Workflow Steps

Download and place the models in the correct directories (see table above)
Download and drag the workflow file into ComfyUI
Enter your prompt (natural language or structured JSON) in the Ideogram4 subgraph node
(Optional) Adjust resolution using the ResolutionSelector node
Click Run or use shortcut Ctrl(cmd) + Enter to generate the image
View the result in the Save Image node — output files are saved to ComfyUI/output/

Note on Safety Filter

If you see “Image blocked by safety filter”, this is Ideogram 4.0’s built-in safety filter. Non-JSON (plain-text) prompts have a higher false-positive rate. Using structured JSON prompts reduces the chance of your prompt being blocked. For more details, see Ideogram 4’s official prompting guide.

Live Conversation with Ideogram & ComfyOrg

A special live conversation with Mohammad Norouzi (CEO, Ideogram) and Yoland Yan (CEO, ComfyOrg), hosted by Purz & Rob.

​Ideogram 4.0 Text-to-Image Workflow