- 🚀 Z-Image-Turbo – A distilled version that matches or exceeds leading competitors with only 8 NFEs (Number of Function Evaluations). It offers sub-second inference latency on enterprise-grade H800 GPUs and runs on consumer devices with 16 GB of VRAM.
- 🧱 Z-Image-Base – The non-distilled foundation model for community-driven fine-tuning and custom development.
- ✍️ Z-Image-Edit – A variant fine-tuned for image editing tasks with impressive instruction-following capabilities.
- Photorealistic Quality: Delivers strong photorealistic image generation while maintaining excellent aesthetic quality
- Accurate Bilingual Text Rendering: Excels at accurately rendering complex Chinese and English text
- Prompt Enhancing & Reasoning: Prompt Enhancer empowers the model with reasoning capabilities
- Sub-second Inference: Generates images in under a second on supported hardware, such as enterprise-grade H800 GPUs
Z-Image-Turbo text-to-image workflow
Download JSON Workflow File
Run on ComfyUI Cloud
Model links
- text_encoders
- diffusion_models
- vae

Model Storage Location

Z-Image-Turbo Fun Union ControlNet workflow
This workflow uses the Z-Image-Turbo Fun Union ControlNet model to generate images with ControlNet guidance. It applies Canny edge detection to a reference image, then passes the resulting edge map to the ControlNet to guide generation.
Download JSON Workflow File
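To give a rough idea of what the edge-detection step produces, here is a minimal pure-Python sketch. It is not the workflow's Canny implementation: it is a simplified Sobel gradient threshold used as a stand-in, whereas full Canny also applies Gaussian smoothing, non-maximum suppression, and hysteresis thresholding. The function name `sobel_edge_map` and the default `threshold` value are assumptions made for illustration only.

```python
def sobel_edge_map(gray, threshold=100):
    """Return a binary edge map (0 or 255) for a 2D grayscale image.

    `gray` is a list of rows of intensity values in 0..255.
    Simplified stand-in for Canny: Sobel gradient magnitude plus one threshold.
    Border pixels are left at 0 because the 3x3 kernel does not fit there.
    """
    height, width = len(gray), len(gray[0])
    edges = [[0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Horizontal gradient: right column minus left column (Sobel weights 1, 2, 1).
            gx = (gray[y - 1][x + 1] + 2 * gray[y][x + 1] + gray[y + 1][x + 1]
                  - gray[y - 1][x - 1] - 2 * gray[y][x - 1] - gray[y + 1][x - 1])
            # Vertical gradient: bottom row minus top row (Sobel weights 1, 2, 1).
            gy = (gray[y + 1][x - 1] + 2 * gray[y + 1][x] + gray[y + 1][x + 1]
                  - gray[y - 1][x - 1] - 2 * gray[y - 1][x] - gray[y - 1][x + 1])
            magnitude = (gx * gx + gy * gy) ** 0.5
            if magnitude >= threshold:
                edges[y][x] = 255
    return edges


# Hypothetical 6x6 reference image: dark left half, bright right half.
reference = [[0, 0, 0, 255, 255, 255] for _ in range(6)]
edge_map = sobel_edge_map(reference)
```

The white pixels in `edge_map` trace the boundary between the dark and bright regions. An edge map of this kind is what the ControlNet receives as its structural hint, so the generated image follows the outlines of the reference rather than its colors or textures.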