This guide shows how to use the Hunyuan Text-to-Video and Image-to-Video workflows in ComfyUI.
The Hunyuan Video series is developed and open-sourced by Tencent. It features a hybrid architecture that supports both Text-to-Video and Image-to-Video generation at a parameter scale of 13B.
Technical features:
You can learn more through the official repositories: Hunyuan Video and Hunyuan Video-I2V.
This guide will walk you through setting up both Text-to-Video and Image-to-Video workflows in ComfyUI.
The workflow images in this tutorial contain metadata with model download information.
Simply drag them into ComfyUI, or use the menu Workflows -> Open (Ctrl+O), to load the corresponding workflow; it will then prompt you to download the required models.
Alternatively, this guide provides direct model links if automatic downloads fail or you are not using the Desktop version. All models are available here for download.
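If you prefer to script the downloads, here is a minimal Python sketch using the huggingface_hub library. The repo id Comfy-Org/HunyuanVideo_repackaged and the remote file paths are assumptions about where the repackaged files are hosted; substitute the actual links from this guide if they differ:

```python
import shutil
from pathlib import Path
from huggingface_hub import hf_hub_download

REPO_ID = "Comfy-Org/HunyuanVideo_repackaged"  # assumed repo id -- verify against the links above

# Map of (assumed) remote paths to the ComfyUI folders where the files belong.
files = {
    "split_files/vae/hunyuan_video_vae_bf16.safetensors": "ComfyUI/models/vae",
    "split_files/diffusion_models/hunyuan_video_t2v_720p_bf16.safetensors": "ComfyUI/models/diffusion_models",
}

for remote_path, target_dir in files.items():
    cached = hf_hub_download(repo_id=REPO_ID, filename=remote_path)  # downloads into the HF cache
    Path(target_dir).mkdir(parents=True, exist_ok=True)
    shutil.copy(cached, Path(target_dir) / Path(remote_path).name)   # place it where ComfyUI looks
```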
If you find missing nodes when loading a workflow file below, make sure you have updated ComfyUI to the latest Development (Nightly) version. See the How to Update ComfyUI section for instructions.
The following models are used in both Text-to-Video and Image-to-Video workflows. Please download and save them to the specified directories:
Storage location:
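For reference, the shared files should end up laid out as follows. The two text encoder filenames are assumptions based on the repackaged release, so match them to whatever your workflow's DualCLIPLoader expects:

```
ComfyUI/
└── models/
    ├── text_encoders/
    │   ├── clip_l.safetensors                   # assumed filename
    │   └── llava_llama3_fp8_scaled.safetensors  # assumed filename
    └── vae/
        └── hunyuan_video_vae_bf16.safetensors
```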
Hunyuan Text-to-Video was open-sourced in December 2024. It supports generating 5-second short videos from natural language descriptions in both Chinese and English.
Download the image below and drag it into ComfyUI to load the workflow:
Download hunyuan_video_t2v_720p_bf16.safetensors and save it to the ComfyUI/models/diffusion_models folder.
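After downloading, the layout for the Text-to-Video model should look like this:

```
ComfyUI/
└── models/
    └── diffusion_models/
        └── hunyuan_video_t2v_720p_bf16.safetensors
```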
Ensure all these model files are loaded in the corresponding nodes:
- DualCLIPLoader node: the text encoder models (see the storage layout above)
- Load Diffusion Model node: hunyuan_video_t2v_720p_bf16.safetensors
- Load VAE node: hunyuan_video_vae_bf16.safetensors
Click the Queue button, or use the shortcut Ctrl(Cmd) + Enter, to run the workflow.

When the length parameter in the EmptyHunyuanLatentVideo node is set to 1, the model generates a static image instead of a video.
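If you drive ComfyUI through its HTTP API rather than the UI, the same length tweak can be scripted. A minimal sketch, assuming you have exported this workflow in API format to a hypothetical file hunyuan_t2v_api.json and that ComfyUI is running at the default 127.0.0.1:8188:

```python
import json
import urllib.request

# Hypothetical filename for the API-format export of this workflow.
with open("hunyuan_t2v_api.json", "r", encoding="utf-8") as f:
    workflow = json.load(f)

# Find the EmptyHunyuanLatentVideo node and set its length.
for node in workflow.values():
    if node.get("class_type") == "EmptyHunyuanLatentVideo":
        node["inputs"]["length"] = 1  # 1 frame -> static image; raise it for video

payload = json.dumps({"prompt": workflow}).encode("utf-8")
req = urllib.request.Request(
    "http://127.0.0.1:8188/prompt",  # ComfyUI's queue endpoint
    data=payload,
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(resp.read().decode("utf-8"))  # returns a prompt_id on success
```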
The Hunyuan Image-to-Video model was open-sourced on March 6, 2025. Built on the HunyuanVideo framework, it transforms static images into smooth, high-quality videos, and it also provides LoRA training code for customizing special video effects such as hair growth and object transformation.
Currently, the Hunyuan Image-to-Video model has two versions:
- v1 “concat”
- v2 “replace”
Download llava_llama3_vision.safetensors and save it to the ComfyUI/models/clip_vision directory.
Download the workflow image below and drag it into ComfyUI to load the workflow:
Download the image below, which we’ll use as the starting frame for the image-to-video generation:
Ensure all these model files are loaded in the corresponding nodes:
- DualCLIPLoader node: the text encoder models (same as in the Text-to-Video workflow)
- Load CLIP Vision node: llava_llama3_vision.safetensors
- Load Image node: the start image downloaded above
- Load VAE node: hunyuan_video_vae_bf16.safetensors
- Load Diffusion Model node: hunyuan_video_image_to_video_720p_bf16.safetensors
Click the Queue button, or use the shortcut Ctrl(Cmd) + Enter, to run the workflow.

The v2 workflow is essentially the same as the v1 workflow; you just need to download the replace model and select it in the Load Diffusion Model node.
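If you keep API-format exports of your workflows, the v1-to-v2 switch can also be made by editing the JSON directly. A sketch, assuming a hypothetical export file hunyuan_i2v_v1_api.json; UNETLoader is the class name behind the Load Diffusion Model node:

```python
import json

# Hypothetical filename for the exported v1 API-format workflow.
with open("hunyuan_i2v_v1_api.json", "r", encoding="utf-8") as f:
    workflow = json.load(f)

# "Load Diffusion Model" is the UNETLoader node; swap in the v2 "replace" model.
for node in workflow.values():
    if node.get("class_type") == "UNETLoader":
        node["inputs"]["unet_name"] = "hunyuan_video_v2_replace_image_to_video_720p_bf16.safetensors"

with open("hunyuan_i2v_v2_api.json", "w", encoding="utf-8") as f:
    json.dump(workflow, f, indent=2)
```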
Download the workflow image below and drag it into ComfyUI to load the workflow:
Download the image below, which we’ll use as the starting frame for the image-to-video generation:
Ensure all these model files are loaded in the corresponding nodes:
- DualCLIPLoader node: the text encoder models (same as in the Text-to-Video workflow)
- Load CLIP Vision node: llava_llama3_vision.safetensors
- Load Image node: the start image downloaded above
- Load VAE node: hunyuan_video_vae_bf16.safetensors
- Load Diffusion Model node: hunyuan_video_v2_replace_image_to_video_720p_bf16.safetensors
Click the Queue button, or use the shortcut Ctrl(Cmd) + Enter, to run the workflow.

Below are some sample images and prompts we provide. Use them as-is, or adjust them, to create your own videos.
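To try several prompt variations in one batch, you can queue them over the API. A sketch, assuming a hypothetical API-format export and that the first CLIPTextEncode node found is the positive prompt (verify this against your own workflow):

```python
import json
import urllib.request

def queue(workflow: dict) -> None:
    """POST one workflow to ComfyUI's default queue endpoint."""
    data = json.dumps({"prompt": workflow}).encode("utf-8")
    req = urllib.request.Request("http://127.0.0.1:8188/prompt", data=data,
                                 headers={"Content-Type": "application/json"})
    urllib.request.urlopen(req).read()

with open("hunyuan_t2v_api.json", "r", encoding="utf-8") as f:  # hypothetical export
    base = json.load(f)

prompts = [  # placeholders -- replace with the provided prompts or your own
    "a cat walking through a sunlit garden",
    "a paper boat drifting down a rainy street",
]

for text in prompts:
    wf = json.loads(json.dumps(base))  # deep copy so each run starts clean
    for node in wf.values():
        if node.get("class_type") == "CLIPTextEncode":
            node["inputs"]["text"] = text
            break  # assumes the first CLIPTextEncode is the positive prompt
    queue(wf)
```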