About Wan2.1 Fun Camera
Wan2.1 Fun Camera is a video generation project launched by the Alibaba team, focusing on controlling video generation effects through camera motion. Model Weights Download: Code Repository: VideoX-Fun ComfyUI now natively supports the Wan2.1 Fun Camera model.Make sure your ComfyUI is updated.Workflows in this guide can be found in the Workflow Templates.
If you can’t find them in the template, your ComfyUI may be outdated.(Desktop version’s update will delay sometime)If nodes are missing when loading a workflow, possible reasons:
- You are not using the latest ComfyUI version(Nightly version)
- You are using Stable or Desktop version (Latest changes may not be included)
- Some nodes failed to import at startup
Model Installation
These models only need to be installed once. Additionally, model download information is included in the corresponding workflow images, so you can choose your preferred way to download the models. All of the following models can be found at Wan_2.1_ComfyUI_repackaged Diffusion Models choose either 1.3B or 14B: If you’ve used Wan2.1 related models before, you should already have the following models. If not, please download them: Text Encoders choose one: VAE CLIP Vision File Storage Location:ComfyUI Wan2.1 Fun Camera 1.3B Native Workflow Example
1. Workflow Related Files Download
1.1 Workflow File
Download the video below and drag it into ComfyUI to load the corresponding workflow:Download Json Workflow File
If you want to use the 14B version, simply replace the model file with the 14B version, but please be aware of the VRAM requirements.
1.2 Input Image Download
Please download the image below, which we will use as the starting frame:
2. Complete the Workflow Step by Step

- Ensure the correct version of model file is loaded:
- 1.3B version:
wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors
- 14B version:
wan2.1_fun_camera_v1.1_14B_bf16.safetensors
- 1.3B version:
- Ensure the
Load CLIP
node has loadedumt5_xxl_fp8_e4m3fn_scaled.safetensors
- Ensure the
Load VAE
node has loadedwan_2.1_vae.safetensors
- Ensure the
Load CLIP Vision
node has loadedclip_vision_h.safetensors
- Upload the starting frame to the
Load Image
node - Modify the Prompt if you’re using your own input image
- Set camera motion in the
WanCameraEmbedding
node - Click the
Run
button or use the shortcutCtrl(cmd) + Enter
to execute generation
ComfyUI Wan2.1 Fun Camera 14B Workflow and Input Image
Download Json Workflow File
Input Image
Performance Reference
1.3B Version:- 512×512 resolution on RTX 4090 takes about 72 seconds to generate 81 frames
- RTX4090 24GB VRAM may experience insufficient memory when generating 512×512 resolution, and memory issues have also occurred on A100 when using larger sizes