Skip to main content
Wan2.7 is Alibaba’s latest video generation model, now available in ComfyUI via Partner Nodes. It is a comprehensive upgrade over version 2.6 with significant improvements across image quality, audio, motion dynamics, stylization, and consistency. This release brings a fully upgraded multimodal video pipeline directly into your node graph, supporting text, image, audio, and video inputs across five task types.

Key features

  • Image-to-video: First-frame, first+last-frame, and audio-driven generation
  • Text-to-video: Pure text prompts with optional audio input and multi-shot narration
  • Video continuation: Extend an existing clip with new content guided by a text prompt
  • Reference-to-video: Reference both a subject’s visual appearance and vocal timbre; supports up to 5 real-person inputs and multi-character interactions
  • Video edit: Edit or replicate videos via text prompts, reference image, or style transfer

Highlights

  • Supports up to 5 real-person image inputs for multi-character scenes
  • Vocal timbre reference for consistent audio-visual identity
  • 3x3 grid-based image generation
  • Significant improvements in motion dynamics, stylization, and consistency over Wan2.6
To use the API nodes, you need to ensure that you are logged in properly and using a permitted network environment. Please refer to the API Nodes Overview section of the documentation to understand the specific requirements for using the API nodes.
Make sure your ComfyUI is updated.Workflows in this guide can be found in the Workflow Templates. If you can’t find them in the template, your ComfyUI may be outdated. (Desktop version’s update will delay sometime)If nodes are missing when loading a workflow, possible reasons:
  1. You are not using the latest ComfyUI version (Nightly version)
  2. Some nodes failed to import at startup

Wan2.7 image-to-video

Generate video from image inputs. Supports first-frame, first+last-frame, and audio-driven generation modes.

Wan2.7 I2V Workflow

Get the Wan2.7 Image-to-Video workflow file.

Run Wan2.7 I2V on Cloud

Try the Image-to-Video workflow instantly on Comfy Cloud.

Wan2.7 text-to-video

Generate video from pure text prompts. Optionally include audio input and multi-shot narration for richer storytelling.

Wan2.7 T2V Workflow

Get the Wan2.7 Text-to-Video workflow file.

Run Wan2.7 T2V on Cloud

Try the Text-to-Video workflow instantly on Comfy Cloud.

Wan2.7 reference-to-video

Use reference images of a subject’s visual appearance along with an optional vocal timbre reference. Supports up to 5 real-person inputs for multi-character interaction scenes.

Wan2.7 R2V Workflow

Get the Wan2.7 Reference-to-Video workflow file.

Run Wan2.7 R2V on Cloud

Try the Reference-to-Video workflow instantly on Comfy Cloud.

Wan2.7 video edit

Edit or replicate existing videos using text prompts, a reference image, or style transfer.

Wan2.7 Video Edit Workflow

Get the Wan2.7 Video Edit workflow file.

Run Wan2.7 Video Edit on Cloud

Try the Video Edit workflow instantly on Comfy Cloud.