This is a native workflow example for Wan2.2-S2V audio-driven video generation in ComfyUI.
Download JSON Workflow
Download the following image and audio as input:Download Input Audio
wan2.2_s2v_14B_fp8_scaled.safetensors
, which requires less VRAM. But you can try wan2.2_s2v_14B_bf16.safetensors
to reduce quality degradation.
wan2.2_s2v_14B_fp8_scaled.safetensors
or wan2.2_s2v_14B_bf16.safetensors
wan2.2_s2v_14B_fp8_scaled.safetensors
, which requires less VRAMwan2.2_s2v_14B_bf16.safetensors
to reduce quality degradationumt5_xxl_fp8_e4m3fn_scaled.safetensors
wan_2.1_vae.safetensors
wav2vec2_large_english_fp16.safetensors
wan2.2_t2v_lightx2v_4steps_lora_v1.1_high_noise.safetensors
(Lightning LoRA)