The Wan Image to Video node generates a video from a single input image and a text prompt. It uses the provided image as the first frame and creates a video sequence based on the description, with options for resolution, duration, audio, and other advanced settings.

Inputs

| Parameter | Description | Data Type | Required | Range |
|---|---|---|---|---|
| model | Model to use (default: "wan2.6-i2v"). | COMBO | Yes | "wan2.5-i2v-preview", "wan2.6-i2v" |
| image | Input image that serves as the first frame for video generation. Exactly one image is required. | IMAGE | Yes | - |
| prompt | Prompt describing the elements and visual features. Supports English and Chinese (default: empty). | STRING | Yes | - |
| negative_prompt | Negative prompt describing what to avoid (default: empty). | STRING | No | - |
| resolution | Video resolution quality (default: "720P"). The Wan 2.6 model does not support 480P. | COMBO | No | "480P", "720P", "1080P" |
| duration | Duration of the generated video in seconds (default: 5). A 15-second duration is supported only by the Wan 2.6 model. | INT | No | 5-15 (step: 5) |
| audio | Audio must contain a clear, loud voice, without extraneous noise or background music. When provided, audio duration must be between 3.0 and 29.0 seconds. | AUDIO | No | - |
| seed | Seed to use for generation (default: 0). | INT | No | 0-2147483647 |
| generate_audio | If no audio input is provided, generate audio automatically (default: False). | BOOLEAN | No | - |
| prompt_extend | Whether to enhance the prompt with AI assistance (default: True). | BOOLEAN | No | - |
| watermark | Whether to add an AI-generated watermark to the result (default: False). | BOOLEAN | No | - |
| shot_type | Shot type for the generated video: a single continuous shot or multiple shots with cuts (default: "single"). Takes effect only when prompt_extend is True. | COMBO | No | "single", "multi" |

Constraints:
  • Exactly one input image is required for video generation.
  • The Wan 2.6 model (wan2.6-i2v) does not support 480P resolution.
  • A 15-second duration is supported only by the Wan 2.6 model (wan2.6-i2v).
  • When audio is provided, it must be between 3.0 and 29.0 seconds in duration.
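
The constraints above can be checked before a job is queued. The following is a minimal sketch, not part of the node itself: the function name `check_wan_i2v_settings` and its `audio_seconds` and `image_count` arguments are hypothetical helpers introduced for illustration. It assumes the 3.0 to 29.0 second audio bounds are inclusive and that the 5-15 (step: 5) range means the values 5, 10, and 15.

```python
from typing import List, Optional

# Values taken from the Inputs table above.
MODELS = ("wan2.5-i2v-preview", "wan2.6-i2v")
RESOLUTIONS = ("480P", "720P", "1080P")
DURATIONS = (5, 10, 15)  # assumption: range 5-15 with step 5


def check_wan_i2v_settings(
    model: str = "wan2.6-i2v",
    resolution: str = "720P",
    duration: int = 5,
    audio_seconds: Optional[float] = None,  # hypothetical: length of the audio input, if any
    image_count: int = 1,                   # hypothetical: number of input images supplied
) -> List[str]:
    """Return a list of constraint violations; an empty list means the settings look valid."""
    problems: List[str] = []
    if model not in MODELS:
        problems.append(f"unknown model: {model}")
    if resolution not in RESOLUTIONS:
        problems.append(f"unknown resolution: {resolution}")
    if image_count != 1:
        problems.append("exactly one input image is required")
    if model == "wan2.6-i2v" and resolution == "480P":
        problems.append("wan2.6-i2v does not support 480P")
    if duration not in DURATIONS:
        problems.append("duration must be 5, 10, or 15 seconds")
    elif duration == 15 and model != "wan2.6-i2v":
        problems.append("15-second duration requires wan2.6-i2v")
    # Assumption: both bounds are inclusive.
    if audio_seconds is not None and not (3.0 <= audio_seconds <= 29.0):
        problems.append("audio must be between 3.0 and 29.0 seconds")
    return problems


if __name__ == "__main__":
    # A 15-second clip on the preview model violates one constraint.
    print(check_wan_i2v_settings(model="wan2.5-i2v-preview", duration=15))
```

Returning a list of messages rather than raising on the first failure lets a caller report every violated constraint at once.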

Outputs

| Output Name | Description | Data Type |
|---|---|---|
| output | Generated video based on the input image and prompt. | VIDEO |

This documentation was AI-generated.

Source fingerprint (SHA-256): b8a75e324f7436e8a376e4a058b0a32556cafbe8e7975148cbc6302638f52058