Inputs
| Parameter | Description | Data Type | Required | Range |
|---|---|---|---|---|
clip | The CLIP model used for tokenization and encoding | CLIP | Yes | - |
prompt | Text instruction describing the desired image modification (supports multiline input and dynamic prompts) | STRING | Yes | - |
vae | Optional VAE model for generating reference latents from input images | VAE | No | - |
image1 | First optional input image for analysis and modification | IMAGE | No | - |
image2 | Second optional input image for analysis and modification | IMAGE | No | - |
image3 | Third optional input image for analysis and modification | IMAGE | No | - |
Outputs
| Output Name | Description | Data Type |
|---|---|---|
CONDITIONING | Encoded conditioning data containing text tokens and optional reference latents for image generation | CONDITIONING |
This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub
Source fingerprint (SHA-256):
40e0104e1a5fd88afb889948bc43559f99049a91c03c3f9885455b6dbfde343e