Inputs
| Parameter | Description | Data Type | Required | Range |
|---|---|---|---|---|
clip | The CLIP model used for text and image tokenization | CLIP | Yes | - |
prompt | Text prompt for conditioning generation, supports multiline input and dynamic prompts | STRING | Yes | - |
vae | Optional VAE model for encoding reference images into latents | VAE | No | - |
image | Optional input image for reference or editing purposes | IMAGE | No | - |
image and vae are provided, the node encodes the image into reference latents and attaches them to the conditioning output. The image is automatically resized to maintain a consistent processing scale of approximately 1024x1024 pixels.
Outputs
| Output Name | Description | Data Type |
|---|---|---|
CONDITIONING | Conditioning data containing text tokens and optional reference latents for image generation | CONDITIONING |
This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub
Source fingerprint (SHA-256):
7d1f82174d92ee86ca35e14a364e1a703fb9fc1ac9979611bd894a0a639c58a2