Inputs
| Parameter | Description | Data Type | Required | Range |
|---|---|---|---|---|
clip_vision | The CLIP vision model used for encoding the input image | CLIP_VISION | Yes | - |
init_image | The initial input image to be processed and encoded | IMAGE | Yes | - |
vae | The VAE model used for encoding image pixels into latent space | VAE | Yes | - |
width | The output width for the processed image (default: 256, must be divisible by 8) | INT | No | 16 to MAX_RESOLUTION |
height | The output height for the processed image (default: 256, must be divisible by 8) | INT | No | 16 to MAX_RESOLUTION |
batch_size | The number of conditioning samples to generate in the batch (default: 1) | INT | No | 1 to 4096 |
elevation | The initial camera elevation angle in degrees (default: 0.0) | FLOAT | No | -180.0 to 180.0 |
azimuth | The initial camera azimuth angle in degrees (default: 0.0) | FLOAT | No | -180.0 to 180.0 |
elevation_batch_increment | The amount to increment elevation for each batch item (default: 0.0) | FLOAT | No | -180.0 to 180.0 |
azimuth_batch_increment | The amount to increment azimuth for each batch item (default: 0.0) | FLOAT | No | -180.0 to 180.0 |
width and height parameters must be divisible by 8 as the node internally divides these dimensions by 8 for latent space generation.
Outputs
| Output Name | Description | Data Type |
|---|---|---|
positive | The positive conditioning data containing image embeddings and camera parameters | CONDITIONING |
negative | The negative conditioning data with zero-initialized embeddings | CONDITIONING |
latent | The latent representation of the processed image with batch indexing information | LATENT |
This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub
Source fingerprint (SHA-256):
fb181f43076087cb040db9404a42e27a39ac7eac7af3da283b3e040342374c74