TextEncodeQwenImageEdit - ComfyUI Built-in Node Documentation

The TextEncodeQwenImageEdit node turns a text prompt and an optional image into conditioning data for image generation or editing. It uses the connected CLIP model to tokenize the prompt, together with the image when one is supplied. If a VAE is also connected, the node encodes the image into reference latents and attaches them to the conditioning. Any supplied image is first resized automatically to a consistent processing scale of approximately 1024x1024 pixels.

Inputs

Parameter	Description	Data Type	Required	Range
`clip`	The CLIP model used for text and image tokenization	CLIP	Yes	-
`prompt`	Text prompt for conditioning generation, supports multiline input and dynamic prompts	STRING	Yes	-
`vae`	Optional VAE model for encoding reference images into latents	VAE	No	-
`image`	Optional input image for reference or editing purposes	IMAGE	No	-

Note: When both `image` and `vae` are provided, the node encodes the image into reference latents and attaches them to the conditioning output. Whenever an image is provided, it is automatically resized to a consistent processing scale of approximately 1024x1024 pixels.
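
The resizing mentioned in the note can be sketched as follows. This is a minimal illustration, not the node's source code: it assumes the node targets the total pixel area of a 1024x1024 image and preserves the aspect ratio, and the names `TARGET_PIXELS` and `target_size` are hypothetical.

```python
import math

# Assumption: the target is the total area of a 1024x1024 image,
# not a fixed 1024x1024 output shape.
TARGET_PIXELS = 1024 * 1024


def target_size(width: int, height: int) -> tuple[int, int]:
    """Return (width, height) scaled to roughly TARGET_PIXELS, keeping the aspect ratio."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Scaling both sides by the same factor multiplies the area by scale**2.
    scale = math.sqrt(TARGET_PIXELS / (width * height))
    return round(width * scale), round(height * scale)


print(target_size(2048, 2048))  # (1024, 1024)
print(target_size(1920, 1080))  # (1365, 768)
```

Under this assumption, a 1920x1080 input keeps its 16:9 shape and is scaled to 1365x768, which covers about the same number of pixels as a 1024x1024 image.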

Outputs

Output Name	Description	Data Type
`CONDITIONING`	Conditioning data produced from the tokenized prompt (and image, if provided), with reference latents attached when both `image` and `vae` are supplied	CONDITIONING
