Skip to main content
The CLIPTextEncodeHunyuanDiT node converts text descriptions into a format that the HunyuanDiT model can understand. It is an advanced conditioning node designed for the dual text encoder architecture of HunyuanDiT, processing two separate text inputs through different tokenizers.

Inputs

ParameterDescriptionData TypeRequiredRange
clipA CLIP model instance used for text tokenization and encoding, which is core to generating conditions.CLIPYes-
bertText input for encoding via the BERT tokenizer. Prefers phrases and keywords. Supports multiline and dynamic prompts.STRINGYes-
mt5xlText input for encoding via the mT5-XL tokenizer. Supports multiline and dynamic prompts (multilingual). Can use complete sentences and complex descriptions.STRINGYes-

Outputs

Output NameDescriptionData Type
CONDITIONINGThe encoded conditioning output, combining both BERT and mT5-XL tokenized text, used for further processing in generation tasks.CONDITIONING
This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub

Source fingerprint (SHA-256): bde7c884f72829491090965bd9af34ad59ec326f96e88bb7cdb9ddc47592137e