CLIPTextEncodeFlux is an advanced text encoding node designed for the Flux architecture. It processes two separate text inputs through different encoders—CLIP-L and T5XXL—and combines them with a guidance scale to produce a unified conditioning output for image generation.

Inputs

| Parameter | Description | Data Type | Required | Range |
| --- | --- | --- | --- | --- |
| clip | A CLIP model that supports the Flux architecture, including both CLIP-L and T5XXL encoders. | CLIP | Yes | - |
| clip_l | Text input processed by the CLIP-L encoder. Suitable for concise keyword descriptions, such as style or theme. Supports multiline input and dynamic prompts. | STRING | Yes | - |
| t5xxl | Text input processed by the T5XXL encoder. Suitable for detailed natural language descriptions, expressing complex scenes and details. Supports multiline input and dynamic prompts. | STRING | Yes | - |
| guidance | Controls the influence of text conditions on the generation process. Higher values mean stricter adherence to the text. Default: 3.5. | FLOAT | Yes | 0.0 - 100.0 |
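
As an illustration of how these inputs fit together, the sketch below builds one node entry in the shape used by ComfyUI's API-format workflow JSON (a `class_type` plus an `inputs` dictionary, where a link to another node is written as `[node_id, output_index]`). The helper name `flux_text_encode_node`, the node ID `"1"`, and the prompt text are hypothetical; the ID is assumed to refer to a loader node that outputs a Flux-compatible CLIP model. The range check simply mirrors the 0.0 - 100.0 range listed above and is not taken from the node's source.

```python
import json

# Range for `guidance` as listed in the Inputs table.
GUIDANCE_MIN, GUIDANCE_MAX = 0.0, 100.0


def flux_text_encode_node(clip_link, clip_l, t5xxl, guidance=3.5):
    """Hypothetical helper: build an API-format entry for CLIPTextEncodeFlux.

    clip_link is a (node_id, output_index) pair pointing at the node that
    provides the CLIP model.
    """
    if not GUIDANCE_MIN <= guidance <= GUIDANCE_MAX:
        raise ValueError(
            f"guidance must be between {GUIDANCE_MIN} and {GUIDANCE_MAX}, got {guidance}"
        )
    return {
        "class_type": "CLIPTextEncodeFlux",
        "inputs": {
            "clip": list(clip_link),      # link to the CLIP loader output
            "clip_l": clip_l,             # short keyword-style prompt
            "t5xxl": t5xxl,               # detailed natural-language prompt
            "guidance": float(guidance),  # defaults to 3.5
        },
    }


# Assumed example: node "1" is a loader whose first output is the CLIP model.
node = flux_text_encode_node(
    ("1", 0),
    clip_l="watercolor, soft light",
    t5xxl="A quiet harbor at dawn, with fishing boats moored along a wooden pier.",
)
print(json.dumps(node, indent=2))
```

Keeping the short keywords in `clip_l` and the full sentence in `t5xxl` follows the guidance in the table above; the same text can also be passed to both fields.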

Outputs

| Output Name | Description | Data Type |
| --- | --- | --- |
| CONDITIONING | Contains the fused embeddings from both encoders and the guidance parameter, used for conditional image generation. | CONDITIONING |

Source fingerprint (SHA-256): 63027b4a7c1868da27fb2644b0d6599d241fa0206a78d169110ce57f0cebf148