> ## Documentation Index
> Fetch the complete documentation index at: https://docs.comfy.org/llms.txt
> Use this file to discover all available pages before exploring further.

# CLIPLoader - ComfyUI Built-in Node Documentation

> Complete documentation for the CLIPLoader node in ComfyUI. Learn its inputs, outputs, parameters and usage.

> This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! [Edit on GitHub](https://github.com/Comfy-Org/embedded-docs/blob/main/comfyui_embedded_docs/docs/CLIPLoader/en.md)

The CLIPLoader node loads a text encoder model (CLIP, T5, or similar) from a file, making it available for use in other nodes that need to convert text prompts into numerical representations. It supports a wide variety of model architectures, each requiring a specific encoder type.

## Inputs

| Parameter   | Data Type | Required | Range                                                                                                                                                                                                                                                                                                                                                        | Description                                                                                                                                                    |
| ----------- | --------- | -------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | -------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| `clip_name` | STRING    | Yes      | List of files found in the `text_encoders` folder                                                                                                                                                                                                                                                                                                            | The filename of the text encoder model to load. This must be a file located in the `ComfyUI/models/text_encoders/` or `ComfyUI/models/clip/` directory.        |
| `type`      | STRING    | Yes      | `"stable_diffusion"`<br />`"stable_cascade"`<br />`"sd3"`<br />`"stable_audio"`<br />`"mochi"`<br />`"ltxv"`<br />`"pixart"`<br />`"cosmos"`<br />`"lumina2"`<br />`"wan"`<br />`"hidream"`<br />`"chroma"`<br />`"ace"`<br />`"omnigen2"`<br />`"qwen_image"`<br />`"hunyuan_image"`<br />`"flux2"`<br />`"ovis"`<br />`"longcat_image"`<br />`"cogvideox"` | The architecture type of the model being loaded. This determines which specific encoder variant to use. The default is `"stable_diffusion"`.                   |
| `device`    | STRING    | No       | `"default"`<br />`"cpu"`                                                                                                                                                                                                                                                                                                                                     | The device to load the model onto. `"default"` uses the GPU if available, while `"cpu"` forces CPU loading. This is an advanced option (default: `"default"`). |

### Supported Type-to-Encoder Mappings

The `type` parameter selects the correct encoder for a given model architecture. The following are common mappings:

| Type              | Encoder                       |
| ----------------- | ----------------------------- |
| stable\_diffusion | clip-l                        |
| stable\_cascade   | clip-g                        |
| sd3               | t5 xxl / clip-g / clip-l      |
| stable\_audio     | t5 base                       |
| mochi             | t5 xxl                        |
| cogvideox         | t5 xxl (226-token padding)    |
| cosmos            | old t5 xxl                    |
| lumina2           | gemma 2 2B                    |
| wan               | umt5 xxl                      |
| hidream           | llama-3.1 (recommended) or t5 |
| omnigen2          | qwen vl 2.5 3B                |

## Outputs

| Output Name | Data Type | Description                                                                                             |
| ----------- | --------- | ------------------------------------------------------------------------------------------------------- |
| `clip`      | CLIP      | The loaded text encoder model, ready to be connected to other nodes for text encoding and conditioning. |

***

**Source fingerprint (SHA-256):** `1051bfe5570dff81719682cb09938bae4c03e94e0e72f7a2be84867cccb48017`