LTXVSeparateAVLatent - ComfyUI Built-in Node Documentation

The LTXVSeparateAVLatent node takes a combined audio-visual latent representation and splits it into two distinct parts: one for video and one for audio. It separates the samples and, if present, the noise masks from the input latent, creating two new latent objects.

Inputs

Parameter	Description	Data Type	Required	Range
`av_latent`	The combined audio-visual latent representation to be separated.	LATENT	Yes	N/A

Note: The `samples` tensor of the input latent is expected to have at least two elements along its first dimension (the batch dimension). The first element becomes the video latent and the second becomes the audio latent. If a `noise_mask` is present, it is split in the same way.
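The split described in the note can be illustrated with a minimal sketch. This is not the node's actual source: the function name `separate_av_latent` is hypothetical, and the latent is assumed to be a plain dictionary with a `samples` entry and an optional `noise_mask` entry. The code only relies on `len()` and integer indexing, so nested lists stand in for tensors here.

```python
from typing import Any, Dict, Tuple


def separate_av_latent(av_latent: Dict[str, Any]) -> Tuple[Dict[str, Any], Dict[str, Any]]:
    """Hypothetical helper: split a combined latent dict into video and audio dicts."""
    samples = av_latent["samples"]
    if len(samples) < 2:
        raise ValueError("samples needs at least two elements along the first dimension")

    # Shallow-copy the input so any other keys carry over to both outputs.
    video_latent = dict(av_latent)
    audio_latent = dict(av_latent)

    # Element 0 is treated as video, element 1 as audio (per the note above).
    video_latent["samples"] = samples[0]
    audio_latent["samples"] = samples[1]

    # The noise mask is optional; when present it is split the same way.
    noise_mask = av_latent.get("noise_mask")
    if noise_mask is not None:
        video_latent["noise_mask"] = noise_mask[0]
        audio_latent["noise_mask"] = noise_mask[1]

    return video_latent, audio_latent
```

For example, calling `separate_av_latent({"samples": [video_part, audio_part]})` would return two dictionaries whose `samples` entries are `video_part` and `audio_part` respectively, with no `noise_mask` key in either.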

Outputs

Output Name	Description	Data Type
`video_latent`	The latent representation containing the separated video data.	LATENT
`audio_latent`	The latent representation containing the separated audio data.	LATENT

This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute!

Source fingerprint (SHA-256): 8e871b6163af27826c197c678214bac7a02c2a5b24279385ba34632c7116356c
