The LTXVSeparateAVLatent node takes a combined audio-visual latent representation and splits it into two distinct parts: one for video and one for audio. It separates the samples and, if present, the noise masks from the input latent, creating two new latent objects.

Inputs

| Parameter | Description | Data Type | Required | Range |
| --- | --- | --- | --- | --- |
| av_latent | The combined audio-visual latent representation to be separated. | LATENT | Yes | N/A |

Note: The samples tensor of the input latent must have at least two elements along its first (batch) dimension. The first element becomes the video latent and the second becomes the audio latent. If a noise_mask is present, it is split in the same way.
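
The split described in the note can be sketched in a few lines of Python. This is an illustrative sketch only, not the node's actual implementation: the helper name separate_av_latent is hypothetical, nested lists stand in for tensors, and carrying any other dictionary keys over to both outputs is an assumption.

```python
def separate_av_latent(av_latent):
    """Hypothetical helper mirroring the split described in the note."""
    samples = av_latent["samples"]
    if len(samples) < 2:
        raise ValueError("samples needs at least two elements along the first dimension")

    # Shallow copies: any other keys are carried over to both outputs (assumption).
    video_latent = dict(av_latent)
    audio_latent = dict(av_latent)
    video_latent["samples"] = samples[0]  # first element -> video
    audio_latent["samples"] = samples[1]  # second element -> audio

    # Split the noise mask the same way, but only when one is present.
    mask = av_latent.get("noise_mask")
    if mask is not None:
        video_latent["noise_mask"] = mask[0]
        audio_latent["noise_mask"] = mask[1]
    return video_latent, audio_latent


# Nested lists stand in for tensors; the values are made up for illustration.
combined = {
    "samples": [[[0.1, 0.2]], [[0.9]]],
    "noise_mask": [[[1.0, 1.0]], [[0.0]]],
}
video, audio = separate_av_latent(combined)
print(video["samples"])  # [[0.1, 0.2]]
print(audio["samples"])  # [[0.9]]
```

Because the sketch only indexes along the first dimension, the video and audio parts may have different shapes, as they do in the example above.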

Outputs

| Output Name | Description | Data Type |
| --- | --- | --- |
| video_latent | The latent representation containing the separated video data. | LATENT |
| audio_latent | The latent representation containing the separated audio data. | LATENT |

This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute!
