The LTXV Empty Latent Audio node creates a batch of empty (zero-filled) latent audio tensors. It uses the configuration from a provided Audio VAE model to determine the correct dimensions for the latent space, such as the number of channels and frequency bins. This empty latent serves as a starting point for audio generation or manipulation workflows within ComfyUI.

Inputs

| Parameter | Description | Data Type | Required | Range |
|---|---|---|---|---|
| frames_number | Number of frames. The default value is 97. | INT | Yes | 1 to 1000 |
| frame_rate | Number of frames per second. The default value is 25. | INT | Yes | 1 to 1000 |
| batch_size | The number of latent audio samples in the batch. The default value is 1. | INT | Yes | 1 to 4096 |
| audio_vae | The Audio VAE model to get configuration from. This parameter is required. | VAE | Yes | N/A |

Note: The audio_vae input is mandatory. The node will raise an error if it is not provided.

Outputs

| Output Name | Description | Data Type |
|---|---|---|
| Latent | An empty latent audio tensor with the structure (batch_size, z_channels, num_audio_latents, audio_freq) configured to match the input Audio VAE. The output also includes a type field set to “audio”. | LATENT |

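
As a rough illustration of how the output shape could follow from the inputs, here is a minimal sketch in plain Python. It is not the node's actual implementation. `AudioVAEConfig`, its `latents_per_second` field, the `empty_latent_audio` helper, and the rule used to derive `num_audio_latents` from `frames_number` and `frame_rate` are all assumptions made for this example. A flat list of zeros stands in for the tensor the real node would produce.

```python
import math
from dataclasses import dataclass


@dataclass
class AudioVAEConfig:
    """Hypothetical stand-in for the configuration read from the Audio VAE."""
    z_channels: int            # number of latent channels
    audio_freq: int            # number of latent frequency bins
    latents_per_second: float  # assumed rate; the real VAE may derive this differently


def empty_latent_audio(frames_number, frame_rate, batch_size, audio_vae):
    """Build a zero-filled latent description (illustrative only)."""
    # The documented behaviour: a missing audio_vae is an error.
    if audio_vae is None:
        raise ValueError("audio_vae is required")
    # Ranges taken from the Inputs table.
    if not 1 <= frames_number <= 1000:
        raise ValueError("frames_number must be between 1 and 1000")
    if not 1 <= frame_rate <= 1000:
        raise ValueError("frame_rate must be between 1 and 1000")
    if not 1 <= batch_size <= 4096:
        raise ValueError("batch_size must be between 1 and 4096")

    # Assumed rule: clip duration in seconds times latents per second, rounded up.
    num_audio_latents = max(
        1, math.ceil(frames_number * audio_vae.latents_per_second / frame_rate)
    )
    shape = (batch_size, audio_vae.z_channels, num_audio_latents, audio_vae.audio_freq)

    # Flat zero-filled buffer standing in for a tensor of the given shape.
    samples = [0.0] * math.prod(shape)
    return {"samples": samples, "shape": shape, "type": "audio"}


# Example with made-up VAE values and the node's default inputs.
cfg = AudioVAEConfig(z_channels=8, audio_freq=16, latents_per_second=25.0)
latent = empty_latent_audio(frames_number=97, frame_rate=25, batch_size=1, audio_vae=cfg)
print(latent["shape"])  # (1, 8, 97, 16)
```

The channel and frequency-bin values above are placeholders. In a real workflow they come from whichever Audio VAE is connected to the node, so the resulting shape will differ between models.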
