The ElevenLabs Speech to Speech node transforms an input audio file from one voice to another. It uses the ElevenLabs API to convert speech while preserving the original content and emotional tone of the audio.

Inputs

| Parameter | Description | Data Type | Required | Range / Options |
| --- | --- | --- | --- | --- |
| voice | Target voice for the transformation. Connect from Voice Selector or Instant Voice Clone. | CUSTOM | Yes | - |
| audio | Source audio to transform. | AUDIO | Yes | - |
| stability | Voice stability. Lower values give a broader emotional range; higher values produce more consistent but potentially monotonous speech (default: 0.5). | FLOAT | No | 0.0 - 1.0 |
| model | Model to use for speech-to-speech transformation. Each option provides a specific set of voice settings (similarity_boost, style, use_speaker_boost, speed). | DYNAMICCOMBO | No | eleven_multilingual_sts_v2, eleven_english_sts_v2 |
| output_format | Audio output format (default: "mp3_44100_192"). | COMBO | No | "mp3_44100_192", "opus_48000_192" |
| seed | Seed for reproducibility (default: 0). | INT | No | 0 - 4294967295 |
| remove_background_noise | Remove background noise from input audio using audio isolation (default: False). | BOOLEAN | No | - |
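
The ranges and options listed for the inputs can be checked before a workflow is queued. The following is a minimal sketch of such a check in Python, assuming a hypothetical `SpeechToSpeechSettings` container that is not part of the node or of the ElevenLabs API; only the parameter names, defaults, and allowed values come from the Inputs section. The `voice` and `audio` inputs are left out because they are supplied by other nodes.

```python
from dataclasses import dataclass

# Allowed values copied from the Inputs section of this page.
MODELS = ("eleven_multilingual_sts_v2", "eleven_english_sts_v2")
OUTPUT_FORMATS = ("mp3_44100_192", "opus_48000_192")
MAX_SEED = 4294967295  # upper bound listed for the seed input (2**32 - 1)


@dataclass
class SpeechToSpeechSettings:
    """Hypothetical holder for the node's non-audio inputs (illustration only)."""

    model: str  # no default is documented, so it must be given explicitly here
    stability: float = 0.5
    output_format: str = "mp3_44100_192"
    seed: int = 0
    remove_background_noise: bool = False

    def validate(self) -> None:
        """Raise ValueError if any value falls outside the documented range."""
        if self.model not in MODELS:
            raise ValueError(f"model must be one of {MODELS}, got {self.model!r}")
        if not 0.0 <= self.stability <= 1.0:
            raise ValueError(f"stability must be in [0.0, 1.0], got {self.stability}")
        if self.output_format not in OUTPUT_FORMATS:
            raise ValueError(
                f"output_format must be one of {OUTPUT_FORMATS}, got {self.output_format!r}"
            )
        if not 0 <= self.seed <= MAX_SEED:
            raise ValueError(f"seed must be in [0, {MAX_SEED}], got {self.seed}")


# Usage: documented defaults pass, an out-of-range stability is rejected.
settings = SpeechToSpeechSettings(model="eleven_multilingual_sts_v2")
settings.validate()
```

Keeping the allowed values in module-level tuples means the check needs a one-line change if the node later offers additional models or output formats.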

Outputs

| Output Name | Description | Data Type |
| --- | --- | --- |
| audio | The transformed audio file in the specified output format. | AUDIO |
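
When handling the returned audio downstream, it can help to know what the chosen `output_format` implies. The sketch below assumes the identifiers follow a `codec_sampleRate_bitrate` pattern (for example, MP3 at 44.1 kHz and 192 kbps for "mp3_44100_192"); this page does not state that convention, and `parse_output_format` is a hypothetical helper, not part of the node.

```python
from typing import NamedTuple


class AudioFormat(NamedTuple):
    codec: str
    sample_rate_hz: int
    bitrate_kbps: int


def parse_output_format(value: str) -> AudioFormat:
    """Split an identifier such as "mp3_44100_192" into its three parts.

    Assumes the layout codec_sampleRate_bitrate; raises ValueError otherwise.
    """
    parts = value.split("_")
    if len(parts) != 3:
        raise ValueError(f"unexpected output_format layout: {value!r}")
    codec, sample_rate, bitrate = parts
    return AudioFormat(codec, int(sample_rate), int(bitrate))


print(parse_output_format("mp3_44100_192"))
# prints: AudioFormat(codec='mp3', sample_rate_hz=44100, bitrate_kbps=192)
```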

Source fingerprint (SHA-256): ef065ffa78a63398e746b52c6c8f2c336e6a4137722537c8026d292ed397a246