This guide will help you create dynamic music using the ACE-Step model in ComfyUI.
Download JSON Format Workflow File
You can also manually download ace_step_v1_3.5b.safetensors and save it to the ComfyUI/models/checkpoints folder.
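Before opening the workflow, it can help to confirm that the checkpoint is where ComfyUI expects it. The following is a minimal sketch, assuming a standard ComfyUI directory layout; the function names `checkpoint_path` and `checkpoint_installed` are hypothetical helpers for illustration and are not part of ComfyUI.

```python
from pathlib import Path

# File name taken from this guide.
CHECKPOINT_NAME = "ace_step_v1_3.5b.safetensors"


def checkpoint_path(comfy_root):
    """Return the expected checkpoint location under a ComfyUI install.

    Hypothetical helper: assumes checkpoints live in
    <comfy_root>/models/checkpoints, as described in this guide.
    """
    return Path(comfy_root) / "models" / "checkpoints" / CHECKPOINT_NAME


def checkpoint_installed(comfy_root):
    """True if the ACE-Step checkpoint file exists at the expected location."""
    return checkpoint_path(comfy_root).is_file()
```

For example, `checkpoint_installed("ComfyUI")` can be called from the directory that contains your ComfyUI folder.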
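1. Ensure the Load Checkpoints node has loaded the ace_step_v1_3.5b.safetensors model.
2. In the EmptyAceStepLatentAudio node, you can set the duration of the music to be generated.
3. In the LatentOperationTonemapReinhard node, you can adjust the multiplier to control the volume of the vocals (higher numbers result in more prominent vocals).
4. Enter the music style and other descriptors in the tags field of TextEncodeAceStepAudio.
5. Enter the lyrics in the lyrics field of TextEncodeAceStepAudio.
6. Click the Run button, or use the shortcut Ctrl(cmd) + Enter, to execute the audio generation.
7. After the workflow completes, you can preview the generated audio in the Save Audio node. You can click to play and listen to it, and the audio will also be saved to ComfyUI/output/audio (subdirectory determined by the Save Audio node).

For the audio-to-audio workflow that follows, you can control how much the result differs from the input audio with the denoise parameter in the KSampler.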
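To build intuition for the denoise parameter, here is a rough sketch of one common way image-to-image style samplers interpret it: the sampler plans a longer noise schedule and runs only the final portion of it, so a lower denoise leaves more of the input intact. This is an assumption made for illustration and is not confirmed by this guide as the KSampler's internal behavior; `steps_for_denoise` is a hypothetical helper.

```python
def steps_for_denoise(steps, denoise):
    """Illustrative model of partial denoising (an assumption, not KSampler's API).

    Returns (planned_schedule_length, skipped_steps): the sampler is modeled as
    planning steps / denoise steps in total and executing only the last `steps`.
    """
    if not 0.0 < denoise <= 1.0:
        raise ValueError("denoise must be in the range (0, 1]")
    planned = int(steps / denoise)
    skipped = planned - steps
    return planned, skipped
```

Under this model, a denoise of 1.00 skips nothing, so the schedule starts from full noise. That is consistent with the guide's remark that 1.00 is approximately equivalent to having no audio input.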
Download JSON Format Workflow File
Download the following audio file as the input audio: Download Example Audio File for Input
1. Ensure the Load Checkpoints node has loaded the ace_step_v1_3.5b.safetensors model.
2. Load the input audio file in the LoadAudio node.
3. Modify the tags and lyrics fields of TextEncodeAceStepAudio. Providing lyrics is very important for audio editing.
4. Modify the denoise parameter in the KSampler node to adjust the noise added during sampling, which controls similarity with the original audio (smaller values result in more similarity to the original audio; setting it to 1.00 is approximately equivalent to having no audio input).
5. Click the Run button, or use the shortcut Ctrl(cmd) + Enter, to execute the audio generation.
6. After the workflow completes, you can preview the generated audio in the Save Audio node. You can click to play and listen to it, and the audio will also be saved to ComfyUI/output/audio (subdirectory determined by the Save Audio node).

You can also try the following changes:
- Modify the tags in TextEncodeAceStepAudio from male voice to female voice to generate female vocals.
- Modify the lyrics in TextEncodeAceStepAudio to change the lyrics and thus the generated audio. Refer to the examples on the ACE-Step project page for more details.

Prompts for ACE-Step have two parts: tags and lyrics.
- tags: Mainly used to describe music styles, scenes, etc. Similar to the prompts used for other kinds of generation, they describe the overall style and requirements of the audio, separated by English commas.
- lyrics: Mainly used to provide the lyrics, supporting lyric structure tags such as [verse], [chorus], and [bridge] to distinguish different parts of the lyrics. You can also input instrument names for purely instrumental music.

You can find examples of tags and lyrics on the ACE-Step model homepage, and you can refer to these examples to try corresponding prompts. This document’s prompt guide is organized based on the project to help you quickly try combinations to achieve your desired effect.
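The two prompt formats described above can be assembled programmatically. The following is a minimal sketch, assuming you want to paste the resulting strings into the tags and lyrics fields of TextEncodeAceStepAudio; `build_tags` and `build_lyrics` are hypothetical helpers, and the lyric lines in the usage note are placeholders.

```python
def build_tags(descriptors):
    """Join style descriptors with English commas, skipping empty entries."""
    cleaned = [d.strip() for d in descriptors if d.strip()]
    return ", ".join(cleaned)


def build_lyrics(sections):
    """Format lyrics from (structure_tag, lines) pairs.

    Each section starts with its structure tag in square brackets,
    e.g. [verse] or [chorus], followed by its lines; sections are
    separated by a blank line.
    """
    blocks = []
    for tag, lines in sections:
        blocks.append("\n".join([f"[{tag}]"] + list(lines)))
    return "\n\n".join(blocks)
```

For example, `build_tags(["pop", "female voice"])` yields a comma-separated tags string, and `build_lyrics([("verse", ["first line"]), ("chorus", ["hook line"])])` yields lyrics with [verse] and [chorus] markers.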
Language tags are also supported in lyrics, such as Chinese [zh], Korean [ko], etc.