This article will introduce how to use Google Gemini API nodes in ComfyUI to complete conversational functions
Download Json Format Workflow File
Load Image
node, load the image you need AI to interpretGoogle Gemini
to have AI execute specific tasksRun
button, or use the shortcut Ctrl(cmd) + Enter
to execute the conversation.Preview Any
node.Gemini Input Files
requires files to be uploaded to the ComfyUI/input/
directory first. This node is being improved, and we will modify the template after updatesBatch Images
for input. If you have multiple images that need AI interpretation, you can refer to the step diagram and use right-click to set the corresponding node mode to Always
to enable it