GeminiImageNode - ComfyUI Built-in Node Documentation

The GeminiImage node generates text and image responses from Google’s Gemini AI models. It allows you to provide multimodal inputs including text prompts, images, and files to create coherent text and image outputs. The node handles all API communication and response parsing with the latest Gemini models.

Inputs

Parameter	Description	Data Type	Input Type	Default	Range
`prompt`	Text prompt for generation	STRING	required	""	-
`model`	The Gemini model to use for generating responses	COMBO	required	gemini_2_5_flash_image_preview	Available Gemini models Options extracted from GeminiImageModel enum
`seed`	When seed is fixed to a specific value, the model makes a best effort to provide the same response for repeated requests. Deterministic output isn’t guaranteed. Also, changing the model or parameter settings, such as the temperature, can cause variations in the response even when you use the same seed value. By default, a random seed value is used	INT	required	42	0 to 18446744073709551615
`images`	Optional image(s) to use as context for the model. To include multiple images, you can use the Batch Images node	IMAGE	optional	None	-
`files`	Optional file(s) to use as context for the model. Accepts inputs from the Gemini Generate Content Input Files node	GEMINI_INPUT_FILES	optional	None	-

Note: The node includes hidden parameters (auth_token, comfy_api_key, unique_id) that are automatically handled by the system and do not require user input.

Outputs

Output Name	Description	Data Type
`IMAGE`	The generated image response from the Gemini model	IMAGE
`STRING`	The generated text response from the Gemini model	STRING

This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub

​Inputs

​Outputs

Inputs

Outputs