GeminiNanoBanana2V2 - ComfyUI Built-in Node Documentation

Overview

This node generates or edits images by sending a text prompt to Google’s Vertex AI API. It uses the Gemini 3.1 Flash Image model to create new images or modify existing ones based on your instructions.

Inputs

Parameter	Description	Data Type	Required	Range
`prompt`	Text prompt describing the image to generate or the edits to apply. Include any constraints, styles, or details the model should follow.	STRING	Yes	N/A
`model`	Selects the Gemini model to use for image generation. This parameter includes additional sub-parameters for resolution, aspect ratio, thinking level, and image input.	COMBO	Yes	`"Nano Banana 2 (Gemini 3.1 Flash Image)"`
`seed`	When the seed is fixed to a specific value, the model makes a best effort to return the same response for repeated requests, but deterministic output isn’t guaranteed. Changing the model or other parameter settings, such as the temperature, can also cause variations in the response even with the same seed value. (default: 42)	INT	Yes	0 to 18446744073709551615
`response_modalities`	Determines the format of the response. Choose “IMAGE” to receive only an image, or “IMAGE+TEXT” to receive both an image and a text description. (default: “IMAGE”)	COMBO	Yes	`"IMAGE"` `"IMAGE+TEXT"`
`system_prompt`	Foundational instructions that dictate an AI’s behavior. This is an advanced parameter. (default: A pre-defined system prompt instructing the model to always produce an image)	STRING	No	N/A
`temperature`	Controls randomness in generation. Lower values produce more focused and deterministic results. This is an advanced parameter. (default: 1.0)	FLOAT	No	0.0 to 2.0
`top_p`	Nucleus sampling threshold. Lower values produce more focused results, higher values produce more diverse results. This is an advanced parameter. (default: 0.95)	FLOAT	No	0.0 to 1.0
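
The ranges in the table above can be checked before a workflow is queued. The following is a minimal sketch rather than the node's actual implementation: `NanoBananaSettings` and `validate` are hypothetical names introduced for illustration, and only the defaults and ranges are taken from the table.

```python
from dataclasses import dataclass

# Documented upper bound for `seed` (equal to 2**64 - 1).
MAX_SEED = 18446744073709551615


@dataclass
class NanoBananaSettings:
    """Hypothetical container; field names mirror the Inputs table only."""

    prompt: str
    seed: int = 42
    response_modalities: str = "IMAGE"
    temperature: float = 1.0
    top_p: float = 0.95

    def validate(self) -> list[str]:
        """Return a list of problems; an empty list means all values are in range."""
        problems = []
        if not 0 <= self.seed <= MAX_SEED:
            problems.append("seed must be between 0 and 18446744073709551615")
        if self.response_modalities not in ("IMAGE", "IMAGE+TEXT"):
            problems.append("response_modalities must be IMAGE or IMAGE+TEXT")
        if not 0.0 <= self.temperature <= 2.0:
            problems.append("temperature must be between 0.0 and 2.0")
        if not 0.0 <= self.top_p <= 1.0:
            problems.append("top_p must be between 0.0 and 1.0")
        return problems
```

Calling `NanoBananaSettings(prompt="a red bicycle").validate()` with the defaults returns an empty list, while an out-of-range value such as `temperature=2.5` yields a single message naming that parameter.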

Note on model parameter: The model parameter is a dynamic combo that includes additional sub-parameters for resolution, aspect ratio, thinking level, and image input. These sub-parameters are defined within the model selection and are not listed as separate inputs in this table.

Note on image input: You can provide up to 14 images as input to the model. These images are passed through the model parameter’s image sub-field and are used for editing or as visual context for generation.
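
The 14-image limit can be enforced before the images reach the node. This is a minimal sketch under the assumption that the images are held in a plain Python list; `check_image_count` is a hypothetical helper, not part of ComfyUI.

```python
# Limit stated in the image input note.
MAX_INPUT_IMAGES = 14


def check_image_count(images: list) -> int:
    """Return the number of images, or raise ValueError if the limit is exceeded."""
    count = len(images)
    if count > MAX_INPUT_IMAGES:
        raise ValueError(
            f"{count} images supplied, but at most {MAX_INPUT_IMAGES} are accepted"
        )
    return count
```

An empty list passes the check, which matches pure text-to-image generation where no input image is supplied.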

Outputs

Output Name	Description	Data Type
`IMAGE`	The generated or edited image.	IMAGE
`STRING`	A text description or caption generated by the model.	STRING
`thought_image`	First image from the model’s thinking process. Only available when the `thinking_level` sub-parameter is set to HIGH and `response_modalities` is “IMAGE+TEXT”.	IMAGE
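
The condition for the `thought_image` output can be written as a small predicate. This is an illustrative sketch only: `thought_image_expected` is a hypothetical function, and the string values `"HIGH"` and `"IMAGE+TEXT"` are assumed to match what the node exposes.

```python
def thought_image_expected(thinking_level: str, response_modalities: str) -> bool:
    """True only when both documented conditions for `thought_image` hold."""
    return thinking_level == "HIGH" and response_modalities == "IMAGE+TEXT"
```

A downstream workflow could use such a check to decide whether to connect the `thought_image` output at all.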

This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute!

Source fingerprint (SHA-256): e3806713e8c58ac9ba0fe41875ef7162b7e18d276c63be53e46865efcc359079

