What Gemini Omni Flash offers
- Conversational video editing: Refine and edit videos using natural language — swap characters, relight scenes, alter angles, add or remove objects while maintaining original audio and video tracks
- Multimodal input: Combine text, images, and video inputs to guide generation. Natively generates synchronized audio with every video output
- World knowledge and simulation: Combines physics understanding with Gemini’s knowledge of history, science, and cultural context, enabling meaningful storytelling beyond photorealism
- Text and action synchronization: Render legible text and graphics directly into video, syncing kinetic typography with on-screen movements
- Pricing: $0.10 per second of video output, matching Veo 3.1 Fast pricing
Workflows
Text to Video
Run in Comfy Cloud
Open in Comfy Cloud
Download Workflow
Download JSON or search “Gemini Omni Flash” in Template Library
Image to Video
Run in Comfy Cloud
Open in Comfy Cloud
Download Workflow
Download JSON or search “Gemini Omni Flash” in Template Library
Download Sample Image 1
Get the example input image for this workflow
Download Sample Image 2
Get the second example input image
Video Edit
Run in Comfy Cloud
Open in Comfy Cloud
Download Workflow
Download JSON or search “Gemini Omni Flash” in Template Library
Download Sample Video
Get the example input video for this workflow
Get started
- Update ComfyUI to the latest version
- Double-click the canvas and search for “Gemini Omni Flash” nodes
- Or go to the Template Library to use the ready-to-go workflows
- Choose the workflow that matches your input type (text, image, or video)
- Enter your prompt and generate
For the best results, combine Gemini Omni Flash with Nano Banana 2 Lite: generate images at high speed, then use Gemini Omni Flash to animate them into video.