Grok Imagine Video 1.5 Image to Video ComfyUI Official Example

The Grok Imagine Video 1.5 Partner Node enables high-quality video generation with native audio from a single image input. Powered by xAI’s latest Grok model, it produces realistic motion with synchronized audio and supports up to 1080p resolution. The node supports two model variants selected via its model parameter:

grok-imagine-video — the previous generation model, supports optional image input
grok-imagine-video-1.5 — the latest model, always requires an input image and supports 1080p output

Both variants generate native audio — sound effects, ambience, and dialogue are synthesized in the same pass, with no separate audio pipeline needed. Video duration ranges from 1 to 15 seconds.
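
The rules above can be expressed as a small pre-flight check before spending credits on a run. This is a minimal sketch, not part of ComfyUI or the xAI API: the helper name `check_grok_video_request` and its arguments are hypothetical, and it encodes only the constraints stated in this guide (the 1.5 model requires an image, and duration is 1 to 15 seconds).

```python
from typing import Optional

# Model identifiers as listed in this guide.
KNOWN_MODELS = {"grok-imagine-video", "grok-imagine-video-1.5"}
MODELS_REQUIRING_IMAGE = {"grok-imagine-video-1.5"}


def check_grok_video_request(
    model: str, duration_s: int, image_path: Optional[str]
) -> list[str]:
    """Return a list of problems; an empty list means the request looks valid.

    Hypothetical helper: it only encodes the rules stated in this guide.
    """
    problems = []
    if model not in KNOWN_MODELS:
        problems.append(f"unknown model: {model}")
    if model in MODELS_REQUIRING_IMAGE and not image_path:
        problems.append(f"{model} requires an input image")
    if not 1 <= duration_s <= 15:
        problems.append("duration must be between 1 and 15 seconds")
    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every issue at once.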

Partner Nodes require that you are logged in and connected from a permitted network environment. See the Partner Nodes Overview section of the documentation for the specific requirements.


Make sure your ComfyUI is updated.

Workflows in this guide can be found in the Workflow Templates. If you can’t find them there, your ComfyUI may be outdated (updates to the Desktop version may lag behind). If nodes are missing when loading a workflow, possible reasons are:

You are not using the latest ComfyUI version (Nightly version)
Some nodes failed to import at startup
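
One way to narrow down which case applies is to compare the node types a workflow uses with the node types the running server has registered. The following is a minimal sketch under two assumptions: the workflow is in ComfyUI's API (prompt) format, where each node entry carries a `class_type`, and `registered` holds the node names reported by the server (for example, the keys of the JSON returned by its `/object_info` endpoint). The helper name `missing_node_types` is hypothetical.

```python
def missing_node_types(workflow: dict, registered: set[str]) -> list[str]:
    """List class_type values used by the workflow but absent from the server."""
    used = {node["class_type"] for node in workflow.values()}
    return sorted(used - registered)


# Toy data for illustration; a real check would use the server's own node list.
workflow = {
    "1": {"class_type": "LoadImage", "inputs": {}},
    "2": {"class_type": "GrokVideoNode", "inputs": {}},
    "3": {"class_type": "SaveVideo", "inputs": {}},
}
print(missing_node_types(workflow, {"LoadImage", "SaveVideo"}))  # ['GrokVideoNode']
```

If the Partner Node shows up as missing, updating ComfyUI or checking the startup log for import errors are the two fixes suggested by the list above.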

Grok Imagine Video 1.5: Image to Video

Download Workflow

Download JSON or search “Grok Imagine Video 1.5” in Template Library

Workflow Overview

This workflow uses three nodes:

LoadImage — provides the starting image frame
GrokVideoNode — the core node configured with the grok-imagine-video-1.5 model
SaveVideo — saves the generated video with native audio
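
For readers who script ComfyUI, the three-node graph can be sketched as a Python dictionary in ComfyUI's API (prompt) format, where a link is written as `[source_node_id, output_index]`. Treat this as an illustration only: the input names on `GrokVideoNode` (`model`, `prompt`, `image`, `resolution`, `duration`, `seed`) are assumptions based on the settings described in this guide, not taken from the node definition, and `build_workflow` is a hypothetical helper. Exporting the real workflow from ComfyUI shows the actual input names.

```python
import json


def build_workflow(image_name: str, prompt: str, seed: int = 0) -> dict:
    """Three-node graph: LoadImage -> GrokVideoNode -> SaveVideo."""
    return {
        "1": {"class_type": "LoadImage", "inputs": {"image": image_name}},
        "2": {
            "class_type": "GrokVideoNode",
            "inputs": {
                # Input names below are assumed, not taken from the node definition.
                "model": "grok-imagine-video-1.5",
                "prompt": prompt,
                "image": ["1", 0],  # link: output 0 of node "1"
                "resolution": "720p",
                "duration": 5,
                "seed": seed,
            },
        },
        "3": {
            "class_type": "SaveVideo",
            "inputs": {"video": ["2", 0], "filename_prefix": "grok_video"},
        },
    }


print(json.dumps(build_workflow("start.png", "waves rolling onto a beach"), indent=2))
```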

Steps to Run

Upload a starting image — use the LoadImage node to load your reference image
Enter your prompt — describe the motion, atmosphere, and scene dynamics in the GrokVideoNode node
Select model — ensure grok-imagine-video-1.5 is selected
Set resolution — choose output resolution (720p recommended)
Set duration — choose the video length in seconds
Set seed — control reproducibility of results
Click Queue — press Ctrl+Enter to generate
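
The final step can also be done from a script, since a ComfyUI server accepts queued workflows as a JSON `POST` to its `/prompt` endpoint. The sketch below only builds the request and does not send it. It assumes a local server at the default address `http://127.0.0.1:8188` and a workflow already in API format; `make_queue_request` is a hypothetical helper. It also leaves out the login that Partner Nodes require, so a real run would need that handled separately.

```python
import json
import urllib.request


def make_queue_request(
    workflow: dict, server: str = "http://127.0.0.1:8188"
) -> urllib.request.Request:
    """Build (but do not send) a POST to the server's /prompt endpoint."""
    body = json.dumps({"prompt": workflow}).encode("utf-8")
    return urllib.request.Request(
        f"{server}/prompt",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = make_queue_request(
    {"1": {"class_type": "LoadImage", "inputs": {"image": "start.png"}}}
)
# Sending is left commented out so the sketch runs without a server:
# with urllib.request.urlopen(request) as response:
#     print(json.load(response))
```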

Output

The generated video includes native audio synchronized with the motion, saved automatically via the SaveVideo node.

Tips

Use high-quality input images for best results
The prompt works best when it describes both the visual scene and the motion dynamics
For different results, try varying the seed value
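
Varying the seed can be automated when workflows are submitted by script. This is a minimal sketch, assuming an API-format workflow in which the video node exposes an input named `seed` (an assumed name). The helper `seed_variants`, the node id `"2"`, and the 32-bit seed range are arbitrary choices for illustration.

```python
import copy
import random


def seed_variants(
    workflow: dict, node_id: str, count: int, rng: random.Random
) -> list[dict]:
    """Return `count` deep copies of the workflow, each with a fresh seed."""
    variants = []
    for _ in range(count):
        variant = copy.deepcopy(workflow)  # keep the original graph untouched
        variant[node_id]["inputs"]["seed"] = rng.randrange(2**32)
        variants.append(variant)
    return variants


base = {"2": {"class_type": "GrokVideoNode", "inputs": {"seed": 0}}}
variants = seed_variants(base, "2", 3, random.Random(1234))
print([v["2"]["inputs"]["seed"] for v in variants])
```

Passing a seeded `random.Random` makes the sweep itself repeatable, so a result worth keeping can be traced back to the seed that produced it.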
