Skip to main content
The Grok Imagine Video 1.5 Partner Node enables high-quality video generation with native audio from a single image input. Powered by xAI’s latest Grok model, it produces realistic motion with synchronized audio and supports up to 1080p resolution. The node supports two model variants selected via its model parameter:
  • grok-imagine-video — the previous generation model, supports optional image input
  • grok-imagine-video-1.5 — the latest model, always requires an input image and supports 1080p output
Both variants generate native audio — sound effects, ambience, and dialogue are synthesized in the same pass, with no separate audio pipeline needed. Video duration ranges from 1 to 15 seconds.
To use the Partner Nodes, you need to ensure that you are logged in properly and using a permitted network environment. Please refer to the Partner Nodes Overview section of the documentation to understand the specific requirements for using the Partner Nodes.
Make sure your ComfyUI is updated.Workflows in this guide can be found in the Workflow Templates. If you can’t find them in the template, your ComfyUI may be outdated. (Desktop version’s update will delay sometime)If nodes are missing when loading a workflow, possible reasons:
  1. You are not using the latest ComfyUI version (Nightly version)
  2. Some nodes failed to import at startup

Grok Imagine Video 1.5: Image to Video

Grok Imagine Video 1.5 Workflow

Run in Comfy Cloud

Open in Comfy Cloud

Download Workflow

Download JSON or search “Grok Imagine Video 1.5” in Template Library

Workflow Overview

This workflow uses three nodes:
  • LoadImage — provides the starting image frame
  • GrokVideoNode — the core node configured with the grok-imagine-video-1.5 model
  • SaveVideo — saves the generated video with native audio

Steps to Run

  1. Upload a starting image — use the LoadImage node to load your reference image
  2. Enter your prompt — describe the motion, atmosphere, and scene dynamics in the GrokVideoNode node
  3. Select model — ensure grok-imagine-video-1.5 is selected
  4. Set resolution — choose output resolution (720p recommended)
  5. Set duration — choose the video length in seconds
  6. Set seed — control reproducibility of results
  7. Click Queue — press Ctrl+Enter to generate

Output

The generated video includes native audio synchronized with the motion, saved automatically via the SaveVideo node.

Tips

  • Use high-quality input images for best results
  • The prompt works best when it describes both the visual scene and the motion dynamics
  • For different results, try varying the seed value