ComfyUI Bernini-R Introduction
Bernini-R is ByteDance’s renderer-only Wan 2.2 model for in-context image and video conditioning. It uses a set of conditioning streams (source video, reference images, reference video) to guide generation — no LoRA training or fine-tuning required. Key capabilities:- Multiple task types in one model: image/video generation, editing, relighting, restyling, subject insertion
- In-context conditioning: reference images/videos act as visual prompts, injected as tokens
- Lightweight: renderer-only model — no diffusion-based text-to-video backbone
- Flexible input support: single or multi-image references, video-to-video, reference-guided editing
| Task | Inputs | Description |
|---|---|---|
| t2v | Text prompt | Text-to-video generation |
| v2v | Source video | Video-to-video restyling |
| rv2v | Source video + ref images(s) | Reference-guided video editing (relighting, subject insertion) |
| r2v | Reference image(s) only | Reference-to-video generation |
| img | Source image + text prompt | Image editing (relighting, restyling, subject insertion) |
| ads2v | Source video + ref video | Insert image/video content into source video |
Model Installation
Download the required model weights and save them to the corresponding ComfyUI folders: text_encoders: vae: loras: diffusion_models:Example Workflows
1. Image Editing
What it does: Generate an edited image with matched lighting and view a side-by-side before/after comparison. Ideal for portrait and product relighting, consistent lighting across photo sets, and e-commerce catalog photography.Download Workflow
Download JSON or search “Bernini-R” in Template Library
Run on Comfy Cloud
Open in Comfy Cloud


Steps to Run
- Select Task Type — choose your task (Image Editing, Subject to Image, etc.)
- Connect Inputs — load source image and optional reference images
- Write Prompt — describe the desired edit
- Run — click Queue or use
Cmd+Enter
image0, image1, … to reference each image. Not needed for Image Editing — that task uses source_image instead.
Learn about Subgraph
This workflow uses Subgraph nodes for modular processing. Check out the Subgraph documentation to learn how to customize and extend the workflow.
2. Video Editing
What it does: Generate an edited video with consistent relighting using Bernini-R. Connect a source video, optional reference image(s) or reference video, pick the task type, write a prompt, and run.Download Workflow
Download JSON or search “Bernini-R” in Template Library
Run on Comfy Cloud
Open in Comfy Cloud
Steps to Run
- Load Source Video — connect your input video
- (Optional) Load References — reference image(s) or reference video
- Select Task Type — v2v, rv2v, r2v, or ads2v
- Write Prompt — describe the desired edit
- Run — click Queue or use
Cmd+Enter
image0, image1, … in the prompt if references play different roles.
Learn about Subgraph
This workflow uses Subgraph nodes for modular processing. Check out the Subgraph documentation to learn how to customize and extend the workflow.
Community Resources
- Bernini GitHub (bytedance/Bernini) — Research paper and task documentation
- Comfy-Org/Bernini-R — Official ComfyUI model weights
- Bernini: Latent Semantic Planning for Video Diffusion — Research paper