Documentation Index
Fetch the complete documentation index at: https://docs.comfy.org/llms.txt
Use this file to discover all available pages before exploring further.
This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub
Overview
Run MoGe on a single image to estimate depth and geometry. This node processes an input image through the MoGe model to generate a 3D point cloud, depth map, camera intrinsics, a mask, and surface normals.Inputs
| Parameter | Data Type | Required | Range | Description |
|---|---|---|---|---|
moge_model | MOGE_MODEL | Yes | N/A | The MoGe model to use for inference. |
image | IMAGE | Yes | N/A | The input image for depth and geometry estimation. |
resolution_level | INT | Yes | 0 to 9 | Controls the processing resolution. 0 is fastest, 9 provides the most detail. (default: 9) |
fov_x_degrees | FLOAT | Yes | 0.0 to 170.0 | Horizontal field of view of the source camera in degrees. Sets the focal length used to unproject the depth map into 3D. Set to 0.0 to automatically recover the field of view from the predicted points. (default: 0.0) |
batch_size | INT | Yes | 1 to 64 | Number of images processed per inference call. Lower this value if you run out of memory when processing long videos or large image sets. (default: 4) |
force_projection | BOOLEAN | Yes | True/False | (Advanced) Forces projection of the predicted points. (default: True) |
apply_mask | BOOLEAN | Yes | True/False | When enabled, sets masked-out (sky or invalid) pixels to infinity in the points and depth outputs. This helps meshing tools ignore these areas. Disable to keep the raw predicted geometry everywhere; the mask is still returned separately. (default: True) |
Outputs
| Output Name | Data Type | Description |
|---|---|---|
moge_geometry | MOGE_GEOMETRY | A dictionary containing the estimated geometry. It includes the original image, and may contain points (3D point cloud), depth (depth map), intrinsics (camera intrinsics matrix), mask (mask identifying valid pixels), and normal (surface normals). |
Source fingerprint (SHA-256):
5213b280513850eeef2e22ae723ebb015789109435e28ddd79f91f9a4b4a1e79