Model highlights
- Compact powerhouse: Delivers state-of-the-art (SOTA) performance comparable to that of larger models while running on consumer hardware.
- Versatile generation: Supports high-quality Text-to-Video and Image-to-Video generation of 5-10 second clips with exceptional consistency.
- Precise control: Follows instructions closely for camera movements, physics, and emotional expressions.
- Cinematic quality: Native 720p output (upscalable to 1080p) with professional aesthetics.
- Rich features: Supports diverse styles (realistic, anime, 3D) and in-video text rendering in Chinese and English.