Upload a reference image, describe what you want in the video, choose the duration (4–10 s), and click Generate. Your prompt will be automatically enhanced into the optimal format before generation.
Model: 15B single-stream Transformer (distilled, 8-step inference) | Resolution: 448×256 → 540p | FPS: 25