daVinci-MagiHuman — Audio-Video Generation

Upload a reference image, describe what you want in the video, choose the duration (4–10 s), and click Generate. Your prompt will be automatically enhanced into the optimal format before generation.

Model: 15B single-stream Transformer (distilled, 8-step inference) | Resolution: 448×256 → 540p | FPS: 25

4 10