Wan Vace
Wan VACE (Wan 2.1-VACE) is a state-of-the-art open source AI video generator developed by Alibaba’s Tongyi Lab, released in early 2025. It is part of the Wan 2.1 model suite and supports text-to-video, image-to-video, video editing, and reference-based generation using a modular Video Condition Unit (VCU). Wan VACE is available in both 1.3B and 14B parameter sizes, under an Apache 2.0 license, and is hosted on GitHub, Hugging Face, and ModelScope. It enables multi-modal control including masks, flow, pose, and outpainting, and is optimized to run on consumer GPUs (8GB+), making it highly accessible for researchers and creators in the open source video AI community.

—>
This scenario is to test the AI models animation ability with natural physics like walking and the ability to keep the essence of image in the video
PROMPT USED IN AI
"A red Ford Mustang cruising smoothly down a long, empty highway during sunset. The camera follows the car from behind in a steady, cinematic motion, keeping the car centered while the distant mountains and road stretch ahead. The camera movement is slow, smooth, and dramatic, capturing the beauty of the scene and the motion of the car."
Show Observations
Below are the discovered pros and cons for
Wan Vace
image generator ai after using and testing it for the given prompts.
Pros
Available in two scales (1.3B for low-end GPUs and 14B for high-res output)
Optimized for both Chinese and English text prompts
Supports a wide range of tasks: text-to-video, image-to-video, video editing, pose and flow control
Cons
Quality can vary significantly between generations without tight prompt tuning
Motion in outputs is often minimal, especially in 1.3B version