Waiting for your creations!
Enjoy perfectly aligned sound and visuals, delivering lifelike voice quality, ASMR effects, ambient audio, music, and multilingual capabilities.

Produce stable 10-second 1080P videos at 24fps with enhanced motion, richer spatial-temporal detail, and full storytelling performance.

Achieve deeper natural language comprehension and accurate instruction execution, enabling creation of images or videos directly from text input.

Wan2.5 is Alibaba’s newest multimodal generative model. The Wan2.5-Preview version supports text-to-video, image-to-video, text-to-image, and image editing, introducing the industry’s first audio-visual synchronized video generation system capable of producing 1080P, 24fps HD videos with aligned voice, sound effects, and music.
Creators can convert text or images directly into HD videos with synchronized audio, enabling rapid production of short films, animations, and trailers while significantly reducing production time and cost.

Wan2.5 allows developers to generate animated scenes and characters from text or images, supporting cinematic cutscenes, CG sequences, and immersive world-building within games.

Educators can transform lessons, diagrams, and concepts into visually engaging teaching videos with narration and background sound, providing more intuitive and immersive learning experiences.

Marketing teams can turn product images and ad copy into promotional videos enhanced with voiceovers and music, enabling fast-turnaround, high-quality campaign content that boosts engagement and conversion.

With audio-visual synchronization, Wan2.5 enables the creation of virtual presenters or AI avatars for product showcases, live events, and interactive experiences, achieving more natural and expressive communication.

Wan2.5 integrates text, images, and video to generate high-quality multilingual content suited for international marketing, educational resources, and cross-platform media distribution, extending audience impact and reach.

Emily Carter
Wan2.5 completely changed my video workflow. I can turn simple text ideas into high-quality cinematic videos with perfectly synced audio in minutes. It saves me so much production time — absolutely impressive!
Daniel Novak
The ability to generate dynamic scenes and character animations from text is insane. It helped us prototype in-game cutscenes faster than ever. Wan2.5 is becoming an essential tool in our studio.
Sophia Lee
I used Wan2.5 to create educational videos with narration and background music. My students find the content more engaging and easier to understand. It’s a real game-changer for teaching.
Marco Hernández
Marketing videos that used to take days now take minutes. The voiceover syncing and cinematic visuals look professional and help boost campaign performance. Highly recommended!
Amina Yusuf
Wan2.5 shows remarkable reasoning and multimodal understanding. Generating videos from instructions and images demonstrates huge potential for future AI applications.
Thomas Miller
Creating virtual presenters with realistic voice and perfect audio-visual alignment is incredible. It feels like a real person is speaking. Wan2.5 opens a lot of opportunities for virtual media production.
