The goal of this lora is to reproduce the video style similar to live wallpaper, for those who play league of legends remember the launcher opening videos, that's the goal, but you can also use it to create your lofi videos :D enjoy.
[Wan I2V 720P Fast Fusion - 4 (or more) steps]
Wan I2V 720P Fast Fusion combines 2 Live Wallpaper LoRA (1 Exclusive) with Lightx2v, AccVid, MoviiGen and Pusa LoRAs for ultra-fast 4+ steps generation while maintaining cinematic quality.
🚀 Lightx2v LoRA – accelerates generation by 20x through 4-step distillation, enabling sub 2-minute videos on RTX 4090 with only 8GB VRAM requirements. 🎬 AccVid LoRA – improves motion accuracy and dynamics for expressive sequences. 🌌 MoviiGen LoRA – adds cinematic depth and flow to animation, enhancing visual storytelling. 🧠 Pusa LoRA – provides fine-grained temporal control with zero-shot multi-task capabilities (start-end frames, video extension) while achieving 87.32% VBench score. 🧠 Wan I2V 720p (14B) base model – providing strong temporal consistency and high-resolution outputs for expressive video scenes.
[Wan I2V 720P]
The dataset used consists of 149 videos (each one hand-selected) in 1280x720x96 resolution but was trained in 244p and 480p and 64 frames with 64 dim (L40s).
Trigger word was used so it needs to be included in the prompt: l1v3w4llp4p3r
[Hunyuan T2V]
The dataset used consists of 529 videos (each one hand-selected) in 1280x720x96 resolution but was trained in 244p and 72 frames with 64 dim (multiple RTX 4090).
No captions or activation words were used, the only control you will need to adjust is the lora strength.
Another important note is that it was trained in full blocks, I don't know how it will behave when mixing 2 or more loras, if you want to mix and are not getting a good result, try disabling single blocks.
I recommend using lora strength between 0.2 and 1.2 maximum, resolution 1280x720 or generate at 512 and upscale later, minimum 3 seconds (72 frames + 1).
[Wan I2V 720P]
The dataset used consists of 149 videos (each one hand-selected) in 1280x720x96 resolution but was trained in 244p and 480p and 64 frames with 64 dim (L40s).
Trigger word was used so it needs to be included in the prompt: l1v3w4llp4p3r
For more details see the version description Share your results.