This is a new family of open source video models designed to take on Gen-2. There is a 576x320 model that uses under 8gb of vram, and the 1024x576 model that uses under 16gb of vram. The recommended workflow is to render with the 576 model, then use vid2vid via the 1111 text2video extension to upscale to 1024x576. This allows for better compositions overall and faster exploration of ideas before committing to a high res render.
Just tried it and the result was goofy but the frame consistency and smoothness of movement is insane! I can’t wait to see how things develop.