Long_Video_Generation

A pipeline to generate long videos according to text prompt
Xinchen Zhang
Tsinghua University

Pipeline

A spectacular waterfall	A car driving down the road.

Astronauts traveling in space	A cat looking out the window

Before inference, you need to use LLMs to obtain segmented fragments based on the prompt, along with complex descriptions of each fragment.

We provide a template in template.txt. Then copy and paste the template to ChatGPT, you can get the generated prompts.

We offer two ways to generate a long video. If you choose I2VGen-XL as the backbone, run:

python pipeline_i2vgenxl.py --seed 1234 --fps 16

If you choose SVD as the backbone, run:

python pipeline_svd.py --seed 1234 --fps 16

After that, we use EMA-VFI to interpolate the video.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
README.md		README.md
pipeline_i2vgenxl.py		pipeline_i2vgenxl.py
pipeline_svd.py		pipeline_svd.py
template.txt		template.txt