computer vision2, Conditional Control, diffusion model, Fast Fourier Convolution, Inpaint, text-to-image, Video Generation.