· March 22

A diffusion probabilistic model (DPM), which constructs a forward diffusion process by gradually adding noise to data points and learns the reverse denoising process to generate new samples, has been shown to handle complex data distributions. Despite its recent success in image synthesis, applying DPMs to video generation remains challenging due to the high-dimensional data space. Previous methods usually adopt a standard diffusion process, where frames in the same video clip are corrupted with independent noise, ignoring content redundancy and temporal correlation. This work presents a decomposed diffusion process that resolves the per-frame noise into a base noise shared among all frames and a residual noise that varies along the time axis. The denoising pipeline employs two jointly learned networks that match this noise decomposition. Experiments on various datasets confirm that our approach, termed VideoFusion, surpasses both GAN-based and diffusion-based alternatives in high-quality video generation. We further show that our decomposed formulation can benefit from pre-trained image diffusion models and well supports text-conditioned video creation.
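
To make the decomposition concrete, here is a minimal PyTorch sketch of the forward noising step as I read the abstract; the mixing weight `lam`, the shapes, and all function names are illustrative assumptions, not the paper's actual implementation.

```python
import torch

def decomposed_noise(b, f, c, h, w, lam=0.5, device="cpu"):
    """Per-frame noise = sqrt(lam) * shared base + sqrt(1 - lam) * residual.

    `lam` is a hypothetical mixing weight; since lam + (1 - lam) = 1,
    the mixture keeps the unit variance that standard DPM noise requires.
    """
    base = torch.randn(b, 1, c, h, w, device=device)      # shared along the time axis
    residual = torch.randn(b, f, c, h, w, device=device)  # varies frame to frame
    eps = lam ** 0.5 * base + (1 - lam) ** 0.5 * residual
    return eps, base, residual

def forward_diffuse(x0, alpha_bar_t, lam=0.5):
    """Forward step q(x_t | x_0) of a standard DPM, but with decomposed noise.

    x0:          clean video, shape (batch, frames, channels, height, width)
    alpha_bar_t: cumulative noise-schedule coefficient at step t, in (0, 1)
    """
    eps, base, residual = decomposed_noise(*x0.shape, lam=lam, device=x0.device)
    xt = alpha_bar_t ** 0.5 * x0 + (1 - alpha_bar_t) ** 0.5 * eps
    return xt, base, residual
```

In training, the abstract's two jointly learned networks would then each regress one component (one the shared `base`, the other the per-frame `residual`) so that their weighted sum reconstructs the full noise; the concrete conditioning and network architectures are beyond what the abstract specifies.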

