- 论文:Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
- 项目地址:https://imagen.research.google/
- 代码(非官方):https://github.com/deep-floyd/IF
- 模型权重:https://huggingface.co/DeepFloyd/IF-I-XL-v1.0
- 🤗关注公众号 funNLPer 白嫖有用的知识🤗
文章目录
- 1. Imagen 模型结构
-
- 1.1 Pretrain Text Encoder
- 1.2 基础扩散模型
-
- 1.2.1 classifier-free guidance
- 1.2.2 Large guidance weight samplers(Static thresholdi