목록ICLR (2)
hwchung 님의 블로그
AN IMAGE IS WORTH 16X16 WORDS:TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE ICLR 2021논문 링크: https://arxiv.org/abs/2010.11929github 링크: https://github.com/google-research/vision_transformer0. AbstractWhile the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in ..
Flow Matching for Generative ModelingICLR 2023논문 링크: https://arxiv.org/abs/2210.027470. 핵심어떤 data-distribution 에서 simple-distribution (e.g. standard gaussian) 으로 변화하는 path (e.g. forward-diffusion process) 를 좀 더 잘 정의해서, 그것의 inverse (image generation via the diffusion model) 또한 더 잘 되도록 하고싶다가 핵심.노이즈를 더해가는 방식 비교(Top) Diffusion forward process(Middle) Diffusion Flow-matching(Bottom) Optimal transport..