목록VIT (1)
hwchung 님의 블로그
[Paper Review] ICLR 2021, AN IMAGE IS WORTH 16X16 WORDS: TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE 논문리뷰
AN IMAGE IS WORTH 16X16 WORDS:TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE ICLR 2021논문 링크: https://arxiv.org/abs/2010.11929github 링크: https://github.com/google-research/vision_transformer0. AbstractWhile the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in ..
[Paper Review]
2026. 2. 25. 16:15