跳至主導覽 跳至搜尋 跳過主要內容

Adaptive Locality Guidance: Enhancing Vision Transformers on Tiny Datasets

研究成果: 書貢獻/報告類型會議論文篇章

摘要

As studies show that lack of sufficient data leads Vision Transformers (VTs) to mainly learn global information from the input, the recently proposed Locality Guidance (LG) approach [1] uses a lightweight Convolutional Neural Network (CNN) pretrained on the same dataset to guide the VT into learning local features as well. Under a dual learning framework, the use of the LG significantly boosts the accuracy of different VTs on multiple tiny datasets, at the mere cost of a slight increase in training time. However, we also find that the use of the LG prevents the models from learning global aspects to their full ability. To remedy to this limitation, we propose the Adaptive Locality Guidance (ALG), an improved version which uses the LG as an initialization tool, and after a certain number of epochs lets the VT learn by itself in supervised fashion. Specifically, we estimate the needed duration for the LG based on a threshold set on the evolution of the distance separating the features of the VT to those of the lightweight CNN used for guidance. As our improved method can be used in plug-and-play fashion, we successfully apply it across 4 different VTs on the CIFAR-100 dataset. Experimental results show that the proposed ALG significantly reduces the computational cost added in training by the LG, and further increases the validation accuracy by up to 5.49%, thereby achieving new State-Of-The-Art (SOTA) results among tiny VTs.

原文英語
主出版物標題2025 IEEE International Conference on Consumer Electronics, ICCE 2025
發行者Institute of Electrical and Electronics Engineers Inc.
ISBN(電子)9798331521165
DOIs
出版狀態已發佈 - 2025
事件2025 IEEE International Conference on Consumer Electronics, ICCE 2025 - Las Vegas, 美国
持續時間: 2025 1月 112025 1月 14

出版系列

名字Digest of Technical Papers - IEEE International Conference on Consumer Electronics
ISSN(列印)0747-668X
ISSN(電子)2159-1423

會議

會議2025 IEEE International Conference on Consumer Electronics, ICCE 2025
國家/地區美国
城市Las Vegas
期間2025/01/112025/01/14

ASJC Scopus subject areas

  • 工業與製造工程
  • 電氣與電子工程

指紋

深入研究「Adaptive Locality Guidance: Enhancing Vision Transformers on Tiny Datasets」主題。共同形成了獨特的指紋。

引用此