Self-Supervised Multi-Label Classification with Global Context and Local Attention

Chun Yen Chen, Mei Chen Yeh

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Self-supervised learning has proven highly effective across various tasks, showcasing its versatility in different applications. Despite these achievements, the challenges inherent in multi-label classification have seen limited attention. This paper introduces GAELLE, a novel self-supervised multi-label classification framework that simultaneously captures image context and object information. GAELLE employs a combination of global context and local attention mechanisms to discern diverse levels of semantic information in images. The global component comprehensively learns image content while local attention eliminates object-irrelevant nuances by aligning embeddings with a projection head. The integration of global and local features in GAELLE effectively captures intricate object-scene relationships. To further enhance this capability, we introduce a global and local swap prediction technique, facilitating the nuanced interplay between various objects and scenes within images. Experimental results showcase GAELLE’s state-of-the-art performance in self-supervised multi-label classification tasks, highlighting its effectiveness in uncovering complex relationships between multiple objects and scenes in images.

Original languageEnglish
Title of host publicationICMR 2024 - Proceedings of the 2024 International Conference on Multimedia Retrieval
PublisherAssociation for Computing Machinery, Inc
Pages934-942
Number of pages9
ISBN (Electronic)9798400706028
DOIs
Publication statusPublished - 2024 May 30
Event2024 International Conference on Multimedia Retrieval, ICMR 2024 - Phuket, Thailand
Duration: 2024 Jun 102024 Jun 14

Publication series

NameICMR 2024 - Proceedings of the 2024 International Conference on Multimedia Retrieval

Conference

Conference2024 International Conference on Multimedia Retrieval, ICMR 2024
Country/TerritoryThailand
CityPhuket
Period2024/06/102024/06/14

Keywords

  • Attention
  • Multi-label classification
  • Self-supervised learning

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'Self-Supervised Multi-Label Classification with Global Context and Local Attention'. Together they form a unique fingerprint.

Cite this