Improve Fine-grained Visual Classification Accuracy by Controllable Location Knowledge Distillation

You Lin Tsai, Cheng Hung Lin, Po Yung Chou*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The current state-of-the-art network models have achieved remarkable performance. However, they often face an issue of having excessively large architectures, making them challenging to deploy on edge devices. In response to this challenge, a groundbreaking solution known as knowledge distillation has been introduced. The concept of knowledge distillation involves transferring information from a complex teacher model to a simpler student model, effectively reducing the model's complexity. Prior approach has demonstrated promising transfer effects through this technique. Nevertheless, in the field of fine-grained image classification, there has been limited exploration of distillation methods custom-tailored specifically for this domain. In this paper, we focus on knowledge distillation specifically designed for fine-grained image recognition. Notably, this strategy is inspired by the Class Activation Maps (CAM). We first train a complex model and use it generated feature maps with spatial information, which is call hint maps. Furthermore, we propose an adjustment strategy for this hint map, which can control local distribution of information. Referring to it as Controllable Size CAM (CTRLS-CAM). Use it as a guide allowing the student model to effectively learn from the teacher model's behavior and concentrate on discriminate details. In contrast to conventional distillation models, this strategy proves particularly advantageous for fine-grained recognition, enhancing learning outcomes and enabling the student model to achieve superior performance. In CTRLS-CAM method, we refine the hint maps value distribution, redefining the relative relationships between primary and secondary feature areas. We conducted experiments using the CUB200-2011 dataset, and our results demonstrated a significant accuracy improvement of about 5% compared to the original non-distilled student model. Moreover, our approach achieved a 1.23% enhancement over traditional knowledge distillation methods.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Consumer Electronics, ICCE 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350324136
DOIs
Publication statusPublished - 2024
Event2024 IEEE International Conference on Consumer Electronics, ICCE 2024 - Las Vegas, United States
Duration: 2024 Jan 62024 Jan 8

Publication series

NameDigest of Technical Papers - IEEE International Conference on Consumer Electronics
ISSN (Print)0747-668X
ISSN (Electronic)2159-1423

Conference

Conference2024 IEEE International Conference on Consumer Electronics, ICCE 2024
Country/TerritoryUnited States
CityLas Vegas
Period2024/01/062024/01/08

Keywords

  • CAM
  • Deep Learning
  • Fine-grained Visual Classification
  • Knowledge Distillation

ASJC Scopus subject areas

  • Industrial and Manufacturing Engineering
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Improve Fine-grained Visual Classification Accuracy by Controllable Location Knowledge Distillation'. Together they form a unique fingerprint.

Cite this