TY - JOUR
T1 - Self-supervised learning of pseudo classes for generalized zero-shot fine-grained recognition
AU - Chen, Yan He
AU - Yeh, Mei Chen
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
PY - 2024
Y1 - 2024
N2 - Generalized zero-shot learning (GZSL) attempts to recognize visual instances from both seen and unseen classes by transferring knowledge from seen classes to unseen classes through semantic information (e.g., attributes). Generative methods are commonly employed to alleviate the issue of extreme data imbalance in which visual samples from unseen classes are not available during training, by synthesizing training samples for unseen classes from class prototypes. However, in the context of GZSL applied to fine-grained recognition, a notable complication arises. Similar class prototypes among different categories lead to ambiguity when generating synthetic data for classification. In response, we present a novel solution: a self-supervised pseudo-labeling (SSPL) module designed to enhance the generation of discerning synthetic data. This enhancement is achieved through an unsupervised grouping of fake and real samples using pseudo classes. By doing so, the SSPL module addresses the challenge of generating discriminative fake data, ultimately improving the overall quality of synthesized samples for classification. Our experimental results, conducted on three widely recognized GZSL datasets, demonstrate the effectiveness of the proposed method. Notably, the SSPL module not only produces well-distributed synthetic samples, but also enhances the discriminative and generalizable visual features derived from both real and synthetic samples within the GZSL framework.
AB - Generalized zero-shot learning (GZSL) attempts to recognize visual instances from both seen and unseen classes by transferring knowledge from seen classes to unseen classes through semantic information (e.g., attributes). Generative methods are commonly employed to alleviate the issue of extreme data imbalance in which visual samples from unseen classes are not available during training, by synthesizing training samples for unseen classes from class prototypes. However, in the context of GZSL applied to fine-grained recognition, a notable complication arises. Similar class prototypes among different categories lead to ambiguity when generating synthetic data for classification. In response, we present a novel solution: a self-supervised pseudo-labeling (SSPL) module designed to enhance the generation of discerning synthetic data. This enhancement is achieved through an unsupervised grouping of fake and real samples using pseudo classes. By doing so, the SSPL module addresses the challenge of generating discriminative fake data, ultimately improving the overall quality of synthesized samples for classification. Our experimental results, conducted on three widely recognized GZSL datasets, demonstrate the effectiveness of the proposed method. Notably, the SSPL module not only produces well-distributed synthetic samples, but also enhances the discriminative and generalizable visual features derived from both real and synthetic samples within the GZSL framework.
KW - Generalized zero-shot learning
KW - Generative modeling
KW - Self-supervised learning
KW - Visual-semantic embedding
UR - http://www.scopus.com/inward/record.url?scp=85191695737&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85191695737&partnerID=8YFLogxK
U2 - 10.1007/s11042-024-19266-w
DO - 10.1007/s11042-024-19266-w
M3 - Article
AN - SCOPUS:85191695737
SN - 1380-7501
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
ER -