TY - GEN
T1 - Artist-based classification via deep learning with multi-scale weighted pooling
AU - Jangtjik, Kevin Alfianto
AU - Yeh, Mei Chen
AU - Hua, Kai Lung
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/10/1
Y1 - 2016/10/1
N2 - For analyzing digital images of paintings we propose a new approach to categorize them based on artist. Determining the authorship of a painting is challenging because common subjects are illustrated in paintings, and paintings of an artist may not have a unique style. The proposed approach is built upon convolutional neural networks (CNN)-a class of biologically inspired vision model that recently demonstrates near-human performance on several visual recognition tasks. However, training a CNN model requires large scale training data of a fixed input image size (e.g. 224 × 224). In this paper, we propose to construct a multi-layer pyramid from an image, providing 21X more features than using a single layer (i.e., the original image) alone. We train a CNN model for each layer, and propose a new weighted fusion scheme to adaptively combine the decision results. To evaluate the proposed methods, we collect a new painting image dataset, categorized into 13 artists. As demonstrated in the experimental results, the proposed method achieves a promising result-88.08% recall rate in top-2 retrieval on the challenging classification task.
AB - For analyzing digital images of paintings we propose a new approach to categorize them based on artist. Determining the authorship of a painting is challenging because common subjects are illustrated in paintings, and paintings of an artist may not have a unique style. The proposed approach is built upon convolutional neural networks (CNN)-a class of biologically inspired vision model that recently demonstrates near-human performance on several visual recognition tasks. However, training a CNN model requires large scale training data of a fixed input image size (e.g. 224 × 224). In this paper, we propose to construct a multi-layer pyramid from an image, providing 21X more features than using a single layer (i.e., the original image) alone. We train a CNN model for each layer, and propose a new weighted fusion scheme to adaptively combine the decision results. To evaluate the proposed methods, we collect a new painting image dataset, categorized into 13 artists. As demonstrated in the experimental results, the proposed method achieves a promising result-88.08% recall rate in top-2 retrieval on the challenging classification task.
KW - Convolutional neural network
KW - Deep learning
KW - Digital painting image classification
KW - Spatial pyramid representation
UR - http://www.scopus.com/inward/record.url?scp=84994663029&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84994663029&partnerID=8YFLogxK
U2 - 10.1145/2964284.2967299
DO - 10.1145/2964284.2967299
M3 - Conference contribution
AN - SCOPUS:84994663029
T3 - MM 2016 - Proceedings of the 2016 ACM Multimedia Conference
SP - 635
EP - 639
BT - MM 2016 - Proceedings of the 2016 ACM Multimedia Conference
PB - Association for Computing Machinery, Inc
T2 - 24th ACM Multimedia Conference, MM 2016
Y2 - 15 October 2016 through 19 October 2016
ER -