TY - JOUR
T1 - Using Face Detection in Photographs and Cluster Analysis to Support Exploration of Social Relationships Between Historical Personages in a Biographical Database
AU - Sie, Shun Hong
AU - Ke, Hao Ren
AU - Chang, Su Bing
N1 - Publisher Copyright:
© 2021, The authors Published by WKW School of Communication & Information & NTU Libraries, Nanyang Technological University
PY - 2021
Y1 - 2021
N2 - Background. The Taiwan Biographical Database (TBDB) assembles biographical information of historical personages in Taiwan. It is a digital-humanities-oriented system that supports relational database operations, fulltext search, social network analysis, and geographic information system functions. Objectives.Through semi-automatic named entity recognition from the fulltext of biographies, TBDB assists historians to construct networks of social relationships. However, the fulltext of biographies may not describe all social relationships. Taking advantage of the fact that historical photographs were usually taken on formal occasions, historical photographs may be exploited to uncover additional relationships. This paper describes and evaluates a face detection function in TBDB that utilizes the OpenCV Library to detect faces of historical persons in old photographs. Furthermore, it employs hierarchical agglomerative clustering to combine fragmentary social networks. Results. An experiment using 45 historical photographs found that the face detection function achieved an average recall of 98% recall, but with low precision. To address the low precision rate, a user interface has been implemented in TBDB to facilitate review and deletion of false-positive faces in the photographs. Furthermore, cluster analysis is used to integrate social relationships found in biographies, those detected from historical photographs, and even relationships harvested from external sources, to produce comprehensive social networks for historical research.
AB - Background. The Taiwan Biographical Database (TBDB) assembles biographical information of historical personages in Taiwan. It is a digital-humanities-oriented system that supports relational database operations, fulltext search, social network analysis, and geographic information system functions. Objectives.Through semi-automatic named entity recognition from the fulltext of biographies, TBDB assists historians to construct networks of social relationships. However, the fulltext of biographies may not describe all social relationships. Taking advantage of the fact that historical photographs were usually taken on formal occasions, historical photographs may be exploited to uncover additional relationships. This paper describes and evaluates a face detection function in TBDB that utilizes the OpenCV Library to detect faces of historical persons in old photographs. Furthermore, it employs hierarchical agglomerative clustering to combine fragmentary social networks. Results. An experiment using 45 historical photographs found that the face detection function achieved an average recall of 98% recall, but with low precision. To address the low precision rate, a user interface has been implemented in TBDB to facilitate review and deletion of false-positive faces in the photographs. Furthermore, cluster analysis is used to integrate social relationships found in biographies, those detected from historical photographs, and even relationships harvested from external sources, to produce comprehensive social networks for historical research.
UR - http://www.scopus.com/inward/record.url?scp=85123615952&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123615952&partnerID=8YFLogxK
U2 - 10.32655/LIBRES.2021.1.4
DO - 10.32655/LIBRES.2021.1.4
M3 - Article
AN - SCOPUS:85123615952
SN - 1058-6768
VL - 31
SP - 42
EP - 55
JO - Libres
JF - Libres
IS - 1
ER -