TY - JOUR
T1 - Building a smart lecture-recording system using MK-CPN network for heterogeneous data sources
AU - Fang, Chiung Yao
AU - Luo, An Chun
AU - Deng, Yu Shan
AU - Lu, Chia Ju
AU - Chen, Sei Wang
N1 - Publisher Copyright:
© 2018, The Natural Computing Applications Forum.
PY - 2019/8/1
Y1 - 2019/8/1
N2 - Nowadays, lecture-recording systems play a vital role in collecting spoken discourse for e-learning. However, in view of the growing development of e-learning, the lack of content is becoming a problem. This research presents a smart lecture-recording (SLR) system that can record orations at the same level of quality as a human team, but with a reduced degree of human involvement. The proposed SLR system is composed of two subsystems, referred to as virtual cameraman (VC), and virtual director (VD), respectively. All camera man components of VC subsystem are automatic and can take actions that include target and event detection, tracking, and view searching. The videos taken by these three components are forwarded to the VD subsystem, in which the representative shot is chosen for recording or direct broadcasting. We refer to this function of the VD subsystem as shot selection that is based on the content analysis. The capability of shot selection is pre-trained through a machine-learning process characterized by the counter-propagation neural (CPN) network. However, the CPN network yielded poor results when the input data were heterogeneous data. To increases the accuracy of shot selection, we applied multiple kernel learning (MKL) techniques into CPN network, called MK-CPN, to transform all the heterogeneous data from different content analysis methods into unified space. A series of experiments for real lecture has been conducted. The results showed that the proposed SLR system can provide oration records close to some extend to those taken by real human teams. We believe that the proposed system may not be limited to live speeches, if it can be configured with appropriate training materials.
AB - Nowadays, lecture-recording systems play a vital role in collecting spoken discourse for e-learning. However, in view of the growing development of e-learning, the lack of content is becoming a problem. This research presents a smart lecture-recording (SLR) system that can record orations at the same level of quality as a human team, but with a reduced degree of human involvement. The proposed SLR system is composed of two subsystems, referred to as virtual cameraman (VC), and virtual director (VD), respectively. All camera man components of VC subsystem are automatic and can take actions that include target and event detection, tracking, and view searching. The videos taken by these three components are forwarded to the VD subsystem, in which the representative shot is chosen for recording or direct broadcasting. We refer to this function of the VD subsystem as shot selection that is based on the content analysis. The capability of shot selection is pre-trained through a machine-learning process characterized by the counter-propagation neural (CPN) network. However, the CPN network yielded poor results when the input data were heterogeneous data. To increases the accuracy of shot selection, we applied multiple kernel learning (MKL) techniques into CPN network, called MK-CPN, to transform all the heterogeneous data from different content analysis methods into unified space. A series of experiments for real lecture has been conducted. The results showed that the proposed SLR system can provide oration records close to some extend to those taken by real human teams. We believe that the proposed system may not be limited to live speeches, if it can be configured with appropriate training materials.
KW - Automatic lecture-recording system
KW - Shot selection
KW - Virtual cameraman
KW - Virtual director
UR - http://www.scopus.com/inward/record.url?scp=85040086482&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85040086482&partnerID=8YFLogxK
U2 - 10.1007/s00521-017-3328-6
DO - 10.1007/s00521-017-3328-6
M3 - Article
AN - SCOPUS:85040086482
SN - 0941-0643
VL - 31
SP - 3759
EP - 3777
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 8
ER -