TY - JOUR
T1 - Dimensionality reduction for multivariate time-series data mining
AU - Wan, Xiaoji
AU - Li, Hailin
AU - Zhang, Liping
AU - Wu, Yenchun Jim
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2022/5
Y1 - 2022/5
N2 - A multivariate time series is one of the most important objects of research in data mining. Time and variables are two of its distinctive characteristics that add the complication of the algorithms applied to data mining. Reduction in the dimensionality is often regarded as an effective way to address these issues. In this paper, we propose a method based on principal component analysis (PCA) to effectively reduce the dimensionality. We call it “piecewise representation based on PCA” (PPCA), which segments multivariate time series into several sequences, calculates the covariance matrix for each of them in terms of the variables, and employs PCA to obtain the principal components in an average covariance matrix. The results of the experiments, including retained information analysis, classification, and a comparison of the central processing unit time consumption, demonstrate that the PPCA method used to reduce the dimensionality in multivariate time series is superior to the prior methods.
AB - A multivariate time series is one of the most important objects of research in data mining. Time and variables are two of its distinctive characteristics that add the complication of the algorithms applied to data mining. Reduction in the dimensionality is often regarded as an effective way to address these issues. In this paper, we propose a method based on principal component analysis (PCA) to effectively reduce the dimensionality. We call it “piecewise representation based on PCA” (PPCA), which segments multivariate time series into several sequences, calculates the covariance matrix for each of them in terms of the variables, and employs PCA to obtain the principal components in an average covariance matrix. The results of the experiments, including retained information analysis, classification, and a comparison of the central processing unit time consumption, demonstrate that the PPCA method used to reduce the dimensionality in multivariate time series is superior to the prior methods.
KW - Covariance matrix
KW - Piecewise representation
KW - Principal component analysis
KW - Time-series data mining
UR - http://www.scopus.com/inward/record.url?scp=85123161609&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123161609&partnerID=8YFLogxK
U2 - 10.1007/s11227-021-04303-4
DO - 10.1007/s11227-021-04303-4
M3 - Article
AN - SCOPUS:85123161609
SN - 0920-8542
VL - 78
SP - 9862
EP - 9878
JO - Journal of Supercomputing
JF - Journal of Supercomputing
IS - 7
ER -