TY - JOUR
T1 - Recurrent Learning on PM2.5 Prediction Based on Clustered Airbox Dataset
AU - Lo, Chia Yu
AU - Huang, Wen Hsing
AU - Ho, Ming Feng
AU - Sun, Min Te
AU - Chen, Ling Jyh
AU - Sakai, Kazuya
AU - Ku, Wei Shinn
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2022/10/1
Y1 - 2022/10/1
N2 - The progress of industrial development naturally leads to the demand for more electrical power. Unfortunately, due to the fear of the safety of nuclear power plants, many countries have relied on thermal power plants, which will cause more air pollutants during the process of coal burning. This phenomenon as well as increased vehicle emissions around us, have constituted the primary factors of serious air pollution. Inhaling too much particulate air pollution may lead to respiratory diseases and even death, especially PM2.5. By predicting the air pollutant concentration, people can take precautions to avoid overexposure to air pollutants. Consequently, accurate PM2.5 prediction becomes more important. In this study, we propose a PM2.5 prediction system, which utilizes the dataset from EdiGreen Airbox and Taiwan EPA. Autoencoder and Linear interpolation are adopted for solving the missing value problem. Spearman's correlation coefficient is used to identify the most relevant features for PM2.5. Two prediction models (i.e., LSTM and LSTM based on K-means) are implemented which predict PM2.5 value for each Airbox device. To assess the performance of the model prediction, the daily average error and the hourly average accuracy for the duration of a week are calculated. The experimental results show that LSTM based on K-means has the best performance among all methods. Therefore, LSTM based on K-means is chosen to provide real-time PM2.5 prediction through the Linebot.
AB - The progress of industrial development naturally leads to the demand for more electrical power. Unfortunately, due to the fear of the safety of nuclear power plants, many countries have relied on thermal power plants, which will cause more air pollutants during the process of coal burning. This phenomenon as well as increased vehicle emissions around us, have constituted the primary factors of serious air pollution. Inhaling too much particulate air pollution may lead to respiratory diseases and even death, especially PM2.5. By predicting the air pollutant concentration, people can take precautions to avoid overexposure to air pollutants. Consequently, accurate PM2.5 prediction becomes more important. In this study, we propose a PM2.5 prediction system, which utilizes the dataset from EdiGreen Airbox and Taiwan EPA. Autoencoder and Linear interpolation are adopted for solving the missing value problem. Spearman's correlation coefficient is used to identify the most relevant features for PM2.5. Two prediction models (i.e., LSTM and LSTM based on K-means) are implemented which predict PM2.5 value for each Airbox device. To assess the performance of the model prediction, the daily average error and the hourly average accuracy for the duration of a week are calculated. The experimental results show that LSTM based on K-means has the best performance among all methods. Therefore, LSTM based on K-means is chosen to provide real-time PM2.5 prediction through the Linebot.
KW - Air quality
KW - clustering
KW - prediction
KW - recurrent neural network
UR - http://www.scopus.com/inward/record.url?scp=85099108694&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85099108694&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2020.3047634
DO - 10.1109/TKDE.2020.3047634
M3 - Article
AN - SCOPUS:85099108694
SN - 1041-4347
VL - 34
SP - 4994
EP - 5008
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 10
ER -