Forecasting particulate matter concentration using nonlinear autoregression with exogenous input model

Rumaling, M.I.; Chee, F.P.; Chang, H.W.J.; Payus, C.M.; Kong, S.K.; Dayou, J.; Sentian, J.

doi:10.22034/GJESM.2022.01.03

Document Type : ORIGINAL RESEARCH ARTICLE

Authors

¹ Faculty of Science and Natural Resources, Universiti Malaysia Sabah, Kota Kinabalu, Sabah, Malaysia

² Preparatory Centre for Science and Technology, Universiti Malaysia Sabah, Kota Kinabalu, Sabah, Malaysia

³ Department of Atmospheric Sciences, National Central University, Taoyuan, 32001, Taiwan

⁴ Energy, Vibration and Sound Research Group, Faculty of Science and Natural Resources, Universiti Malaysia Sabah, Kota Kinabalu, Sabah, Malaysia

https://doi.org/10.22034/GJESM.2022.01.03

Abstract

BACKGROUND AND OBJECTIVES: Air quality in some developing countries is dominated by particulate matter, especially those with size 10 micrometers and smaller or PM₁₀. They can be inhaled and sometimes can get deep into lungs; some may even get into bloodstream and cause serious health problems. Therefore, future PM₁₀ concentration forecasting is important for early prevention and in urban development planning, which is crucial for developing cities. This paper presents the development of PM₁₀ forecasting model using nonlinear autoregressive with exogenous input model.
METHODS: To improve performance of nonlinear autoregressive with exogenous input model, principal component analysis is used prior to the model for variable selection. The first stage of principal component analysis involves Scree plot, which determines the number of principal components based on explained variance. This is then followed by selecting variables using a rotated component matrix, based on their strength of contribution towards variation of PM₁₀ concentration. To test the model, PM₁₀ data in Kota Kinabalu from 2003 – 2010 was used. Neural network models are developed using this data by varying number of input variables with the inclusion of temporal variables. The developed forecasting models are evaluated using data PM₁₀ in the city from 2011 to 2012. Four performance indicators, namely root mean square error, mean absolute error, index of agreement and fractional bias are reported.
FINDINGS: Results from principal component analysis show that five variables including wind direction index, relative humidity, ambient temperature, concentration of nitrogen dioxide and concentration of ozone strongly contribute to the variation of PM₁₀ concentration. By using these variables together with temporal variables as input in the nonlinear autoregressive with exogenous input models, the resultant model shows good forecasting performance, with root mean square error of 7.086±0.873 µg/m³. The selection of significant variables helps in reducing input variables inside the forecast model without degrading its forecast performance.
CONCLUSION: This model shows very promising performance in forecasting PM₁₀ concentration in Kota Kinabalu as it requires fewer input variables and does not require variable transformation.

Graphical Abstract

Highlights

Based on principal component analysis, five variables, namely WDI, RH, Temp, NO₂ and O₃, strongly contributes to the variation of PM₁₀ concentration in Kota Kinabalu;

Forecasting model that uses variable selected from PCA have forecast performance comparable to other forecast models that uses all variables as input variables;

Selection of significant parameters as inputs of forecast model is important because the model requires fewer variables and does not require variable transformation for accurate forecasting.

Keywords

Main Subjects

Environmental Engineering

Open Access

©2022 The author(s). This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit: http://creativecommons.org/licenses/by/4.0/

Publisher’s Note

GJESM Publisher remains neutral with regard to jurisdictional claims in published maps and institutional affliations.

Citation Metrics & Captures

References

Abdulkadir, S.J.; Yong, S.P., (2014). Empirical analysis of parallel-NARX recurrent network for long-term chaotic financial forecasting. International Conference on Computer and Information Sciences (ICCOINS). 1–6 (6 pages).

Abdullah, S.; Ismail, M.; Ghazali, N. A..; Ahmed, A.M.A.N., (2018). Forecasting particulate matter (PM10) concentration: A radial basis function neural network approach. AIP Conference Proceedings, 1–6 (6 pages).

Abdullah, S.; Ismail, M.; Ghazali, N. A..; Ahmed, A.M.A.N., (2019)., Multi-Layer Perceptron Model for Air Quality Prediction. Malaysian J. Math. Sci. 13: 85–90 (6 pages).

Abdullah, S.; Ismail, M.; Fong, S.Y.; Ahmed, A.M.A.N., (2016). Evaluation for long term PM10 concentration forecasting using multi linear regression (MLR) and principal component regression (PCR) models. Environ. Asia. 9(2): 101–110 (10 pages).

Abdul-Wahab, S.A.; Bakheit, C.S.; Al-Alawi, S.M., (2005) Principal component and multiple regression analysis in modelling of ground-level ozone and factors affecting its concentrations. Environ. Model. Software. 20(10): 1263–1271 (9 pages).

Antanasijević, D.Z.; Pocajt, V.V.; Povrenović, D.S.; Ristić, M.D.; Perić-Grujić, A.A., (2013). PM₁₀ emission forecasting using artificial neural networks and genetic algorithm input variable optimization. Sci. Total Environ., 443: 511–519 (9 pages).

Arhami, M.; Kamali, N.; Rajabi, M.M., (2013). Predicting hourly air pollutant levels using artificial neural networks coupled with uncertainty analysis by Monte Carlo simulations. Environ. Sci. Pollut. Res., 20(7): 4777–4789 (13 pages).

Azid, A.; Juahir, H.; Toriman, M. E.; Kamarudin, M. K. A.; Saudi, A. S. M.; Hasnam, C.N.C.; Aziz, N.A.A.; Azaman, F.; Latif, M. T.; Zainuddin, S. F. M.; Osman, M. R.; Yamin, M., (2014). Prediction of the level of air pollution using principal component analysis and artificial neural network techniques: A case study in Malaysia. Water Air Soil Pollut., 225(8): 2063–2076 (14 pages).

Besar, S.N.A.; Ladin, M.A.; Harith, N.S.H.; Bolong, N.; Saad, I.; Taha, N., (2020). An overview of the transportation issues in Kota Kinabalu, Sabah.IOP Conference Series: Earth and Environ. Sci., 1–9 (9 pages).

Biancofiore, F.; Busilacchio, M.; Verdecchia, M.; Tomassetti, B.; Aruffo, E.; Bianco, S.; Di Tomasso, S.; Colangeli, C.; Rosatelli, G.; Di Carlo, P., (2017). Recursive neural network model for analysis and forecast of PM₁₀ and PM_2.5. Atmos. Pollut. Res., 8: 652–659 (8 pages).

Cabaneros, S.M.L.S.; Calautit, J.K.S.; Hughes, B.R., (2017). Hybrid Artificial Neural Network Models for Effective Prediction and Mitigation of Urban Roadside NO₂ Pollution. Energy Procedia. 3524–3530 (7 pages).

Ceylan, Z.; Bulkan, S., (2018). Forecasting PM₁₀ Levels using ANN and MLR: A case study for Sakarya City. Global Nest J., 20(2): 281–290 (10 pages).

Chang, H.W. J.; Chee, F.P.; Kong, S.K.S.; Sentian, J., (2018). Variability of the PM₁₀ concentration in the urban atmosphere of Sabah and its responses to diurnal and weekly changes of CO, NO₂, SO₂ and Ozone. Asian J. Atmos. Environ., 12(2): 109–126 (18 pages).

Díaz-Robles, L.A.; Ortega, J.C.; Fu, J.S.; Reed, G.D.; Chow, J.C; Watson, J.G.; Moncada-Herrera, J.A., (2008). A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile. Atmos. Environ., 42: 8331–8340 (10 pages).

Djamila, H.; Ming, C.C.; Kumaresan, S., (2011). Estimation of exterior vertical daylight for the humid tropic of Kota Kinabalu city in East Malaysia. Renewable Energy. 36(1): 9–15 (7 pages).

Dominick, D.; Juahir, H.; Latif, M.T.; Zain, S.M.; Aris, A.Z., (2012). Spatial assessment of air quality patterns in Malaysia using multivariate analysis. Atmos. Environ., 60: 172–181 (10 pages).

Dotse, S.Q.; Petra, M.I.; Dagar, L.; De Silva, L.C., (2018). Application of computational intelligence techniques to forecast daily PM₁₀ exceedances in Brunei Darussalam. Atmos. Pollut. Res., 9: 358–368 (11 pages).

Elangasinghe, M.A.; Singhal, N.; Dirks, K.N.; Salmond, J.A.; Samarasinghe, S., (2014). Complex time series analysis of PM₁₀ and PM_2.5 for a coastal site using artificial neural network modelling and k-means clustering. Atmos. Environ., 94: 106–116 (11 pages).

Fan, J.; Li, Q.; Hou, J.; Feng, X.; Karimian, H.; Lin, S., (2013). A spatiotemporal prediction framework for air pollution based on deep RNN. ISPRS Ann. Photogramm. Remote Sens. Spatial Inf. Sci.: 15– 2 (8 pages).

Feng, X.; Li, Q.; Zhu, Y.; Hou, J.; Jin, L.; Wang, J., (2015). Artificial neural networks forecasting of PM_2.5 pollution using air mass trajectory based geographic model and wavelet transformation. Atmos. Environ., 107, 118–128 (11 pages).

Franceschi, F.; Cobo, M.; Figueredo, M., (2018). Discovering relationships and forecasting PM₁₀ and PM_2.5 concentrations in Bogotá Colombia, using Artificial Neural Networks, Principal Component Analysis, and k-means clustering. Atmos. Pollut. Res., 9: 912–922 (11 pages).

Graham, J.W., (2009). Missing Data Analysis: Making It Work in the Real World. Annu. Rev. Psychol., 60: 549–579 (31 pages).

Grivas, G.; Chaloulakou, A., (2006). Artificial neural network models for prediction of PM₁₀ hourly concentrations, in the Greater Area of Athens, Greece. Atmos. Environ., 40: 1216–1229 (14 pages).

Gvozdić, V.; Kovač-Andrić, E.; Brana, J., (2011). Influence of Meteorological Factors NO₂, SO₂, CO and PM₁₀ on the Concentration of O₃ in the Urban Atmosphere of Eastern Croatia. Environ. Model. Assess., 16(5): 491–501 (11 pages).

Karri, R.R.; Mohammadyan, M.; Ghoochani, M.; Mohammadpoure, R.A.; Yusup, Y.; Rafatullah, M.; Mohammadyan, M.; Sahu, J. N., (2018). Modeling airborne indoor and outdoor particulate matter using genetic programming. Sustainable Cities Soc., 43: 395–405 (11 pages).

Kim, K.H.; Kabir, E.; Kabir, S., (2015). A review on the human health impact of airborne particulate matter. Environ. Int., 74: 136–143 (8 pages).

Lou, C.; Liu, H.; Li, Y.; Peng, Y.; Wang, J.; Dai, L., (2017). Relationships of relative humidity with PM_2.5 and PM₁₀ in the Yangtze River Delta, China. Environ. Monit. Assess., 74: 136–143 (8 pages).

Muhammad Izzuddin, R.; Chee, F.P.; Dayou, J.; Chang, H.W. J.; Kong, S.K.S.; Sentian, J., (2019). Temporal Assessment on Variation of PM₁₀ Concentration in Kota Kinabalu using Principal Component Analysis and Fourier Analysis. Curr. World. Environ., 14(3): 400–410 (11 pages).

Muhammad Izzuddin, R.; Chee, F.P.; Dayou, J.; Chang, H.W.J.; Kong, S.K.S.; Sentian, J., (2020). Missing Value Imputation for PM₁₀ Concentration in Sabah using Nearest Neighbour Method (NNM) and Expectation Maximization (EM) Algorithm. Asian J. Atmos. Environ., 14(1): 62–72 (11 pages).

Munir, S.; Habeebullah; T.M.; Mohammed, A.M.F.; Morsy, E.A.; Rehan, M.; Ali, K., (2017). Analysing PM_2.5 and its association with PM₁₀ and meteorology in the arid climate of Makkah, Saudi Arabia. Aerosol Air Qual. Res., 17: 453–464 (12 pages).

Noor, H.M.; Nasrudin, N.; Foo, J., (2014). Determinants of Customer Satisfaction of Service Quality: City Bus Service in Kota Kinabalu, Malaysia. Procedia Social Behav. Sci., 153: 595–605 (11 pages).

Özbay, B.; Keskin, G.A.; Doǧruparmak, Ş.Ç.; Ayberk, S., (2011). Multivariate methods for ground-level ozone modeling. Atmos. Res., 102: 57–65 (9 pages).

Paschalidou, A.K.; Karakitsios, S.; Kleanthous, S.; Kassomenos, P.A., (2011). Forecasting hourly PM₁₀ concentration in Cyprus through artificial neural networks and multiple regression models: Implications to local environmental management. Environ. Sci. Pollut. Res., 18(2): 316–327 (10 pages).

Polat, E.; Gunay, S., (2015). The Comparison of Partial Least Squares Regression, Principal Component Regression and Ridge Regression With Multiple Linear Regression for Predicting PM₁₀ Concentration Level Based on Meteorological Parameters. J. Data Sci., 13: 663–692 (10 pages).

Potdar, K.; Pardawala, T.S., (2017). Forecasting Ambient Air Quality in Mumbai using Neural Networks. 5^th National Conference on Role of Engineers in Nation Building: 1–4 (4 pages).

Saxena, S.; Mathur, A.K., (2017). Prediction of Respirable Particulate Matter (PM₁₀) Concentration using Artificial Neural Network in Kota City. Asian J. of Convergence Technol., 3(3): 1–7 (7 pages).

Shahraiyni, H.T.; Sodoudi, S., (2016). Statistical modeling approaches for PM₁₀ prediction in urban areas; A review of 21st-century studies. Atmos., 2(15): 1–24 (24 pages).

Shekarrizfard, M.; Karimi-Jashni, A.; Hadad, K., (2012). Wavelet transform-based artificial neural networks (WT-ANN) in PM₁₀ pollution level estimation, based on circular variables. Environ. Sci. Pollut. Res., 19: 256–268 (13 pages).

Teong, K.V.; Sukarno, K.; Hian, J., Chang, W.; Chee, F.P.; Ho, C.M.; Dayou, J., (2017). The Monsoon effect on rainfall and solar radiation in Kota Kinabalu. Trans. Sci. Technol., 4(4): 460–465 (6 pages).

Ul-Saufie, A.Z.; Yahaya, A.S.; Ramli, N.A.; Hamid, H.A., (2011). Comparison Between Multiple Linear Regression And Feed forward Back propagation Neural Network Models For Predicting PM₁₀ Concentration Level Based On Gaseous And Meteorological Parameters. Int. J. App. Sci. Technol., 1(4): 42–49 (8 pages).

Ul-Saufie, A.Z.; Yahaya, A.S.; Ramli, N.A.; Hamid, H.A., (2015). PM₁₀ concentrations short term prediction using feedforward backpropagation and general regression neural network in a sub- urban area. J. Environ. Sci. Technol., 8(2): 59–73. (15 pages).

Ul-Saufie, A.Z.; Yahaya, A.S.; Ramli, N.A.; Rosaida, N.; Hamid, H.A., (2013). Future daily PM₁₀concentrations prediction by combining regression models and feedforward backpropagation models with principle component analysis (PCA). Atmos. Environ., 77: 621–630 (10 pages).

Vijayaraghavan, N.; Mohan, G.S., (2016). Air pollution analysis for Kannur City using artificial neural network. Int. J. Sci. Res., 5(10): 1399–1401 (3 pages).

Vlachogianni, A.; Kassomenos, P.; Karppinen, A.; Karakitsios, S.; Kukkonen, J., (2011). Evaluation of a multiple regression model for the forecasting of the concentrations of NOx and PM10 in Athens and Helsinki. Sci. Total Environ., 409: 1559–1571 (13 pages).

Voukantsis, D.; Karatzas, K.; Kukkonen, J.; Räsänen, T.; Karppinen, A.; Kolehmainen, M., (2011). Intercomparison of air quality data using principal component analysis, and forecasting of PM₁₀ and PM_2.5 concentrations using artificial neural networks, in Thessaloniki and Helsinki. Sci. Total Environ., 406: 1266–1276 (11 pages).

Willmott, C.J.; Matsuura, K., (2005). Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim. Res., 30: 79–82 (4 pages).

Wu, Z.; Fan, J.; Gao, Y.; Shang, H.; Song, H., (2019). Study on prediction model of space-time Distribution of air pollutants based on artificial neural network. Environ. Eng. Manage. J., 18(7): 1876–1890 (15 pages).

Xie, Y.; Bin, Z.; Lin, Z.; Rong, L., (2015). Spatiotemporal variations of PM2.5 and PM10 concentrations between 31 Chinese cities and their relationships with SO2, NO2, CO and O3. Particuology, 20: 141–149 (9 pages).

Yu, H.; Wilamowski, B.M., (2016). Levenberg-marquardt training. Intell. Sys., 2: 1–16 (16 pages).

Letters to Editor

GJESM Journal welcomes letters to the editor for the post-publication discussions and corrections which allows debate post publication on its site, through the Letters to Editor. Letters pertaining to manuscript published in GJESM should be sent to the editorial office of GJESM within three months of either online publication or before printed publication, except for critiques of original research. Following points are to be considering before sending the letters (comments) to the editor.

[1] Letters that include statements of statistics, facts, research, or theories should include appropriate references, although more than three are discouraged.
[2] Letters that are personal attacks on an author rather than thoughtful criticism of the author’s ideas will not be considered for publication.
[3] Letters can be no more than 300 words in length.
[4] Letter writers should include a statement at the beginning of the letter stating that it is being submitted either for publication or not.
[5] Anonymous letters will not be considered.
[6] Letter writers must include their city and state of residence or work.
[7] Letters will be edited for clarity and length.

Name *

Email Address *

Affiliation *

Comments *

Security Code *

Global Journal of Environmental Science and Management

Forecasting particulate matter concentration using nonlinear autoregression with exogenous input model

References

References

Letters to Editor

Send comment about this article

Volume 8, Issue 1 - Serial Number 29
January 2022
Pages 27-44

Forecasting particulate matter concentration using nonlinear autoregression with exogenous input model

References

References

Letters to Editor

Send comment about this article

Volume 8, Issue 1 - Serial Number 29January 2022Pages 27-44

Volume 8, Issue 1 - Serial Number 29
January 2022
Pages 27-44