Empirical Comparison of Time Series Data Mining Algorithms

  • SAKINAT OLUWABUKONLA FOLORUNSO OLABISI ONABANJO UNIVERSITY
Abstract views: 312 , PDF downloads: 380

Abstract

Time series is a sequence of observed data that is usually ordered in time. Time series data mining is the innovative application of the principles and techniques of data mining in the analysis of time series.  This research is aimed to apply data mining techniques to forecasting time series data. Electric Power consumption data consumed by Nigerians from 2001 to 2017. Experiments are conducted with four data mining techniques: Random Regression Forest (RRF), Linear Regression (LR), Support Vector Regression (SVR) and Artificial Neural Network (ANN) which were evaluated based on their forecasting errors generated: Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE) and prediction accuracy on Waikato Environment for Knowledge Analysis (WEKA) platform. The combination of parameters that yields the best results in terms of predefined performance criteria was chosen as optimal for each regressor. A comparative analysis of the regressors’ performance was conducted. All the tested regressors have demonstrated the best prediction quality in short periods of time. SVR demonstrated the best results in terms of both error values and time expenses. 

 

Keywords: Time Series Forecast, Time Series Data Mining, Classification Algorithms, Regression analysis, Power consumption

Downloads

Download data is not yet available.

Author Biography

SAKINAT OLUWABUKONLA FOLORUNSO, OLABISI ONABANJO UNIVERSITY
Lecturer

References

M. Hayati and Y. Shirvany, "Artificial Neural Network approach for short term load forecasting for illam region," World Academy of Science, Engineering and Technology, vol. 28, pp. 280-284, 2007.

A. Anguera, J. M. Barreiro, J. A. Lara and D. Lizcano, "Applying data mining techniques to medical time series: An empirical case study in electroencephalography and stabilometry," Computational and Structural Biotechnology Journal, vol. 14, pp. 185 - 199, 2016.

A. Sorjamaa, J. Hao, N. Reyhani, Y. Ji and A. Lendasse, "Methodology for long-term prediction of time series," Neurocomputing 70 (2007) 2861–2869, vol. 70, p. 2861–2869, May 2007.

A. I. Taiwo, "Spectral and Fourier Parameter Estimation of Periodic Autocorrelated Time Series Data," Ago- Iwoye, Nigeria, 2017.

Q. Yang and X. Wu, "10 Challenging Problems in Data mining," International Journal of Information Technology and Decision Making, vol. 5, no. 4, pp. 597 - 604, 2005.

K. Chakrabarti, E. Keogh, M. Pazzani and S. Mehrotra, "Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Database," ACM Transactions on Database Systems, vol. 27, no. 2, pp. 188-228, June 2002.

A. Fouilloy, C. Voyant, G. Notton, M. L. Nivet and L. J. Duchaud, "Machine Learning Methods for Solar Irradiation Forecasting: A Comparison in a Mediterranean Site," 2017.

Y. Fu, Z. Li, H. Zhang and P. Xu, "Using Support Vector Machine to Predict Next Day Electricity Load of Public Buildings with Sub-metering Devices," in Procedia Engineering, 2015.

S. S. Kathait, A. Kaur, S. Tiwari and A. Varshney, "Integrating Neural Networks with Time Series Forecasting:," International Journal of Engineering Research And Management (IJERM), vol. 04, no. 1, pp. 98-100, January 2017.

L. Breiman, "Random forests," Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.

A. J. Smola and B. Schölkopf, "A tutorial on support vector regression," Stat Comput, vol. 14, pp. 199-222, 2004.

N. K. Ahmed, A. F. Atiya, N. E. Gayar and H. El-Shishiny, "An Empirical Comparison of Machine Learning Models for Time Series Forecasting," Econometric Reviews, 2010.

A. T. Sergio and T. B. Ludermir, "Deep Learning for Wind Speed Forecasting in Northeastern Region of Brazil," in Brazilian Conference on Intelligent Systems (BRACIS), Anderson Tenorio Sergio, Teresa B. Ludermir, Anderson Tenorio Sergio, and Teresa B. Ludermir, “ ,†2015 B, 2015.

A. H. Jiang, X. C. Huang, Z. H. Zhang, J. Li, Z. Y. Zhang and H. X. Hua, "Mutual Information Algorithms," Mech Syst Signal Process, vol. 24, pp. 2947-2960, 2010.

T. K. Ho, "The random subspace method for constructing decision forests," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 8, pp. 832 - 844, August 1998.

D. A. Freedman, Statistical Models: Theory and Practice, Cambridge University Press, 2009.

S. K. Shevade, S. S. Keerthi, C. Bhattacharyya and K. R. K. Murthy, "Improvements to the SMO Algorithm for SVM Regression," IEEE Transactions on Neural Networks, 1999.

R. R. Bouckaert, E. Frank, M. A. Hall, B. Holmes, P. Reutemann and I. A. Witten, "WEKA — Experiences with a Java Open-Source Project. Journal of Machine Learning Research," vol. 11, pp. 2533-2541, 2010.

"https://www.indexmundi.com/g/g.aspx?c=ni&v=81," 2018. [Online]. Available: https://www.indexmundi.com/g/g.aspx?c=ni&v=81. [Accessed 2018].

R. Samsudin, A. Shabri and P. Saad, "A Comparisonof Time Series Forecasting using Support Vector Machine and Artificial Neural Network Model," Journal of Applied Sciences, vol. 10, no. 11, pp. 950-958, 2010.

Published
2019-09-09
How to Cite
FOLORUNSO, S. O. (2019). Empirical Comparison of Time Series Data Mining Algorithms. Indonesia Journal on Computing (Indo-JC), 4(2), 109-118. https://doi.org/10.34818/INDOJC.2019.4.2.329
Section
Computational and Simulation