Prof. Rubén Pérez Chacón is a Computer Science Engineer (Pablo de Olavide University, 2011), holds a Master's Degree in Information Security (Autonomous University of Barcelona, 2014), and has been a PhD student in the Big Data and Data Science lab since 2015. He is an Assistant Professor in the area of Languages and Information Systems at Pablo de Olavide University.
He has more than 5 years of experience in the private sector in IT roles such as software developer (J2EE, PHP and PL/SQL) and penetration tester. His research experience includes distributed technologies such as Apache Spark and the development of algorithms in programming languages such as Scala, Java and R.
The research lines of Prof. Rubén Pérez Chacón focus on data mining and machine learning, with direct application to big data time series forecasting. He is the author or co-author of 3 publications in JCR-indexed journals (two in Q1 and one in Q2) and a member of the program committee of numerous international conferences.
Publications
2020
F. Martínez-Álvarez, G. Asencio-Cortés, J. F. Torres, D. Gutiérrez-Avilés, L. Melgar-García, R. Pérez-Chacón, C. Rubio-Escudero, A. Troncoso and J. C. Riquelme: Coronavirus Optimization Algorithm: A bioinspired metaheuristic based on the COVID-19 propagation model. Journal Article. Big Data, 8 (4), pp. 308-322, 2020.

@article{MARTINEZ-ALVAREZ20,
  title     = {Coronavirus Optimization Algorithm: A bioinspired metaheuristic based on the COVID-19 propagation model},
  author    = {F. Martínez-Álvarez and G. Asencio-Cortés and J. F. Torres and D. Gutiérrez-Avilés and L. Melgar-García and R. Pérez-Chacón and C. Rubio-Escudero and A. Troncoso and J. C. Riquelme},
  url       = {https://www.liebertpub.com/doi/full/10.1089/big.2020.0051},
  doi       = {10.1089/big.2020.0051},
  year      = {2020},
  date      = {2020-07-22},
  journal   = {Big Data},
  volume    = {8},
  number    = {4},
  pages     = {308--322},
  abstract  = {This work proposes a novel bioinspired metaheuristic, simulating how the coronavirus spreads and infects healthy people. From a primary infected individual (patient zero), the coronavirus rapidly infects new victims, creating large populations of infected people who will either die or spread infection. Relevant terms such as reinfection probability, super-spreading rate, social distancing measures or traveling rate are introduced into the model in order to simulate the coronavirus activity as accurately as possible. The infected population initially grows exponentially over time, but taking into consideration social isolation measures, the mortality rate and the number of recoveries, the infected population gradually decreases. The Coronavirus Optimization Algorithm has two major advantages when compared to other similar strategies. Firstly, the input parameters are already set according to the disease statistics, preventing researchers from initializing them with arbitrary values. Secondly, the approach has the ability to end after several iterations, without setting this value either. Furthermore, a parallel multi-virus version is proposed, where several coronavirus strains evolve over time and explore wider search space areas in fewer iterations. Finally, the metaheuristic has been combined with deep learning models in order to find optimal hyperparameters during the training phase. As an application case, the problem of electricity load time series forecasting has been addressed, showing quite remarkable performance.},
  pubstate  = {published},
  tppubtype = {article}
}
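The propagation dynamics described in the abstract can be illustrated with a toy, heavily simplified sketch. This is not the paper's actual implementation: the function names, signatures and parameter values below are illustrative, and real disease-statistics parameters, reinfection and the multi-virus variant are omitted.

```python
import random

def cvoa_sketch(fitness, mutate, patient_zero, generations=20,
                spreading_rate=2, death_rate=0.05, max_pop=200, seed=0):
    """Toy sketch of a propagation-style metaheuristic (minimisation).

    fitness: objective to minimise; mutate(s, rng): a nearby solution.
    Starting from patient_zero, every infected solution spreads
    spreading_rate mutated copies per generation; a fraction "dies"
    (is discarded) and the population is capped, loosely mimicking
    mortality and isolation measures.
    """
    rng = random.Random(seed)
    infected = [patient_zero]
    best = patient_zero
    for _ in range(generations):
        # each infected individual spreads mutated copies of itself
        newly = [mutate(s, rng) for s in infected for _ in range(spreading_rate)]
        # some individuals die; the rest survive into the next generation
        infected = [s for s in infected + newly if rng.random() > death_rate]
        infected.sort(key=fitness)
        infected = infected[:max_pop]  # population cap mimics isolation
        if infected and fitness(infected[0]) < fitness(best):
            best = infected[0]
    return best
```

On a simple objective such as minimising (x - 3)^2 from patient zero 0.0, the infected population concentrates around the optimum within a few generations.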
R. Pérez-Chacón, G. Asencio-Cortés, F. Martínez-Álvarez and A. Troncoso: Big data time series forecasting based on pattern sequence similarity and its application to the electricity demand. Journal Article. Information Sciences, 540, pp. 160-174, 2020.

@article{PEREZ20,
  title     = {Big data time series forecasting based on pattern sequence similarity and its application to the electricity demand},
  author    = {R. Pérez-Chacón and G. Asencio-Cortés and F. Martínez-Álvarez and A. Troncoso},
  url       = {https://www.sciencedirect.com/science/article/pii/S0020025520306010},
  doi       = {10.1016/j.ins.2020.06.014},
  year      = {2020},
  date      = {2020-06-06},
  journal   = {Information Sciences},
  volume    = {540},
  pages     = {160--174},
  abstract  = {This work proposes a novel algorithm to forecast big data time series. Based on the well-established Pattern Sequence Forecasting algorithm, this new approach makes two major contributions to the literature. First, it improves the accuracy of the aforementioned algorithm's predictions, and second, it transfers the algorithm to the big data context, reaching meaningful results in terms of scalability. The algorithm uses the Apache Spark distributed computation framework and is a ready-to-use application with few parameters to adjust. Physical and cloud clusters have been used to carry out the experimentation, which consisted of applying the algorithm to real-world data from the Uruguayan electricity demand.},
  pubstate  = {published},
  tppubtype = {article}
}
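At its core, pattern-sequence forecasting labels each day with a cluster id and predicts from what historically followed the current sequence of labels. The matching step can be sketched on a single machine as below; the published algorithm runs distributed on Apache Spark, and the function name and fallback rule here are illustrative assumptions.

```python
def psf_forecast(labels, values, w):
    """Pattern-sequence matching step (simplified, single-node sketch).

    labels: cluster label per historical day
    values: the series value per historical day
    w: window length - match the last w labels against history
    Returns the average value observed the day after each past
    occurrence of the final w-label pattern.
    """
    pattern = tuple(labels[-w:])
    successors = [
        values[i + w]
        for i in range(len(labels) - w)  # only starts whose successor exists
        if tuple(labels[i:i + w]) == pattern
    ]
    if not successors:  # pattern never seen before: fall back to last value
        return float(values[-1])
    return float(sum(successors) / len(successors))
```

For example, with alternating labels [0, 1, 0, 1, 0, 1] and values [10, 20, 10, 20, 10, 20], the final pattern (0, 1) was always followed by 10, so the forecast is 10.0.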
2019
R. Talavera-Llames, R. Pérez-Chacón, A. Troncoso and F. Martínez-Álvarez: MV-kWNN: A novel multivariate and multi-output weighted nearest neighbors algorithm for big data time series forecasting. Journal Article. Neurocomputing, 353, pp. 56-73, 2019.

@article{NEUCOM2019,
  title     = {MV-kWNN: A novel multivariate and multi-output weighted nearest neighbors algorithm for big data time series forecasting},
  author    = {R. Talavera-Llames and R. Pérez-Chacón and A. Troncoso and F. Martínez-Álvarez},
  url       = {https://www.sciencedirect.com/science/article/pii/S0925231219303236?via%3Dihub},
  doi       = {10.1016/j.neucom.2018.07.092},
  year      = {2019},
  date      = {2019-01-01},
  journal   = {Neurocomputing},
  volume    = {353},
  pages     = {56--73},
  abstract  = {This paper introduces a novel algorithm for big data time series forecasting. Its main novelty lies in its ability to deal with multivariate data, i.e. to consider multiple time series simultaneously, in order to make multi-output predictions. Real-world processes are typically characterised by several interrelated variables, and the future occurrence of certain time series cannot be explained without understanding the influence that other time series might have on the target time series. One key issue in the context of multivariate analysis is to determine a priori whether exogenous variables must be included in the model or not. To deal with this, a correlation analysis is used to find a minimum correlation threshold that an exogenous time series must exhibit in order to be beneficial. Furthermore, the proposed approach has been specifically designed to be used in the context of big data, thus making it possible to efficiently process very large time series. To evaluate the performance of the proposed approach we use data from Spanish electricity prices. Results have been compared to other multivariate approaches, showing remarkable improvements in terms of both accuracy and execution time.},
  pubstate  = {published},
  tppubtype = {article}
}
2018
R. Talavera-Llames, R. Pérez-Chacón, A. Troncoso and F. Martínez-Álvarez: Big data time series forecasting based on nearest neighbors distributed computing with Spark. Journal Article. Knowledge-Based Systems, 161 (1), pp. 12-25, 2018.

@article{KNOSYS2018b,
  title     = {Big data time series forecasting based on nearest neighbors distributed computing with Spark},
  author    = {R. Talavera-Llames and R. Pérez-Chacón and A. Troncoso and F. Martínez-Álvarez},
  url       = {https://www.sciencedirect.com/science/article/pii/S0950705118303770},
  doi       = {10.1016/j.knosys.2018.07.026},
  year      = {2018},
  date      = {2018-01-01},
  journal   = {Knowledge-Based Systems},
  volume    = {161},
  number    = {1},
  pages     = {12--25},
  abstract  = {A new approach for big data forecasting based on the k-weighted nearest neighbours algorithm is introduced in this work. Such an algorithm has been developed for distributed computing under the Apache Spark framework. Every phase of the algorithm is explained in this work, along with how the optimal values of the input parameters required for the algorithm are obtained. In order to test the developed algorithm, a Spanish energy consumption big data time series has been used. The accuracy of the prediction has been assessed showing remarkable results. Additionally, the optimal configuration of a Spark cluster has been discussed. Finally, a scalability analysis of the algorithm has been conducted leading to the conclusion that the proposed algorithm is highly suitable for big data environments.},
  pubstate  = {published},
  tppubtype = {article}
}
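The nearest-neighbours idea behind this line of work is to find the k past windows most similar to the current one and combine their successors, weighted by inverse distance. A minimal NumPy sketch of that step follows; the distributed Spark version, parameter tuning and exact weighting scheme of the paper are not reproduced here, and the names are illustrative.

```python
import numpy as np

def kwnn_forecast(history, k):
    """k-weighted nearest neighbours forecast (minimal sketch).

    history: 2-D array, one row per past window (e.g. one day of readings).
    The last row is the query; its k nearest predecessors vote on the
    next row, weighted by inverse distance.
    """
    query, past = history[-1], history[:-1]
    # candidates are rows that have a successor row within `past`
    dists = np.linalg.norm(past[:-1] - query, axis=1)
    idx = np.argsort(dists)[:k]          # k nearest candidate windows
    weights = 1.0 / (dists[idx] + 1e-9)  # closer neighbours weigh more
    successors = past[idx + 1]           # the row following each neighbour
    return (weights[:, None] * successors).sum(axis=0) / weights.sum()
```

With an alternating history of [1, 1] and [5, 5] rows ending on [1, 1], both nearest neighbours were followed by [5, 5], which is then the forecast.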
R. Pérez-Chacón, J. M. Luna, A. Troncoso, F. Martínez-Álvarez and J. C. Riquelme: Big data analytics for discovering electricity consumption patterns in smart cities. Journal Article. Energies, 11 (3), pp. 683, 2018.

@article{Energies2018,
  title     = {Big data analytics for discovering electricity consumption patterns in smart cities},
  author    = {R. Pérez-Chacón and J. M. Luna and A. Troncoso and F. Martínez-Álvarez and J. C. Riquelme},
  url       = {http://www.mdpi.com/1996-1073/11/3/683},
  doi       = {10.3390/en11030683},
  year      = {2018},
  date      = {2018-01-01},
  journal   = {Energies},
  volume    = {11},
  number    = {3},
  pages     = {683},
  abstract  = {New technologies such as sensor networks have been incorporated into the management of buildings for organizations and cities. Sensor networks have led to an exponential increase in the volume of data available in recent years, which can be used to extract consumption patterns for the purposes of energy and monetary savings. For this reason, new approaches and strategies are needed to analyze information in big data environments. This paper proposes a methodology to extract electric energy consumption patterns in big data time series, so that very valuable conclusions can be made for managers and governments. The methodology is based on the study of four clustering validity indices in their parallelized versions along with the application of a clustering technique. In particular, this work uses a voting system to choose an optimal number of clusters from the results of the indices, as well as the application of the distributed version of the k-means algorithm included in Apache Spark's Machine Learning Library. The results, using electricity consumption for the years 2011-2017 for eight buildings of a public university, are presented and discussed. In addition, the performance of the proposed methodology is evaluated using synthetic big data, which can represent thousands of buildings in a smart city. Finally, policies derived from the patterns discovered are proposed to optimize energy usage across the university campus.},
  pubstate  = {published},
  tppubtype = {article}
}
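The voting system mentioned in the abstract can be sketched independently of Spark: each clustering validity index votes for the number of clusters it rates best, and the majority wins. The function and index names below are illustrative, not the paper's four actual indices.

```python
def vote_for_k(index_scores, maximize):
    """Choose the number of clusters by majority vote (sketch of the idea).

    index_scores: dict mapping index name -> {k: score for that k}
    maximize: dict mapping index name -> True if larger scores are better
    Each validity index votes for the k it rates best; the k with the
    most votes wins, ties broken in favour of the smaller k.
    """
    votes = {}
    for name, scores in index_scores.items():
        # pick this index's preferred k (max or min depending on the index)
        best = (max if maximize[name] else min)(scores, key=scores.get)
        votes[best] = votes.get(best, 0) + 1
    return min(votes, key=lambda k: (-votes[k], k))
```

For instance, if two of three indices prefer k = 3 and one prefers k = 4, the vote selects 3, which would then be passed to the distributed k-means run.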
2016
R. Talavera-Llames, R. Pérez-Chacón, M. Martínez-Ballesteros, A. Troncoso and F. Martínez-Álvarez: A Nearest Neighbours-Based Algorithm for Big Time Series Data Forecasting. Conference. HAIS 11th International Conference on Hybrid Artificial Intelligence Systems, Lecture Notes in Computer Science, 2016.

@conference{HAIS2016b,
  title     = {A Nearest Neighbours-Based Algorithm for Big Time Series Data Forecasting},
  author    = {R. Talavera-Llames and R. Pérez-Chacón and M. Martínez-Ballesteros and A. Troncoso and F. Martínez-Álvarez},
  url       = {https://link.springer.com/chapter/10.1007/978-3-319-32034-2_15},
  year      = {2016},
  date      = {2016-01-01},
  booktitle = {HAIS 11th International Conference on Hybrid Artificial Intelligence Systems},
  series    = {Lecture Notes in Computer Science},
  pubstate  = {published},
  tppubtype = {conference}
}
R. Pérez-Chacón, R. Talavera-Llames, F. Martínez-Álvarez and A. Troncoso: Finding Electric Energy Consumption Patterns in Big Time Series Data. Conference. DCAI 13th International Conference on Distributed Computing and Artificial Intelligence, Advances in Intelligent Systems and Computing, 2016.

@conference{DCAI2016,
  title     = {Finding Electric Energy Consumption Patterns in Big Time Series Data},
  author    = {R. Pérez-Chacón and R. Talavera-Llames and F. Martínez-Álvarez and A. Troncoso},
  url       = {https://link.springer.com/chapter/10.1007%2F978-3-319-40162-1_25},
  year      = {2016},
  date      = {2016-01-01},
  booktitle = {DCAI 13th International Conference on Distributed Computing and Artificial Intelligence},
  series    = {Advances in Intelligent Systems and Computing},
  pubstate  = {published},
  tppubtype = {conference}
}