Comparative analysis of traditional methods and a deep learning approach for multivariate imputation of missing values in the meteorological field

 

Αποθηκεύτηκε σε:
Λεπτομέρειες βιβλιογραφικής εγγραφής
Συγγραφείς: Arias-Muñoz, Ana Cristina, Cob-García, Susana, Calvo-Valverde, Luis Alexander
Μορφή: artículo original
Κατάσταση:Versión publicada
Ημερομηνία έκδοσης:2024
Περιγραφή:Climate observations are the groundwork for several real-world applications such as weather forecasting, climate change monitoring and environmental impact assessments. However, the data is mostly measured and recorded by external devices exposed to numerous variables, causatives of malfunctions and, therefore, missing values. Nowadays, data imputation in the time series field has been researched in depth and a wide variety of methods have been proposed, where traditional classification and regression algorithms predominate, even though there are also deep learning approaches that manage to capture temporal relationships between observations. In this article, a comparative analysis between a classification imputation algorithm, a regression imputation algorithm, and a deep learning imputation model is made: MissForest algorithm, based on random trees; Expectation Maximization with Bootstrap (EMB), the maximum likelihood estimation algorithm; and a proposed deep learning model, based on the Long-Short Term Memory (LSTM) architecture. Data from the Costa Rica meteorological field were used, which consist of multivariate data coming from several weather stations in the same geographical area.
Χώρα:Portal de Revistas TEC
Ίδρυμα:Instituto Tecnológico de Costa Rica
Repositorio:Portal de Revistas TEC
Γλώσσα:Inglés
Español
OAI Identifier:oai:ojs.pkp.sfu.ca:article/6746
Διαθέσιμο Online:https://revistas.tec.ac.cr/index.php/tec_marcha/article/view/6746
Λέξη-Κλειδί :data imputation
EMB
MissForest
LSTM
time series
imputación de datos
series de tiempo