Comparative analysis of traditional methods and a deep learning approach for multivariate imputation of missing values in the meteorological field

 

Guardado en:
Detalles Bibliográficos
Autores: Arias-Muñoz, Ana Cristina, Cob-García, Susana, Calvo-Valverde, Luis Alexander
Formato: artículo original
Estado:Versión publicada
Fecha de Publicación:2024
Descripción:Climate observations are the groundwork for several real-world applications such as weather forecasting, climate change monitoring and environmental impact assessments. However, the data is mostly measured and recorded by external devices exposed to numerous variables, causatives of malfunctions and, therefore, missing values. Nowadays, data imputation in the time series field has been researched in depth and a wide variety of methods have been proposed, where traditional classification and regression algorithms predominate, even though there are also deep learning approaches that manage to capture temporal relationships between observations. In this article, a comparative analysis between a classification imputation algorithm, a regression imputation algorithm, and a deep learning imputation model is made: MissForest algorithm, based on random trees; Expectation Maximization with Bootstrap (EMB), the maximum likelihood estimation algorithm; and a proposed deep learning model, based on the Long-Short Term Memory (LSTM) architecture. Data from the Costa Rica meteorological field were used, which consist of multivariate data coming from several weather stations in the same geographical area.
País:Portal de Revistas TEC
Institución:Instituto Tecnológico de Costa Rica
Repositorio:Portal de Revistas TEC
Lenguaje:Inglés
Español
OAI Identifier:oai:ojs.pkp.sfu.ca:article/6746
Acceso en línea:https://revistas.tec.ac.cr/index.php/tec_marcha/article/view/6746
Palabra clave:data imputation
EMB
MissForest
LSTM
time series
imputación de datos
series de tiempo