Proposal of self and semi-supervised learning for imbalanced classification of coronary heart disease tabular data

 

Bewaard in:
Bibliografische gegevens
Auteurs: Xie-Li, Danny, González-Hernández, Manfred
Formaat: artículo original
Status:Versión publicada
Publicatiedatum:2024
Omschrijving:Triple Mixup is an augmentation policy in the hidden latent space we introduced in the Contrastive Mixup Self-Semi Supervised learning framework, to address the imbalanced data problem, for Cardiovascular Heart Diseases tabular dataset. Medical tabular datasets are known to present challenges as high imbalanced class, limited annotated quality samples due to the domain nature. Recent literature in Self and Semi supervised learning, has shown tremendous progress in learning useful representations, and leveraging unlabeled dataset and labeled dataset to train a learning model. Most existing methods are not feasible for tabular data due to the data augmentation scheme. In addition, the high imbalanced problem can show lower performance on machine learning algorithms. For this work, we propose the triple data augmentation method in hidden space to attack the unbalanced challenge in self-supervised and semi-supervised learning, from the possible applications of Contrastive Mixup, thus we will study the influence of it.
Land:Portal de Revistas TEC
Instelling:Instituto Tecnológico de Costa Rica
Repositorio:Portal de Revistas TEC
Taal:Inglés
OAI Identifier:oai:ojs.pkp.sfu.ca:article/7295
Online toegang:https://revistas.tec.ac.cr/index.php/tec_marcha/article/view/7295
Keyword:Self-supervised learning
semi-supervised learning
data augmentation
contrastive learning
imbalanced
medical datasets
Aprendizaje Autosupervisado
Aprendizaje Semisupervisado
Aumentación de datos
prendizaje por Contraste
Desbalance de datos
datos médicos