AutoML approaches to the identification of novel biomarkers associated with thalassemia

 

Guardado en:
Bibliografiske detaljer
Autores: Mora Jiménez, Luis Diego, Guevara Coto, Jose, Berrocal Rojas, Allan
Format: comunicación de congreso
Fecha de Publicación:2023
Beskrivelse:Thalassemias are a group of genetic blood disorders in which abnormal hemoglobin production occurs. Currently, there are obstacles in its diagnostic methods and approaches. In addition, its treatment represents a significantly high cost. This work proposes the use of machine learning techniques, and prior knowledge of known genes associated with thalassemia, to find novel biomarkers associated with the disease. This may eventually help in detection efforts. Also, we propose to evaluate automated machine learning (AutoML) approaches as an alternative to using traditional algorithms. The AutoML tools we decided to use were Auto-Sklearn and Tree-based Pipeline Optimization Tool (TPOT). In this way, we synthesize the experience of using these tools and compared their performance against a Support Vector Machine based Model. This was done through a comparison of performance metrics. Finally, we found that TPOT offers certain ease of use, such as the option to export the best pipeline found, as well as an improvement in performance compared to other methods. This opens the possibility to test new configurations of the tool, as well as other AutoML tools.
País:Kérwá
Institution:Universidad de Costa Rica
Repositorio:Kérwá
Sprog:Inglés
OAI Identifier:oai:kerwa.ucr.ac.cr:10669/104578
Online adgang:https://hdl.handle.net/10669/104578
Palabra clave:Thalassemia
Machine Learning
Biomarkers
Automl
Genetic Expression
Artificial intelligence
Preventive medicine
Automation