Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index

Perez, Paula Andrea Rosero and Gonzalez, Juan Sebastian Realpe and Salazar-Cabrera, Ricardo and Restrepo, David and Lopez, Diego M. and Blobel, Bernd (2023) Multidimensional Machine Learning Model to Calculate a COVID-19 Vulnerability Index. JOURNAL OF PERSONALIZED MEDICINE, 13 (7): 1141. ISSN , 2075-4426

Full text not available from this repository. (Request a copy)

Abstract

In Colombia, the first case of COVID-19 was confirmed on 6 March 2020. On 13 March 2023, Colombia registered 6,360,780 confirmed positive cases of COVID-19, representing 12.18% of the total population. The National Administrative Department of Statistics (DANE) in Colombia published in 2020 a COVID-19 vulnerability index, which estimates the vulnerability (per city block) of being infected with COVID-19. Unfortunately, DANE did not consider multiple factors that could increase the risk of COVID-19 (in addition to demographic and health), such as environmental and mobility data (found in the related literature). The proposed multidimensional index considers variables of different types (unemployment rate, gross domestic product, citizens' mobility, vaccination data, and climatological and spatial information) in which the incidence of COVID-19 is calculated and compared with the incidence of the COVID-19 vulnerability index provided by DANE. The collection, data preparation, modeling, and evaluation phases of the Cross-Industry Standard Process for Data Mining methodology (CRISP-DM) were considered for constructing the index. The multidimensional index was evaluated using multiple machine learning models to calculate the incidence of COVID-19 cases in the main cities of Colombia. The results showed that the best-performing model to predict the incidence of COVID-19 in Colombia is the Extra Trees Regressor algorithm, obtaining an R-squared of 0.829. This work is the first step toward a multidimensional analysis of COVID-19 risk factors, which has the potential to support decision making in public health programs. The results are also relevant for calculating vulnerability indexes for other viral diseases, such as dengue.

Item Type: Article
Uncontrolled Keywords: ; COVID-19; dataset; machine learning; vulnerability index
Subjects: 600 Technology > 610 Medical sciences Medicine
Divisions: Medicine > Zentren des Universitätsklinikums Regensburg > eHealth Competence Center
Depositing User: Dr. Gernot Deinzer
Date Deposited: 13 Mar 2024 10:07
Last Modified: 13 Mar 2024 10:07
URI: https://pred.uni-regensburg.de/id/eprint/59901

Actions (login required)

View Item View Item