Machine Learning Potential for Identifying and Forecasting Complex Environmental Drivers of Vibrio vulnificus Infections in the United States

Loading...
Thumbnail Image
Identifiers

Publication date

Advisors

Tutors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

Public Health Services, US Dept of Health and Human Services
Metrics
Google Scholar
lacobus
Export

Research Projects

Organizational Units

Journal Issue

Abstract

Background: Environmental change in coastal areas can drive marine bacteria and resulting infections, such as those caused by Vibrio vulnificus, with both foodborne and nonfoodborne exposure routes and high mortality. Although ecological drivers of V. vulnificus in the environment have been well-characterized, fewer models have been able to apply this to human infection risk due to limited surveillance. Objectives: The Cholera and Other Vibrio Illness Surveillance (COVIS) system database has reported V. vulnificus infections in the United States since 1988, offering a unique opportunity to both explore the forecasting capabilities machine learning could provide and to characterize complex environmental drivers of V. vulnificus infections. Methods: Machine learning models, in the form of random forest classification models, were trained and refined using the epidemiological data from 2008 to 2018, six environmental variables (sea surface temperature, salinity, chlorophyll a concentration, sea level, land surface temperature, and runoff rate) and categorical encoders to assess our predictive potential to forecast V. vulnificus infections based on environmental data. Results: The highest-performing model, which used balanced classes, had an Area Under the Curve score of 0.984 and a sensitivity of 0.971, highlighting the potential of machine learning to anticipate areas and periods of V. vulnificus risk. A higher false positive rate was found when the model was applied to real-world imbalanced surveillance data, which is pertinent amid modeled underreporting and misdiagnosis ratios of V. vulnificus infections. Further models were also developed to explore multilevel spatial resolution, finding state-specific models can improve specificity and early warning system potential by exclusively using lagged environmental data. Discussion: The machine learning approach was able to characterize nonlinear and interacting environmental associations driving V. vulnificus infections. This study accentuates the potential of machine learning and robust surveillance for forecasting environmentally associated marine infections, providing future directions for improvements, further application, and operationalization. https://doi.org/10.1289/EHP15593

Description

Keywords

Bibliographic citation

Campbell, A. M., Cabrera-Gumbau, J. M., Trinanes, J., Baker-Austin, C., & Martinez-Urtaza, J. (2025). Machine learning potential for identifying and forecasting complex environmental drivers of Vibrio vulnificus infections in the United States. Environmental Health Perspectives, 133(1), 017006.

Relation

Has part

Has version

Is based on

Is part of

Is referenced by

Is version of

Requires

Sponsors

A.M.C. was supported by the Natural Environment Research Council (grant number NE/S007210/1) and Centre for Environment, Fisheries and Aquaculture Science (CEFAS) internal Seedcorn funding. J.M.U. was funded by Grants PID2021-127107NB-I00 from the Ministerio de Ciencia e Innovación (Spain), 2021 SGR 00526 from Generalitat de Catalunya and European Union’s Horizon Europe research and innovation program under Grant Agreement No. 101057554 (IDAlert).

Rights

EHP is an open-access journal published with support from the National Institute of Environmental Health Sciences, National Institutes of Health. All content is public domain unless otherwise noted.