Machine Learning Potential for Identifying and Forecasting Complex Environmental Drivers of Vibrio vulnificus Infections in the United States

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.authorCampbell, Amy Marie
dc.contributor.authorCabrera-Gumabu, Jordi Manuel
dc.contributor.authorTriñanes Fernández, Joaquín Ángel
dc.contributor.authorBaker-Austin, Craig
dc.contributor.authorMartínez-Urtaza, Jaime
dc.date.accessioned2026-01-16T10:38:17Z
dc.date.available2026-01-16T10:38:17Z
dc.date.issued2025-01-23
dc.description.abstractBackground: Environmental change in coastal areas can drive marine bacteria and resulting infections, such as those caused by Vibrio vulnificus, with both foodborne and nonfoodborne exposure routes and high mortality. Although ecological drivers of V. vulnificus in the environment have been well-characterized, fewer models have been able to apply this to human infection risk due to limited surveillance. Objectives: The Cholera and Other Vibrio Illness Surveillance (COVIS) system database has reported V. vulnificus infections in the United States since 1988, offering a unique opportunity to both explore the forecasting capabilities machine learning could provide and to characterize complex environmental drivers of V. vulnificus infections. Methods: Machine learning models, in the form of random forest classification models, were trained and refined using the epidemiological data from 2008 to 2018, six environmental variables (sea surface temperature, salinity, chlorophyll a concentration, sea level, land surface temperature, and runoff rate) and categorical encoders to assess our predictive potential to forecast V. vulnificus infections based on environmental data. Results: The highest-performing model, which used balanced classes, had an Area Under the Curve score of 0.984 and a sensitivity of 0.971, highlighting the potential of machine learning to anticipate areas and periods of V. vulnificus risk. A higher false positive rate was found when the model was applied to real-world imbalanced surveillance data, which is pertinent amid modeled underreporting and misdiagnosis ratios of V. vulnificus infections. Further models were also developed to explore multilevel spatial resolution, finding state-specific models can improve specificity and early warning system potential by exclusively using lagged environmental data. Discussion: The machine learning approach was able to characterize nonlinear and interacting environmental associations driving V. vulnificus infections. This study accentuates the potential of machine learning and robust surveillance for forecasting environmentally associated marine infections, providing future directions for improvements, further application, and operationalization. https://doi.org/10.1289/EHP15593
dc.description.peerreviewedSI
dc.description.sponsorshipA.M.C. was supported by the Natural Environment Research Council (grant number NE/S007210/1) and Centre for Environment, Fisheries and Aquaculture Science (CEFAS) internal Seedcorn funding. J.M.U. was funded by Grants PID2021-127107NB-I00 from the Ministerio de Ciencia e Innovación (Spain), 2021 SGR 00526 from Generalitat de Catalunya and European Union’s Horizon Europe research and innovation program under Grant Agreement No. 101057554 (IDAlert).
dc.identifier.citationCampbell, A. M., Cabrera-Gumbau, J. M., Trinanes, J., Baker-Austin, C., & Martinez-Urtaza, J. (2025). Machine learning potential for identifying and forecasting complex environmental drivers of Vibrio vulnificus infections in the United States. Environmental Health Perspectives, 133(1), 017006.
dc.identifier.doi10.1289/EHP15593
dc.identifier.urihttps://hdl.handle.net/10347/45222
dc.issue.number1
dc.journal.titleEnvironmental Health Perspectives
dc.language.isoeng
dc.publisherPublic Health Services, US Dept of Health and Human Services
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-127107NB-I00/ES/GENOMICA DE POBLACIONES DE LA ADAPTACION
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/HE/101057554
dc.relation.publisherversionhttps://pmc.ncbi.nlm.nih.gov/articles/PMC11756857/
dc.rightsEHP is an open-access journal published with support from the National Institute of Environmental Health Sciences, National Institutes of Health. All content is public domain unless otherwise noted.
dc.rights.accessRightsopen access
dc.titleMachine Learning Potential for Identifying and Forecasting Complex Environmental Drivers of Vibrio vulnificus Infections in the United States
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number133
dspace.entity.typePublication
relation.isAuthorOfPublicationf3ae8b69-c8bc-44a7-a47c-2061d38f1d89
relation.isAuthorOfPublication.latestForDiscoveryf3ae8b69-c8bc-44a7-a47c-2061d38f1d89

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
envheaper_a2025m1v133n1p17006.pdf
Size:
18.54 MB
Format:
Adobe Portable Document Format