Evaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Patoloxía Animal
dc.contributor.authorLiu, Xin
dc.contributor.authorKim, Jeonghoon
dc.contributor.authorClavijo, María José
dc.contributor.authorMartínez-López, Beatriz
dc.contributor.authorDíaz Cao, José Manuel
dc.date.accessioned2025-01-28T09:09:07Z
dc.date.available2025-01-28T09:09:07Z
dc.date.issued2023-09-08
dc.description.abstractAnomaly detection methods have a great potential to assist the detection of diseases in animal production systems. We used sequence data of Porcine Reproductive and Respiratory Syndrome (PRRS) to define the emergence of new strains at the farm level. We evaluated the performance of anomaly detection methods based on machine learning, regression, time series techniques, and control charts to identify outbreaks in time series of new strains and compared the best methods using different time series: PCR positives, PCR requests, and laboratory requests. We introduced synthetic outbreaks of different sizes and calculated the probability of detection of outbreaks (POD), sensitivity (Se), probability of detection of outbreaks in the first week of appearance (POD1w), and background alarm rate (BAR). The use of time series of new strains from sequence data outperformed the other types of data, but POD, Se, and POD1w were only high when outbreaks were large. The methods based on Long Short-Term Memory (LSTM) and Bayesian approaches presented the best performance. Using anomaly detection methods with sequence data may help to identify the emergence of cases in multiple farms, but more work is required to improve detection with time series of high variability. Our results suggest a promising application of sequence data for early detection of diseases at a production system level. This may provide a simple way to extract additional value from routine laboratory analysis. Next steps should include validation of this approach in different settings and with different diseases.
dc.description.peerreviewedSI
dc.description.sponsorshipNSF BigData AI award; 1838207
dc.description.sponsorshipNSF Convergence Accelerator Track-D award; 2134901
dc.description.sponsorshipUSDA Awards; 2019-67015-28981
dc.description.sponsorshipUSDA Awards; 2021-68014-34143
dc.description.sponsorshipXunta de Galicia; 2019-HG005
dc.identifier.citationDíaz-Cao, J. M., Liu, X., Kim, J., Clavijo, M. J., & Martínez-López, B. (2023). Evaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods. Veterinary research, 54(1), 75. https://doi.org/10.1186/s13567-023-01197-3
dc.identifier.doi10.1186/s13567-023-01197-3
dc.identifier.issn0928-4249
dc.identifier.urihttps://hdl.handle.net/10347/39111
dc.journal.titleVeterinary Research
dc.language.isoeng
dc.publisherSpringer Nature
dc.relation.publisherversionhttps://veterinaryresearch.biomedcentral.com/articles/10.1186/s13567-023-01197-3
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.classification310907 Patología
dc.titleEvaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number54
dspace.entity.typePublication
relation.isAuthorOfPublicationed7cf73d-46cc-442f-b16c-6180071c164b
relation.isAuthorOfPublication.latestForDiscoveryed7cf73d-46cc-442f-b16c-6180071c164b

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2023_Evaluation of the application of sequence data.pdf
Size:
2.98 MB
Format:
Adobe Portable Document Format