Evaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods

Loading...
Thumbnail Image
Identifiers

Publication date

Advisors

Tutors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

Springer Nature
Metrics
Google Scholar
lacobus
Export

Research Projects

Organizational Units

Journal Issue

Abstract

Anomaly detection methods have a great potential to assist the detection of diseases in animal production systems. We used sequence data of Porcine Reproductive and Respiratory Syndrome (PRRS) to define the emergence of new strains at the farm level. We evaluated the performance of anomaly detection methods based on machine learning, regression, time series techniques, and control charts to identify outbreaks in time series of new strains and compared the best methods using different time series: PCR positives, PCR requests, and laboratory requests. We introduced synthetic outbreaks of different sizes and calculated the probability of detection of outbreaks (POD), sensitivity (Se), probability of detection of outbreaks in the first week of appearance (POD1w), and background alarm rate (BAR). The use of time series of new strains from sequence data outperformed the other types of data, but POD, Se, and POD1w were only high when outbreaks were large. The methods based on Long Short-Term Memory (LSTM) and Bayesian approaches presented the best performance. Using anomaly detection methods with sequence data may help to identify the emergence of cases in multiple farms, but more work is required to improve detection with time series of high variability. Our results suggest a promising application of sequence data for early detection of diseases at a production system level. This may provide a simple way to extract additional value from routine laboratory analysis. Next steps should include validation of this approach in different settings and with different diseases.

Description

Keywords

Bibliographic citation

Díaz-Cao, J. M., Liu, X., Kim, J., Clavijo, M. J., & Martínez-López, B. (2023). Evaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods. Veterinary research, 54(1), 75. https://doi.org/10.1186/s13567-023-01197-3

Relation

Has part

Has version

Is based on

Is part of

Is referenced by

Is version of

Requires

Sponsors

NSF BigData AI award; 1838207
NSF Convergence Accelerator Track-D award; 2134901
USDA Awards; 2019-67015-28981
USDA Awards; 2021-68014-34143
Xunta de Galicia; 2019-HG005

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International