Evaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods
| dc.contributor.affiliation | Universidade de Santiago de Compostela. Departamento de Patoloxía Animal | |
| dc.contributor.author | Liu, Xin | |
| dc.contributor.author | Kim, Jeonghoon | |
| dc.contributor.author | Clavijo, María José | |
| dc.contributor.author | Martínez-López, Beatriz | |
| dc.contributor.author | Díaz Cao, José Manuel | |
| dc.date.accessioned | 2025-01-28T09:09:07Z | |
| dc.date.available | 2025-01-28T09:09:07Z | |
| dc.date.issued | 2023-09-08 | |
| dc.description.abstract | Anomaly detection methods have a great potential to assist the detection of diseases in animal production systems. We used sequence data of Porcine Reproductive and Respiratory Syndrome (PRRS) to define the emergence of new strains at the farm level. We evaluated the performance of anomaly detection methods based on machine learning, regression, time series techniques, and control charts to identify outbreaks in time series of new strains and compared the best methods using different time series: PCR positives, PCR requests, and laboratory requests. We introduced synthetic outbreaks of different sizes and calculated the probability of detection of outbreaks (POD), sensitivity (Se), probability of detection of outbreaks in the first week of appearance (POD1w), and background alarm rate (BAR). The use of time series of new strains from sequence data outperformed the other types of data, but POD, Se, and POD1w were only high when outbreaks were large. The methods based on Long Short-Term Memory (LSTM) and Bayesian approaches presented the best performance. Using anomaly detection methods with sequence data may help to identify the emergence of cases in multiple farms, but more work is required to improve detection with time series of high variability. Our results suggest a promising application of sequence data for early detection of diseases at a production system level. This may provide a simple way to extract additional value from routine laboratory analysis. Next steps should include validation of this approach in different settings and with different diseases. | |
| dc.description.peerreviewed | SI | |
| dc.description.sponsorship | NSF BigData AI award; 1838207 | |
| dc.description.sponsorship | NSF Convergence Accelerator Track-D award; 2134901 | |
| dc.description.sponsorship | USDA Awards; 2019-67015-28981 | |
| dc.description.sponsorship | USDA Awards; 2021-68014-34143 | |
| dc.description.sponsorship | Xunta de Galicia; 2019-HG005 | |
| dc.identifier.citation | Díaz-Cao, J. M., Liu, X., Kim, J., Clavijo, M. J., & Martínez-López, B. (2023). Evaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods. Veterinary research, 54(1), 75. https://doi.org/10.1186/s13567-023-01197-3 | |
| dc.identifier.doi | 10.1186/s13567-023-01197-3 | |
| dc.identifier.issn | 0928-4249 | |
| dc.identifier.uri | https://hdl.handle.net/10347/39111 | |
| dc.journal.title | Veterinary Research | |
| dc.language.iso | eng | |
| dc.publisher | Springer Nature | |
| dc.relation.publisherversion | https://veterinaryresearch.biomedcentral.com/articles/10.1186/s13567-023-01197-3 | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | en |
| dc.rights.accessRights | open access | |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject.classification | 310907 Patología | |
| dc.title | Evaluation of the application of sequence data to the identification of outbreaks of disease using anomaly detection methods | |
| dc.type | journal article | |
| dc.type.hasVersion | VoR | |
| dc.volume.number | 54 | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | ed7cf73d-46cc-442f-b16c-6180071c164b | |
| relation.isAuthorOfPublication.latestForDiscovery | ed7cf73d-46cc-442f-b16c-6180071c164b |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 2023_Evaluation of the application of sequence data.pdf
- Size:
- 2.98 MB
- Format:
- Adobe Portable Document Format