Diversity and random forest models of oral microbiomes in periodontal health using publicly available data

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Estomatoloxía
dc.contributor.authorRegueira Iglesias, Alba
dc.contributor.authorSuárez Rodríguez, Berta
dc.contributor.authorBlanco Pintos, Triana
dc.contributor.authorSánchez Barco, Alba
dc.contributor.authorRelvas, Marta Mendonça Moutinho
dc.contributor.authorBalsa Castro, Carlos
dc.contributor.authorTomás Carmona, Inmaculada
dc.date.accessioned2025-11-10T08:40:27Z
dc.date.available2025-11-10T08:40:27Z
dc.date.issued2025-08-21
dc.description.abstractBackground Evidence on the 16S metabarcoding of supragingival, subgingival, and salivary microbiomes in periodontal health remains limited. We aimed to analyze the diversity and potential of machine-learning models of supragingival, subgingival, and salivary microbiomes in periodontal health. Methods A total of 848 samples (supragingival = 210; subgingival = 155; saliva = 483) from 491 periodontally healthy subjects were included. Publicly available Illumina sequences were processed with mothur, and taxonomy was assigned using an oral-specific database. Random forest (RF) models were built on the training set (2/3 of the samples) using a 3-fold cross-validation. They were tested on the test set (1/3). Results A total of 121 amplicon sequence variants (ASVs) presented with differential abundances between the two types of plaque, 212 between the supragingival and saliva samples, and 160 between the subgingival and saliva (p < 0.01). Furthermore, the supragingival versus subgingival model consisted of five ASVs. The performance parameters on the test set were area under the curve (AUC) = 0.908, accuracy (ACC) = 84.30%, sensitivity = 95.71%, and specificity = 68.63%. Both the supragingival and subgingival versus saliva models also had five ASVs. These two models revealed similar performance (AUC = 0.992 and 0.986, ACC > 95%, sensitivity > 90%, specificity > 95%). Conclusion Although supragingival and subgingival bacterial profiles diverged only modestly, primarily due to taxa with small effect sizes, they were both compositionally distinct from the salivary microbiome. RF models accurately classified samples by niche, with higher performance in distinguishing saliva from plaques. Specific ASVs from Escherichia, Fusobacterium, Granulicatella, Treponema, Peptostreptococcaceae [XI][G-9], and Prevotella were identified in subgingival plaque, while Oribacterium and Solobacterium were identified in saliva, indicating potential niche-specific microbial signatures in periodontal health. Plain Language Summary Mapping oral microbes in relation to periodontal health is essential for microbiome-based diagnostics and the development of new preventive/therapeutic strategies. Our two-by-two predictive models demonstrated that a small set of bacterial ASVs can accurately classify periodontally healthy samples according to their oral niche. Notably, models distinguishing saliva from dental plaques achieved superior performance compared to those discriminating between plaques. This likely reflects the greater resemblance in dominant microbial taxa between the two plaque niches. These findings underscore the potential of machine-learning approaches to identify key microbial signatures and highlight the predictive ASVs as promising biomarkers for characterizing oral niches in periodontal health.
dc.description.peerreviewedSI
dc.description.sponsorshipThis study has been funded by the Instituto de Salud Calos III (ISCIII) through project PI24/00222 and co-fundedby the European Union (EU). The funders had no role inthe study’s design, data collection and analysis, publicationchoice, or manuscript preparation.
dc.identifier.citationRegueira-Iglesias A,Suárez-Rodríguez B, Blanco-Pintos T, et al.Diversity and random forest models of oralmicrobiomes in periodontal health using publiclyavailable data. J Periodontol. 2025;1-14.https://doi.org/10.1002/jper.70000
dc.identifier.doi10.1002/jper.70000
dc.identifier.issn0022-3492
dc.identifier.urihttps://hdl.handle.net/10347/43633
dc.journal.titleJournal of Periodontology
dc.language.isoeng
dc.page.final14
dc.page.initial1
dc.publisherWiley
dc.relation.publisherversionhttps://doi.org/10.1002/jper.70000
dc.rights© 2025 The Author(s). Journal of Periodontology published by Wiley Periodicals LLC on behalf of American Academy of Periodontology. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License, which permits use and distribution in any medium,provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.© 2025 The Author(s). Journal of Periodontology published by Wiley Periodicals LLC on behalf of American Academy of Periodontology.
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject16S
dc.subjectrRNA gene
dc.subjectDental plaque
dc.subjectMachine learning
dc.subjectMicrobiome
dc.subjectPeriodontal health
dc.subjectSaliva
dc.subjectSequencing
dc.titleDiversity and random forest models of oral microbiomes in periodontal health using publicly available data
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublication5cd58015-b2e7-479a-afe0-4d8b4a7594ff
relation.isAuthorOfPublicationf9cb8fca-ac19-4df5-819b-5a2dcf6ce966
relation.isAuthorOfPublication.latestForDiscovery5cd58015-b2e7-479a-afe0-4d8b4a7594ff

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2025_journal_tomas_diversity.pdf
Size:
1.49 MB
Format:
Adobe Portable Document Format