Unravelling cyclic peptide membrane permeability prediction: a study on data augmentation, architecture choices, and representation schemes

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Química Orgánica
dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Química Biolóxica e Materiais Moleculares (CiQUS)
dc.contributor.authorCabezón Vizoso, Alfonso
dc.contributor.authorOtović, Erik
dc.contributor.authorKalafatovic, Daniela
dc.contributor.authorPiñeiro Guillén, Ángel
dc.contributor.authorGarcía Fandiño, Rebeca
dc.contributor.authorMauša, Goran
dc.date.accessioned2025-10-13T11:45:56Z
dc.date.available2025-10-13T11:45:56Z
dc.date.issued2025
dc.description.abstractCyclic peptides have emerged as promising candidates for drug development due to their unique structural properties and potential therapeutic benefits. However, clinical applications are limited by their low membrane permeability, which is difficult to predict. This study explores the impact of data augmentation and the inclusion of cyclic structure information in ML modeling to enhance the prediction of membrane permeability of cyclic peptides from their amino acid sequence. Various peptide representation strategies in combination with data augmentation techniques based on amino acid mutations and cyclic permutations were investigated to address the limited availability of experimental data. Moreover, cyclic convolutional layers were explored to explicitly model the cyclic nature of the peptides. The results indicated that combining sequential and peptide properties demonstrated superior performance across multiple metrics. The model performance is highly sensitive to the number and degree of similarity of amino acids involved in mutations. Cyclic permutations improved model performance, particularly in a larger and more diverse dataset and standard architectures captured most of the relevant cyclic information. Highlighting the complexity of peptide-membrane interactions, these results lay a foundation for future improvements in computational methods for the design of cyclic peptide drugs and offer practical guidelines for researchers in this field. The best-performing model was integrated into a user-friendly web-based tool, CYCLOPS: CYCLOpeptide Permeability Simulator (available at http://cyclopep.com/cyclops), to facilitate wider accessibility and application in drug discovery community. This tool allows for rapid predictions of the membrane permeability for cyclic peptides with a classification accuracy score of 0.824 and a regression mean absolute error of 0.477.
dc.description.peerreviewedSI
dc.description.sponsorshipThis work was supported by the Croatian Science Foundation [grant numbers UIP-2019-04-7999 (D. K.) and DOK-2020-01-4659 (G. M.)]; the University of Rijeka [grant numbers UNIRI-23-78 (G. M.), UNIRI-INOVA-3-23-1 (G. M.), UNIRI-23-16 (D. K.), UNIRI-INOVA-3-23-2 (D. K.)]; by the Spanish Agencia Estatal de Investigación (AEI) and the ERDF [PDC2022-133402-I00 (R. GF.), PID2022-141534OB-I00 (R. GF.) and CNS2023-144353 (R. GF.)]; by Xunta de Galicia [proceedings ED431C 2021/21 (R.GF.), ED481A 2023/1 (A. C.) and Centro de investigación do Sistema universitario de Galicia accreditation 2023–2027, ED431G 2023/ 03 (CiQUS)]; and the European Union [European Regional Development Fund – ERDF]. This publication is based upon work from COST Action CA23111 – SNOOPY, supported by COST (European Cooperation in Science and Technology). All calculations were carried out at the Centro de Supercomputación de Galicia (CESGA)
dc.identifier.citationCabezón, A., Otović, E., Kalafatovic, D., Piñeiro, Á., García-Fandiño, R., Mauša, G. (2025). Unravelling cyclic peptide membrane permeability prediction: A study on data augmentation, architecture choices, and representation schemes. "Digital Discovery", 4, 1259-1275
dc.identifier.doi10.1039/d4dd00375f
dc.identifier.essn2635-098X
dc.identifier.urihttps://hdl.handle.net/10347/43064
dc.issue.number4
dc.journal.titleDigital Discovery
dc.language.isoeng
dc.page.final1275
dc.page.initial1259
dc.publisherRoyal Society of Chemistry
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PDC2022-133402-I00/ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-141534OB-I000/ES/DESCIFRANDO LA CONEXION DEL CODIGO LIPIDICO ENTRE CANCER, INFECCION Y ENVEJECIMIENTO: HACIA HERRAMIENTAS TERANOSTICAS NO CONVENCIONALES Y VACUNAS BASADAS EN LA MEMORIA INNATA
dc.relation.publisherversionhttps://doi.org/10.1039/D4DD00375F
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.classification2306 Química orgánica
dc.titleUnravelling cyclic peptide membrane permeability prediction: a study on data augmentation, architecture choices, and representation schemes
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublicationf4d82ce1-22fa-4ac4-a7f7-71690607ae55
relation.isAuthorOfPublication7207f196-ba01-47c3-a5a7-dac268e007d3
relation.isAuthorOfPublication.latestForDiscoveryf4d82ce1-22fa-4ac4-a7f7-71690607ae55

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2025_digital discovery_garcia fandiño_unravelling.pdf
Size:
1004.92 KB
Format:
Adobe Portable Document Format