Unravelling cyclic peptide membrane permeability prediction: a study on data augmentation, architecture choices, and representation schemes
| dc.contributor.affiliation | Universidade de Santiago de Compostela. Departamento de Química Orgánica | |
| dc.contributor.affiliation | Universidade de Santiago de Compostela. Centro de Investigación en Química Biolóxica e Materiais Moleculares (CiQUS) | |
| dc.contributor.author | Cabezón Vizoso, Alfonso | |
| dc.contributor.author | Otović, Erik | |
| dc.contributor.author | Kalafatovic, Daniela | |
| dc.contributor.author | Piñeiro Guillén, Ángel | |
| dc.contributor.author | García Fandiño, Rebeca | |
| dc.contributor.author | Mauša, Goran | |
| dc.date.accessioned | 2025-10-13T11:45:56Z | |
| dc.date.available | 2025-10-13T11:45:56Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Cyclic peptides have emerged as promising candidates for drug development due to their unique structural properties and potential therapeutic benefits. However, clinical applications are limited by their low membrane permeability, which is difficult to predict. This study explores the impact of data augmentation and the inclusion of cyclic structure information in ML modeling to enhance the prediction of membrane permeability of cyclic peptides from their amino acid sequence. Various peptide representation strategies in combination with data augmentation techniques based on amino acid mutations and cyclic permutations were investigated to address the limited availability of experimental data. Moreover, cyclic convolutional layers were explored to explicitly model the cyclic nature of the peptides. The results indicated that combining sequential and peptide properties demonstrated superior performance across multiple metrics. The model performance is highly sensitive to the number and degree of similarity of amino acids involved in mutations. Cyclic permutations improved model performance, particularly in a larger and more diverse dataset and standard architectures captured most of the relevant cyclic information. Highlighting the complexity of peptide-membrane interactions, these results lay a foundation for future improvements in computational methods for the design of cyclic peptide drugs and offer practical guidelines for researchers in this field. The best-performing model was integrated into a user-friendly web-based tool, CYCLOPS: CYCLOpeptide Permeability Simulator (available at http://cyclopep.com/cyclops), to facilitate wider accessibility and application in drug discovery community. This tool allows for rapid predictions of the membrane permeability for cyclic peptides with a classification accuracy score of 0.824 and a regression mean absolute error of 0.477. | |
| dc.description.peerreviewed | SI | |
| dc.description.sponsorship | This work was supported by the Croatian Science Foundation [grant numbers UIP-2019-04-7999 (D. K.) and DOK-2020-01-4659 (G. M.)]; the University of Rijeka [grant numbers UNIRI-23-78 (G. M.), UNIRI-INOVA-3-23-1 (G. M.), UNIRI-23-16 (D. K.), UNIRI-INOVA-3-23-2 (D. K.)]; by the Spanish Agencia Estatal de Investigación (AEI) and the ERDF [PDC2022-133402-I00 (R. GF.), PID2022-141534OB-I00 (R. GF.) and CNS2023-144353 (R. GF.)]; by Xunta de Galicia [proceedings ED431C 2021/21 (R.GF.), ED481A 2023/1 (A. C.) and Centro de investigación do Sistema universitario de Galicia accreditation 2023–2027, ED431G 2023/ 03 (CiQUS)]; and the European Union [European Regional Development Fund – ERDF]. This publication is based upon work from COST Action CA23111 – SNOOPY, supported by COST (European Cooperation in Science and Technology). All calculations were carried out at the Centro de Supercomputación de Galicia (CESGA) | |
| dc.identifier.citation | Cabezón, A., Otović, E., Kalafatovic, D., Piñeiro, Á., García-Fandiño, R., Mauša, G. (2025). Unravelling cyclic peptide membrane permeability prediction: A study on data augmentation, architecture choices, and representation schemes. "Digital Discovery", 4, 1259-1275 | |
| dc.identifier.doi | 10.1039/d4dd00375f | |
| dc.identifier.essn | 2635-098X | |
| dc.identifier.uri | https://hdl.handle.net/10347/43064 | |
| dc.issue.number | 4 | |
| dc.journal.title | Digital Discovery | |
| dc.language.iso | eng | |
| dc.page.final | 1275 | |
| dc.page.initial | 1259 | |
| dc.publisher | Royal Society of Chemistry | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PDC2022-133402-I00/ES | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-141534OB-I000/ES/DESCIFRANDO LA CONEXION DEL CODIGO LIPIDICO ENTRE CANCER, INFECCION Y ENVEJECIMIENTO: HACIA HERRAMIENTAS TERANOSTICAS NO CONVENCIONALES Y VACUNAS BASADAS EN LA MEMORIA INNATA | |
| dc.relation.publisherversion | https://doi.org/10.1039/D4DD00375F | |
| dc.rights | Attribution 4.0 International | en |
| dc.rights.accessRights | open access | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject.classification | 2306 Química orgánica | |
| dc.title | Unravelling cyclic peptide membrane permeability prediction: a study on data augmentation, architecture choices, and representation schemes | |
| dc.type | journal article | |
| dc.type.hasVersion | VoR | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | f4d82ce1-22fa-4ac4-a7f7-71690607ae55 | |
| relation.isAuthorOfPublication | 7207f196-ba01-47c3-a5a7-dac268e007d3 | |
| relation.isAuthorOfPublication.latestForDiscovery | f4d82ce1-22fa-4ac4-a7f7-71690607ae55 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 2025_digital discovery_garcia fandiño_unravelling.pdf
- Size:
- 1004.92 KB
- Format:
- Adobe Portable Document Format