QSAR Study for Macromolecular RNA Folded Secondary Structures of Mycobacterial Promoters with Low Sequence Homology

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Química Orgánicagl
dc.contributor.authorGonzález Díaz, Humberto
dc.contributor.authorPérez Bello, Alcides
dc.contributor.authorUriarte Villares, Eugenio
dc.contributor.authorGonzález Díaz, Yenny
dc.date.accessioned2021-08-19T12:22:23Z
dc.date.available2021-08-19T12:22:23Z
dc.date.issued2005
dc.descriptionThe 9th International Electronic Conference on Synthetic Organic Chemistry session Computational Chemistrygl
dc.description.abstractThe general belief is that quantitative structure-activity relationships (QSAR) techniques work only for small molecules and, proteins sequences or, more recently, DNA sequences. However, with non-branched graph for proteins and DNA sequences the QSAR often have to be based on powerful non-linear techniques such as support vector machines. In our opinion linear QSAR models based in RNA could be useful to assign biological activity when alignment techniques fail due to low sequence homology. The idea bases in the high level of branching for the RNA graph. This work introduces the so called Markov electrostatic potentials k?M as a new class of RNA 2D-structure descriptors. Subsequently, we validate these molecular descriptors solving a QSAR classification problem for mycobacterial promoter sequences (mps), which constitute a very low sequence homology problem. The model developed (mps = –4.664·0cM + 0.991·1cM – 2.432) was intended to predict whether a naturally occurring sequence is an mps or not on the basis of the calculated kcM value for the corresponding RNA secondary structure. The RNAQSAR approach recognises 115/135 mps (85.2%) and 100% of control sequences. Average predictability and robustness were greater than 95%. A previous non-linear model predicts mps with slightly higher accuracy (97%) but uses a very large parameter space for DNA sequences. Conversely, the kcM-based RNA-QSAR encodes more structural information and needs only two variablesgl
dc.description.sponsorshipGonzález-Díaz, H. thanks the Xunta de Galicia (BTF20301PR) for partial financial supportgl
dc.identifier.citationProceedings of the 9th International Electronic Conference on Synthetic Organic Chemistry, 1–30 November 2005, MDPI: Basel, Switzerland, doi:10.3390/ecsoc-9-01654gl
dc.identifier.doi10.3390/ecsoc-9-01654
dc.identifier.isbn3-906980-16-2
dc.identifier.urihttp://hdl.handle.net/10347/26888
dc.language.isoenggl
dc.publisherMDPIgl
dc.relation.ispartofseriesElectronic Conference on Synthetic Organic Chemistry;9
dc.relation.publisherversionhttps://doi.org/10.3390/ecsoc-9-01654gl
dc.rights© 2005 The author(s). Published by MDPI, Basel, Switzerland. Open Accessgl
dc.rights.accessRightsopen accessgl
dc.subjectQSARgl
dc.subjectRNA secondary structuregl
dc.subjectSequence homologygl
dc.subjectMarkov modelgl
dc.subjectMycobacterial promotersgl
dc.subjectElectrostatic potentialgl
dc.titleQSAR Study for Macromolecular RNA Folded Secondary Structures of Mycobacterial Promoters with Low Sequence Homologygl
dc.typebook partgl
dspace.entity.typePublication
relation.isAuthorOfPublication769c5d0c-04c9-43f2-89dc-e4eb770227d5
relation.isAuthorOfPublication.latestForDiscovery769c5d0c-04c9-43f2-89dc-e4eb770227d5

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2005_ecsoc9_gonzalez_qsar.pdf
Size:
749.93 KB
Format:
Adobe Portable Document Format
Description: