Efficient query over large datasets of analytical chemistry

dc.contributor.advisorRíos Viqueira, José Ramón
dc.contributor.advisorFernández Pena, Anselmo Tomás
dc.contributor.affiliationUniversidade de Santiago de Compostela. Escola de Doutoramento Internacional (EDIUS)
dc.contributor.authorLuaces Cachaza, David
dc.date.accessioned2023-08-07T07:19:45Z
dc.date.available2023-08-07T07:19:45Z
dc.date.issued2023
dc.description.abstractThe efficient management of molecular data is one of the most demanded technologies by the industry. A very important type of search is the substructure searching. The molecular structures may be encoded as graphs where the vertices and bonds represent the atoms and bonds, respectively. In this Thesis, a cutting edge system that enables the storage and querying of molecular data has been designed and implemented, paying attention to the molecular substructure search, where new filter-then-verify(FTV) methods, beyond the state-of-the-art, were designed, implemented, and tested, achieving performance gains over 75% in the filtering stage. A generic framework for the implementation of FTV techniques on a distributed architecture was also developed, enabling the application of the FTV methods on very large graph databases, achieving a great performance gain in both index building and query execution. Finally, the Thesis presents a study for the use of different FTV solutions to obtain approximate results in an interactive searching application.es_ES
dc.description.programaUniversidade de Santiago de Compostela. Programa de Doutoramento en Investigación en Tecnoloxías da Información
dc.identifier.urihttp://hdl.handle.net/10347/30938
dc.language.isoenges_ES
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectmolecular databaseses_ES
dc.subjectgraph databaseses_ES
dc.subjectsubgraph searches_ES
dc.subjectgraph query processinges_ES
dc.subjectgraph indexinges_ES
dc.subjectsubgraph isomorphismes_ES
dc.subjectlarge scale processinges_ES
dc.subjectcheminformaticses_ES
dc.subject.classification120312 Bancos de datoses_ES
dc.subject.classification120317 Informáticaes_ES
dc.subject.classification330418 Dispositivos de almacenamientoes_ES
dc.titleEfficient query over large datasets of analytical chemistryes_ES
dc.typedoctoral thesises_ES
dspace.entity.typePublication
relation.isAdvisorOfPublication61678fc8-bbf4-4466-8736-0d433fbaba1e
relation.isAdvisorOfPublicationdecb372f-b9cd-4237-8dda-2c0f5c40acbe
relation.isAdvisorOfPublication.latestForDiscovery61678fc8-bbf4-4466-8736-0d433fbaba1e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
rep_3155.pdf
Size:
12.1 MB
Format:
Adobe Portable Document Format
Description: