A survey on machine learning in array databases

dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías da Informacióngl
dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computacióngl
dc.contributor.areaÁrea de Enxeñaría e Arquitectura
dc.contributor.authorVillarroya Fernández, Sebastián
dc.contributor.authorBaumann, Peter
dc.date.accessioned2022-08-26T12:14:02Z
dc.date.available2022-08-26T12:14:02Z
dc.date.issued2022
dc.description.abstractThis paper provides an in-depth survey on the integration of machine learning and array databases. First,machine learning support in modern database management systems is introduced. From straightforward implementations of linear algebra operations in SQL to machine learning capabilities of specialized database managers designed to process specific types of data, a number of different approaches are overviewed. Then, the paper covers the database features already implemented in current machine learning systems. Features such as rewriting, compression, and caching allow users to implement more efficient machine learning applications. The underlying linear algebra computations in some of the most used machine learning algorithms are studied in order to determine which linear algebra operations should be efficiently implemented by array databases. An exhaustive overview of array data and relevant array database managers is also provided. Those database features that have been proven of special importance for efficient execution of machine learning algorithms are analyzed in detail for each relevant array database management system. Finally, current state of array databases capabilities for machine learning implementation is shown through two example implementations in Rasdaman and SciDBgl
dc.description.peerreviewedSIgl
dc.description.sponsorshipOpen Access funding provided thanks to the CRUE-CSIC agreement with Springer Naturegl
dc.identifier.citationApplied Intelligence (2022). https://doi.org/10.1007/s10489-022-03979-2gl
dc.identifier.doi10.1007/s10489-022-03979-2
dc.identifier.essn1573-7497
dc.identifier.issn0924-669X
dc.identifier.urihttp://hdl.handle.net/10347/29162
dc.language.isoenggl
dc.publisherSpringergl
dc.relation.publisherversionhttps://doi.org/10.1007/s10489-022-03979-2gl
dc.rights©2022, The Authors This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons. org/licenses/by/4.0/gl
dc.rights.accessRightsopen accessgl
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectArray datagl
dc.subjectArray database managersgl
dc.subjectMachine learninggl
dc.subjectEfficient array machine learninggl
dc.titleA survey on machine learning in array databasesgl
dc.typejournal articlegl
dc.type.hasVersionVoRgl
dspace.entity.typePublication
relation.isAuthorOfPublication8473f69a-64ab-4e16-8d71-ce1edce20b04
relation.isAuthorOfPublication.latestForDiscovery8473f69a-64ab-4e16-8d71-ce1edce20b04

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2022_appint_villarroya_survey.pdf
Size:
2.55 MB
Format:
Adobe Portable Document Format
Description: