Enabling Efficient Distributed Spatial Join on Large Scale Vector-Raster Data Lakes

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.authorRíos Viqueira, José Ramón
dc.contributor.authorCotos Yáñez, José Manuel
dc.contributor.authorTaboada González, José Ángel
dc.contributor.authorVillarroya Fernández, Sebastián
dc.date.accessioned2025-02-03T08:01:17Z
dc.date.available2025-02-03T08:01:17Z
dc.date.issued2022-03-08
dc.description.abstractBoth the increasing number of GPS-enabled mobile devices and the geographic crowd-sourcing initiatives, such as Open Street Map, are determinants for the large amount of vector spatial data that is currently being produced. On the other hand, the automatic generation of raster data by remote sensing devices and environmental modeling processes was always leading to very large datasets. Currently, huge data generation rates are reached by improved sensor observation systems and data processing infrastructures. As an example, the Sentinel Data Access System of the Copernicus Program of the European Space Agency (ESA) was publishing 38.71 TB of data per day during 2020. This paper shows how the assumption of a new spatial data model that includes multi-resolution parametric spatial data types, enables achieving an efficient implementation of a large scale distributed spatial analysis system for integrated vector-raster data lakes. In particular, the proposed implementation outperforms the state-of-the-art Spark-based spatial analysis systems by more than one order of magnitude during vector-raster spatial join evaluation.
dc.description.peerreviewedSI
dc.identifier.citationS. Villarroya, J. R. R. Viqueira, J. M. Cotos and J. A. Taboada, "Enabling Efficient Distributed Spatial Join on Large Scale Vector-Raster Data Lakes," in IEEE Access, vol. 10, pp. 29406-29418, 2022, doi: 10.1109/ACCESS.2022.3157405.
dc.identifier.doi10.1109/ACCESS.2022.3157405
dc.identifier.essn2169-3536
dc.identifier.urihttps://hdl.handle.net/10347/39471
dc.journal.titleIEEE Access
dc.language.isoeng
dc.publisherIEEE
dc.relation.publisherversionhttps://ieeexplore.ieee.org/document/9729731
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleEnabling Efficient Distributed Spatial Join on Large Scale Vector-Raster Data Lakes
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublication61678fc8-bbf4-4466-8736-0d433fbaba1e
relation.isAuthorOfPublicationdf8d5480-a8c8-43ec-8e3b-cf5a939ad831
relation.isAuthorOfPublication8bb83013-4d6a-47a0-b809-d167fb797e9e
relation.isAuthorOfPublication8473f69a-64ab-4e16-8d71-ce1edce20b04
relation.isAuthorOfPublication.latestForDiscoverydf8d5480-a8c8-43ec-8e3b-cf5a939ad831

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2022_Enabling_Efficient.pdf
Size:
1.91 MB
Format:
Adobe Portable Document Format