Enabling Efficient Distributed Spatial Join on Large Scale Vector-Raster Data Lakes
Loading...
Identifiers
Publication date
Advisors
Tutors
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Abstract
Both the increasing number of GPS-enabled mobile devices and the geographic crowd-sourcing initiatives, such as Open Street Map, are determinants for the large amount of vector spatial data that is currently being produced. On the other hand, the automatic generation of raster data by remote sensing devices and environmental modeling processes was always leading to very large datasets. Currently, huge data generation rates are reached by improved sensor observation systems and data processing infrastructures. As an example, the Sentinel Data Access System of the Copernicus Program of the European Space Agency (ESA) was publishing 38.71 TB of data per day during 2020. This paper shows how the assumption of a new spatial data model that includes multi-resolution parametric spatial data types, enables achieving an efficient implementation of a large scale distributed spatial analysis system for integrated vector-raster data lakes. In particular, the proposed implementation outperforms the state-of-the-art Spark-based spatial analysis systems by more than one order of magnitude during vector-raster spatial join evaluation.
Description
Keywords
Bibliographic citation
S. Villarroya, J. R. R. Viqueira, J. M. Cotos and J. A. Taboada, "Enabling Efficient Distributed Spatial Join on Large Scale Vector-Raster Data Lakes," in IEEE Access, vol. 10, pp. 29406-29418, 2022, doi: 10.1109/ACCESS.2022.3157405.
Relation
Has part
Has version
Is based on
Is part of
Is referenced by
Is version of
Requires
Publisher version
https://ieeexplore.ieee.org/document/9729731Sponsors
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International








