FOSS4G 2023

OpenDataCube Fast Deploy using Docker (Fast Cubing)
06-28, 15:10–15:15 (Europe/Tirane), UBT D / N113 - Second Floor

Geospatial information from satellites is increasingly being used by decision-makers and scientists alike. However, there are two fundamental issues with this kind of data and related handling technologies. Firstly, data processing typically requires long time and a-priori expert knowledge compared to traditional data sources. Second, integrating satellite data into processing pipelines can be expensive in terms of software and application development efforts. The OpenDataCube (ODC) was created to help users solve these issues. Although ODC offers an alternative to being used as a data management application, its deployment is typically challenging for inexperienced users. Therefore, the primary purpose of this work is to provide potential ODC users with a ready-to-use, portable instance of this software.
The software is produced and published in a Docker container. In comparison to the traditional installation and configuration of the ODC, the tool proposed here provides an environment where the ODC database is already set up. It helps to avoid occasional conflicts that are common in SQL and Python installations. Even though other ODC implementations are available as a Docker container, the proposed solution has some advantages. Specifically, Python geospatial libraries are integrated in the container to support data manipulation. While available ODC instances are designed to process satellite images only (mainly Sentinel and Landsat data), the tool contains scripts to automatically adapt and ingest non-satellite data (e.g. raw ground-sensor network data, land cover/soil maps, etc.) by creating also metadata files when they are missing. The proposed solution makes available processing pipelines to re-grid, georeference and import datasets into the ODC. Both scripts and pipelines can be used through Jupyter notebook interfaces, which allow users also to perform exploratory analyses on the ingested data.
The source code is available at (https://github.com/gisgeolab/LCZ-ODC) and is released under a MIT license. The software is being developed within the LCZ-ODC project (agreement n. 2022-30-HH.0) funded by the Italian Space Agency (ASI) and aimed to identify Local Climate Zones within the Metropolitan City of Milan. Given the nature of the datacube development, this tool promotes Open Geospatial Consortium (OGC) compliant data sharing. Ongoing work focuses on the development and integration of additional pre-processing scripts with the aim of supporting the ingestion of additional types of data as well as providing new ready-to-use embedded processing functionalities.

Rodrigo Cedeño was born in Mexico City. After obtaining the B.Sc. in Mechatronics Engineering he worked for 8 years at FIAT, first as a Motion Analysis Engineer and afterwards as a Vehicle Dynamics Engineer. In 2021 he obtained an M.Sc. in Geoinformatics Engineering at Politecnico di Milano. In 2022 he obtained an MSc. in Environmental and Land Engineering at Politecnico di Torino. During his Master's he participated in the multidisciplinary and international honour program Alta Scuola Politecnica. Currently, he is a PhD student in Geomatics at Politecnico di Milano. His main fields of research are Artificial Intelligence applied to atmospheric pollution using Earth Observation. He is also specialized in open geospatial infrastructures such as the Open Data Cube and Blockchain.