From Tile-Based Processing to Distributed Execution: Extending the mapchete stack
2026-06-29 , A13

Over the past years, mapchete has evolved from a tile-based raster and vector processing library into a modular ecosystem for building and operating large-scale geospatial data processing pipelines. Previous presentations at FOSS4G focused on the core package and touching scalable processing patterns using dask and mapchete.

This talk presents the next step: the open source publication of additional components developed in a production context, including mapchete EO, mapchete Hub, and mapchete Hub CLI. These packages extend mapchete's core processing model towards Earth Observation (EO) use cases and distributed execution, with a focus on reproducibility, scalability, and a variety of (pre)processesing capabilities relevant for EO.

mapchete EO provides higher-level primitives for working with satellite imagery (primarily Sentinel-2), including typical preprocessing steps such as cloud masking, BRDF correction, and temporal compositing. These components are derived from operational pipelines used in the EOxCloudles (cloudless.eox.at) product line, where consistent large-scale processing and data quality constraints are critical.

mapchete Hub introduces a service layer for orchestrating distributed processing of mapchete tasks. Processing jobs can be submitted, scheduled, and monitored via an API. The API design is oriented towards the OGC API - Processes standard, aligning mapchete-based workflows with emerging interoperable interfaces in the geospatial ecosystem. The accompanying CLI (mapchete hub CLI) provides a minimal interface for interacting with this system without requiring custom integration.

In addition to the software components, the talk covers recent changes in packaging and distribution. All packages are now published via both PyPI and Conda, and container images are provided through GitHub Container Registry. All packages were moved to a dedicated mapchete organization.


Indicate what is (are) the open source project(s) essential in your talk:

mapchete
mapchete-eo
mapchete-hub
mapchete-hub-cli

Give indication of resources (video, web pages, papers, etc.) to read in advance, that will help get up to speed on advanced topics.:

https://github.com/mapchete/
https://cloudless.eox.at/

Assign a number between 1 and 4 indicating the level of technical complexity of your contribution.: 4: advanced technical/ thematicskills required Select at least one general theme that best defines your proposal: Software status, new project development, Analysis, manipulation and visualization of geospatial data Under which license do you make your contribution available? The conference contribution comprises the abstract, the text contribution for the conference proceedings, the presentation materials as well as the video recording and live transmission of the presentation: CC BY