FOSS4G 2022 general tracks

Processing and publishing big data with GeoServer and Azure in the cloud
2022-08-26, 12:00–12:30 (Europe/Rome), Room Verde

The amount of data we have to process and publish keeps growing every day, fortunately, the infrastructure, technologies, and methodologies to handle such streams of data keep improving and maturing. GeoServer is a web service for publishing your geospatial data using industry standards for vector, raster, and mapping. It powers a number of open source projects like GeoNode and geOrchestra and it is widely used throughout the world by organizations to manage and disseminate data at scale. We integrated GeoServer with some well-known big data technologies like Kafka and Databricks, and deployed the systems in Azure cloud, to handle use cases that required near-realtime displaying of the latest received data on a map as well background batch processing of historical data.

This presentation will describe the architecture put in place, and the challenges that GeoSolutions had to overcome to publish big data through GeoServer OGC services (WMS, WFS, and WPS), finding the correct balance that maximized ingestion performance and visualization performance. We had to integrate with a streaming processing platform that took care of most of the processing and storing of the data in an Azure data lake that allows GeoServer to efficiently query for the latest available features, respecting all the authorization policies that were put in place. A few custom GeoServer extensions were implemented to handle the authorization complexity, the advanced styling needs, and big data integration needs.

Nuno earned his BS and MS in Software Engineering from the University of Minho. He started his career in the telecommunications industry by developing solutions for managing and monitoring telecommunications infrastructures. Currently, he works at GeoSolutions where he develops advanced solutions for GIS challenges using open-source software. In the last years, he focused on distributed systems, big data technologies, and GIS. He contributes to several open-source projects and is a committer of GeoServer, MapStore, and GeoTools.

This speaker also appears in: