FOSS4G 2022 general tracks

How to deal with a massive geographic database when surrounded by datascientists ?
2022-08-26, 14:15–14:45 (Europe/Rome), Room Limonaia

The Scientific and Technical Center for Building (CSTB) built the first French database of buildings and houses to address climate change challenge, helping knowledge and decision making for massive retrofit.
The pipeline factory intersects massive datasets (21 Millions buildings, >400 descriptors) and keeps adding new predictions and external datasets all the time. It allows to run analyses and predictions for all the climate change related indicators, such as housing price and energetic performance relation, heat wave impact, solar potential, etc..
While the first versions where a direct image of the classical datascientist’s approach -ie a massive dataframe driven by massive yaml config files and cryptic meta-templated scripts– ease of use and access performance soon became a limiting factor. This is a major concern since this dataset will be one long term foundation of derived information systems.
Between brute force approach based on scaling resources up, and the old fashioned « data diet » normalization and optimization process, the truth is not easy to find.
Abusing from cartoonish humor, this talk will try to explore the benefits of normalizing back hugely redundant geographic datasets and making public interfaces (public SQL model, API’s, vector tiles, OGC API’s) so that both end users can analyze efficiently this dataset, and the data manager team can rely on more stability using those good old’ database constraints.

Passionate open source GIS data manager since 2005, I have been deeply involved in the OSGeo and QGIS project. I worked for Oslandia from 2016 to 2021. I decided to focus back on geodata management applied to energy transition and massive housing retrofit in 2021 at the Scientific and Technical Center for Building (CSTB). I was the previsou chairman of the French Osgeo local chapter and am still involved in the current board.

This speaker also appears in: