Sebastiaan Schmidt


Session

06-30
15:00
30min
Asking a Province a Question: LLMs, H3, and Open Dutch Geodata
Thijs Oosterhuis, Sebastiaan Schmidt

The Province of South Holland manages one of the most densely packed, data-rich regions in Europe. Housing pressure, nitrogen deposition, flooding risk, ageing infrastructure, biodiversity loss — the questions are urgent, and the data to answer them mostly already exists. It's public, it's free, and almost none of it talks to anything else.
We built something to fix that.
Over the past two years, we've assembled a spatial data warehouse for Zuid-Holland: an H3 geo-datacube at resolution 9, combining six years of Dutch open datasets — demographics and housing (CBS), land use and nature (LGN), water quality (IHW/KRW), air quality (RIVM), ground height (AHN), noise exposure, and accessibility distances. Every dataset on the same hexagonal grid. Every hexagon its own small story about a patch of land.
Then we gave it a voice.
A natural-language assistant — built entirely on open source tools — lets policy advisors, planners, and analysts ask questions in plain Dutch and get a map back in seconds. No SQL. No data wrangling. No waiting for a colleague who knows which table holds what. "Where did nature expand while population shrank between 2018 and 2023?" becomes a map. "Which areas combine flooding risk with high housing pressure and poor accessibility?" becomes a map. Decisions that used to take a week of preparation start with a conversation.
The stack is fully open: LangGraph for the AI workflow, DuckDB for query execution, FastAPI for the backend, Deck.gl and MapLibre GL JS for rendering. The warehouse itself is built on Delta Lake — a living system, not a static file — designed to grow as new provincial datasets are onboarded.
This talk covers the architecture, the hard-won lessons (LLM hallucinations hitting production queries is a fun problem to debug), and a live demo against the actual provincial data warehouse. We'll show what the hexagons reveal about Zuid-Holland that spreadsheets never could — and what it looks like when a province can finally ask itself the right questions.
All code is open source. The pattern is replicable. The data is already yours.
Stack: H3 · DuckDB · LangGraph · FastAPI · Deck.gl · PDOK · Delta Lake

Use cases & applications
A02