Methods for Introducing Geospatial Awareness to Large Language Models with Integrity, Provenance, and Trust through Open Standards and Open Source
2026-09-02 , Conference Management Room5

While LLMs excel at synthesizing text, they lack geospatial awareness and cannot reason over spatial networks. This talk demonstrates how Geo-GraphRAG and DGGS, grounded in open standards, enable geospatial reasoning with transparent, traceable results.


Although Large Language Models (LLMs) excel at natural language processing and general reasoning, they lack the cross-sectoral, domain-specific, and up-to-date geospatial knowledge needed for decision support in disaster and crisis response, public health, economic analysis, and security applications. National Spatial Data Infrastructures (NSDIs) contain a wealth of such data, but they are not published in interoperable formats suitable for use by LLMs. Incompatible representations of the same locations across systems make feature-level integration (i.e., common geographies) prohibitively labor-intensive, particularly in lower-resourced settings.

This talk presents two complementary methods for introducing geospatial awareness into LLMs by shifting interoperability to the point of data publication rather than relying on point-to-point integrations. Discrete Global Grid Systems (DGGS)—a new Open Geospatial Consortium (OGC) standard—implement the concept of common geographies by quantizing data into hierarchical grid cells identified by standardized zone IDs. When datasets are published using the same DGGS reference system, they become immediately interoperable. LLM agents can translate natural language questions into DGGS API calls with CQL2 parameters, enabling queries such as identifying areas where flood levels exceed specified thresholds.

Spatial Knowledge Graphs (SKGs) provide a complementary capability by representing networks of interconnected geographic features and their semantic relationships across domains. Through Geo-GraphRAG, LLM agents can translate natural language questions into GeoSPARQL queries, enabling reasoning over feature networks such as infrastructure, populations, and services. When combined with DGGS, these approaches enable integrated analysis linking semantic networks with aggregated and statistical data, while maintaining transparency and traceability in model outputs.

However, no standard currently exists for publishing SKGs with interoperability by common geography. This makes maintaining referential integrity across evolving datasets complex and resource-intensive. Interoperable Spatial Knowledge Graphs (iSKGs) address this challenge by enabling geo-objects to be aligned at publication, allowing graphs from different organizations to be integrated on demand in a federated architecture. By incorporating metadata standards such as DCAT and leveraging OGC Building Blocks, provenance, traceability, and shared geo-ontologies can be established to support trusted, reusable knowledge. This talk will demonstrate how these capabilities can be implemented using open-source software. The work builds on collaboration with the OGC community through disaster and resilience research pilots, as well as contributions developed in partnership with the U.S. Army Corps of Engineers Civil Works Division.

By shifting the paradigm to interoperability at the point of publication, this approach creates a force multiplier—enabling local talent to build and sustain LLM-based geospatial applications by providing access to integrated, trustworthy geospatial data that has historically been out of reach.


Level of technical complexity: 2 - intermediate Indicate what is (are) the open source project(s) essential in your talk:

DGGAL – a Discrete Global Grid Abstraction Library
https://github.com/ecere/dggal

Apache Jena
https://jena.apache.org

pydggsapi - an open-source OGC API DGGS web service
https://github.com/LandscapeGeoinformatics/pydggsapi

GeoPrism – a Common Geo-Registry implementation for managing interlinked data in graphs as they change over time
https://github.com/terraframe/geoprism-registry

I make my conference contribution available under the CC BY 4.0 license. The conference contribution comprises the abstract, the text contribution for the conference proceedings, the presentation materials as well as the video recording and live transmission of the presentation: