2026-09-02 –, Conference Management Room3
Japan’s cadastral land-parcel open data ships as massive proprietary XML files. This talk is about how I made a Rust port of the official Python converter in Rust, rewriting and optimizing, cutting nationwide conversion down from hours to minutes.
Japan’s Ministry of Justice has released nationwide cadastral land parcel data every year since 2022. The data is provided in a proprietary XML format, so it must be converted before it can be used in common GIS tools.
The Digital Agency released a Python converter, but the full dataset is huge (around 90–100GB). Converting everything can take 6 to 8 hours, which slows down repeatable builds and downstream workflows.
This talk explains the XML format, how I ported the converter to Rust, then reworked it for maximum speed using profiling, streaming processing, and parallelism. The current Rust tool can convert the entire dataset in about 15 minutes.
Repository: https://github.com/KotobaMedia/mojxml-rs
Links in Japanese
Indicate what is (are) the open source project(s) essential in your talk:The tool the talk is about: https://github.com/kotobaMedia/mojxml-rs
- FlatGeobuf (https://flatgeobuf.org/) and its Rust implementation (https://crates.io/crates/flatgeobuf)
- The GeoRust Project (https://georust.org/)
Born Tokyo, raised US, living in Yakushima. Founder KotobaMedia, Inc - working on creating high-performance geospatial tools.