08-25, 09:30–10:00 (Europe/Rome), Room 4
Come have a look under the covers at the data structures that enable geospatial and multi-dimensional indexing and search at massive scale in Apache Lucene and OpenSearch. This talk will cover not only the indexing structures considered and ultimately implemented in the Apache Lucene open source project, but also the exceptional performance improvements and centimeter spatial accuracy obtained in the latest release. As a bonus, this talk will cover new and upcoming spatial analysis aggregations and processing available in the OpenSearch open source project.
From tessellation to multi-dimensional encoding and block KD trees, this talk will cover the algorithms and data structures written and committed to the following open source projects:
Apache Lucene (specifically the release of BKD-based geo indexing: https://issues.apache.org/jira/browse/LUCENE-8396)
Performance benchmarks for Lucene Spatial Indexing: https://home.apache.org/~mikemccand/geobench.html
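To make the link between multi-dimensional encoding and centimeter accuracy concrete, here is a minimal sketch of quantizing latitude and longitude into signed 32-bit integers, the kind of fixed-width value a block KD tree can index. It is an illustration under stated assumptions, not Lucene's actual code: the function names are hypothetical, and the figure of roughly 111,320 meters per degree is an approximation that holds for latitude and for longitude at the equator.

```python
import math

# Hypothetical sketch: map coordinates onto a 32-bit integer grid.
LAT_SCALE = (1 << 32) / 180.0   # grid cells per degree of latitude
LON_SCALE = (1 << 32) / 360.0   # grid cells per degree of longitude
INT32_MAX = (1 << 31) - 1
METERS_PER_DEGREE = 111_320.0   # approximation (assumption)


def encode_lat(lat: float) -> int:
    """Quantize a latitude in [-90, 90] to a signed 32-bit cell index."""
    if not -90.0 <= lat <= 90.0:
        raise ValueError(f"latitude out of range: {lat}")
    # Clamp so that exactly 90.0 falls into the topmost cell.
    return min(math.floor(lat * LAT_SCALE), INT32_MAX)


def encode_lon(lon: float) -> int:
    """Quantize a longitude in [-180, 180] to a signed 32-bit cell index."""
    if not -180.0 <= lon <= 180.0:
        raise ValueError(f"longitude out of range: {lon}")
    return min(math.floor(lon * LON_SCALE), INT32_MAX)


def decode_lat(cell: int) -> float:
    """Return the lower edge of the latitude cell, in degrees."""
    return cell / LAT_SCALE


def decode_lon(cell: int) -> float:
    """Return the lower edge of the longitude cell, in degrees."""
    return cell / LON_SCALE


# Cell sizes in meters bound the worst-case quantization error.
lat_cell_m = (180.0 / (1 << 32)) * METERS_PER_DEGREE
lon_cell_m = (360.0 / (1 << 32)) * METERS_PER_DEGREE

if __name__ == "__main__":
    print(f"latitude cell:  {lat_cell_m * 1000:.2f} mm")
    print(f"longitude cell: {lon_cell_m * 1000:.2f} mm at the equator")
```

With 32 bits per dimension, each cell is under one centimeter on a side, so a point decoded from its integer form lands within a centimeter of the original.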
Finally, we will discuss the future of the project, including existing and evolving support for custom coordinate reference systems and projections, spatial regression modeling and statistics, and spatial visualizations with OpenSearch Dashboards.
Nicholas Knize serves as a committer and PMC member for the Apache Lucene and Solr projects. He is currently a Principal Engineer for Amazon Search Services. He spent 10 years as a government contractor working on highly scalable spatial database systems, specifically for wide area motion imagery surveillance applications in the intelligence community. He has served on the Motion Imagery Standards Board and multiple open source conference committees. He has a Bachelor of Science and a Master of Science in Computer Science and a PhD in Geospatial Information Sciences, and currently resides in Lucas, TX.