FOSS4G 2023

Yunzhi Lin

Yunzhi is a Maps Quality Analyst at Meta. She leads quality control of machine learning data, 3D data and vandalism detection outputs, and supports various community communications


Sessions

06-29
11:30
30min
A review of Mapillary Traffic Sign Data Quality and OpenStreetMap Coverage
Yunzhi Lin, Said Turksever

Traffic signs are a key feature for navigating and managing traffic safely, affecting all of us on a daily basis. However, traffic sign datasets are lacking on open government data portals as well as OpenStreetMap (OSM).

Mapillary’s computer vision capabilities can extract more than 1,500 classes of traffic signs globally from street-level imagery. Generated traffic signs are available on iD Editor, Rapid and JOSM Mapillary plugin to enrich OpenStreetMap data.

Our team wanted to know how the accuracy of traffic signs detected by Mapillary compared with the reality on the ground (the ground truth). To answer this question we collected more than thousands ground truth data in San Francisco and used this information to produce the recall, precision, and positional accuracy of our machined generated traffic sign data. This provided some interesting insights in OpenStreetMap and the level of completeness and gaps of that dataset.

In this talk, we will cover Mapillary’s traffic sign extraction capabilities, Mapillary generated traffic sign data against ground truth data and OSM’s traffic sign coverage in San Francisco’s downtown. We will be also addressing how data quality can be improved using various data collection techniques and the role of post-processing with Structure from Motion and control points annotations.

Open Data
UBT C / N111 - Second Floor
06-30
15:00
30min
Building heights: From open data to open maps
Yunzhi Lin

In the US, less than 20% of OpenStreetMap (OSM) buildings have a height tag (less than 10% globally). Providing buildings with height tags helps several use cases including 3D map visualization. At Meta, we have begun using open mapping data to estimate building heights and providing them back to the community. At the end of 2022, we used data from city GIS departments to estimate millions of heights and release them to the public through the Daylight Map Distribution (https://daylightmap.org/2022/12/02/building-heights.html). In 2023, we are using publicly available USGS/3DEP aerial lidar and releasing to the public through the Overture Maps Foundation – processing millions of square kilometers. This talk will cover the challenges, algorithm, QA process, and accuracy metrics from this effort. It is our hope that over the course of the year, we can estimate and publish heights for the majority of the buildings in the US and begin work on non-US open data sources as well.

Open Data
UBT C / N110 - Second Floor