FOSS4G 2022 general tracks

Stefano Campus

Stefano Campus is a civil engineer who works at the SDI (Spatial Data Infrastructure) of Regione Piemonte (an Italian local authority), contributing to all aspects related to the geographical infrastructure.
Another field of interest is natural disaster prevention, in terms of assessing and mitigating hazard and risk through a spatial approach the use of exclusively Free and Open Source Geographic software.
He is the Italian coordinator of QGIS translation and he is past President of the Italian Association for Free Geographical Information (GFOSS.it), the OSGeo Local Chapter, from 2013 to 2017 and actually he is the vice President.


Sessions

08-25
15:15
5min
A Python tool for QGIS for gender recognition in street directories
Stefano Campus

In the last years the attention for gender equality in all context has increased all over the world.
Nowadays the sensibility of Public Administrations towards the naming of streets, roads, squares and monuments after women has highlighted that, instead, the toponymy has always been oriented to the choice of male figures.
In this work we present a Python script for QGIS, that allows to verify if a proper name, contained in a street directory, is of male or female gender.
There are other Open Source projects that, starting from an address, verify the gender of the represented person; the most famous is the GeoChicas Project [1]; in Italy it is worth mentioning the "Toponomastica Femminile" Association [2] that manually verifies the streets dedicated to women, according to a predefined taxonomy (religious women, artists, etc.).
The goal of the present work is to automate the gender reconnaissance starting from a list of names; however, unlike GeoChicas that use as a base parameter a dictionary of names with which to compare the list, we propose to make a query of DBpedia via SPARQL in order to identify the subject and derive its gender.
If the address is the attribute of a spatial dataset, then it is possible to add a new attribute (the gender) to the vector layer table as a result of DBpedia query.
This approach overcomes language limitations (which would require differentiated dictionaries) and the ambiguities that some names would have (for example the nome "Andrea" is used as both a masculine and feminine name).
The script is created using the SPARQL language with a very simple structure, in which the triplet of data is constructed in order to obtain the gender from the name of a person through the query of DBpedia.
The script can be run in QGIS environment associating the data outputs directly to the geometry or even outside of QGIS and as a result you will have a list of "genders".
The process of relying on Wikipedia/DBpedia has the twofold advantage that, where the name dedicated to street exists, then the desired information is taken, the gender in our case, while if it missing it can be added or enriched.
The script is currently under validation and will be published in the dedicated git repository [3].

[1] https://github.com/geochicasosm
[2] https://www.toponomasticafemminile.com
[3] https://github.com/skampus/toponomasticafemminile

Use cases & applications
Room 4