FOSS4G 2022 academic track

Vinayaraj

I currently work as a data scientist at Rakuten Group, Inc., Japan. I have been working on GeoAI-related topics for 2-3 years. I have been an OSGeo charter member since 2017 and have developed modules for GRASS GIS. I received a Ph.D. in geoinformatics from Osaka City University, Osaka, Japan, in 2017, under a Japanese government scholarship from the Ministry of Education, Culture, Sports, Science and Technology, Japan. I also worked as a postdoctoral researcher at the National Institute of Advanced Industrial Science and Technology, Japan. From 2009 to 2011.

Sessions

08-24
12:05
5min
Multi-branch Deep learning Based Transport Mode Detection using Weakly Supervised Labels
Vinayaraj

Mobility data based on global positioning system (GPS) tracking have been widely used in many areas, including analyzing travel patterns, investigating transport safety and efficiency, and evaluating travel impacts. Transport Mode Detection (TMD) is essential to understanding mobility within the transport system. A TMD model assigns a GPS point or a GPS trajectory to a particular transport mode based on the user's activity and medium of travel [1]. However, prediction becomes more complex as the number of modes to be predicted increases. For example, it is comparatively easy to predict whether a user is 'static', 'slow moving', or 'fast moving', but hard to predict detailed transport modes such as walk, bike, car, bus, train, or boat. Therefore, this study proposes a multi-branch deep learning-based TMD model that can predict multi-class transport modes.

Two major challenges need to be addressed in order to generate a state-of-the-art deep learning model.

The first is to prepare ground-truth data. Insufficient open ground-truth data are available for transport modes in Japan. Hence, we propose a transport mode label generation approach using Snorkel [2]. Snorkel is a first-of-its-kind weak supervision system that enables users to train state-of-the-art models without hand labeling any training data. Instead, experts write labeling functions that express arbitrary heuristics, based on logic drawn from understanding the data and the physical actions they represent. In this study, we used Snorkel to generate the ground-truth data for transport mode. Initially, we used publicly available road networks, railway networks, bus routes, etc., to create road, bus, and train labels by overlaying GPS points on these transportation networks. However, the road, bus, and train classes often overlap, especially in urban areas. Hence, we introduced a boolean (True/False) soft-labeling function, in which the same GPS point may have True values for more than one network, such as both road and railway.
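
The soft-labeling idea can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the planar coordinates in metres, the 15 m tolerance, the speed thresholds, and all function names are hypothetical, and a simple majority vote stands in for Snorkel's generative label model.

```python
import math

ABSTAIN = -1
WALK, CAR, TRAIN = 0, 1, 2


def point_segment_distance(p, a, b):
    """Distance from point p to segment a-b, all in planar metres."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0.0:
        return math.hypot(px - ax, py - ay)
    # Project p onto the segment and clamp to its end points.
    t = ((px - ax) * dx + (py - ay) * dy) / seg_len_sq
    t = max(0.0, min(1.0, t))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))


def soft_labels(p, networks, tolerance_m=15.0):
    """One boolean per network; several may be True for the same point."""
    return {
        name: any(point_segment_distance(p, a, b) <= tolerance_m
                  for a, b in segments)
        for name, segments in networks.items()
    }


# Hypothetical labeling functions: each votes for a class or abstains.
def lf_slow_is_walk(pt):
    return WALK if pt["speed_mps"] < 2.0 else ABSTAIN


def lf_fast_on_railway(pt):
    return TRAIN if pt["on"]["railway"] and pt["speed_mps"] > 8.0 else ABSTAIN


def lf_moving_on_road_only(pt):
    on = pt["on"]
    if on["road"] and not on["railway"] and pt["speed_mps"] >= 2.0:
        return CAR
    return ABSTAIN


def majority_vote(pt, labeling_functions):
    """Stand-in for a label model: most common non-abstaining vote."""
    votes = [v for v in (lf(pt) for lf in labeling_functions) if v != ABSTAIN]
    if not votes:
        return ABSTAIN
    return max(set(votes), key=votes.count)


# Toy networks: a road and a railway running 10 m apart.
networks = {
    "road": [((0.0, 0.0), (100.0, 0.0))],
    "railway": [((0.0, 10.0), (100.0, 10.0))],
}
lfs = [lf_slow_is_walk, lf_fast_on_railway, lf_moving_on_road_only]
```

In this toy setup, a point midway between the two lines receives True for both networks, so the network overlay alone cannot decide the class and the speed-based heuristics break the tie.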

Second, we derived mobility-related features from the raw GPS data, which typically consist of latitude, longitude, and timestamps. From these we generated point-level features such as speed, speed difference, acceleration, acceleration difference, initial bearing, and bearing difference. We also generated trajectory-level features such as average speed and average acceleration.
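
As a rough illustration of how such features might be computed from (latitude, longitude, timestamp) triples, the sketch below uses the haversine distance and the standard initial-bearing formula. The exact feature definitions are assumptions: differences are taken between consecutive segments, the first segment's differences are set to zero, and the function names are hypothetical.

```python
import math

EARTH_RADIUS_M = 6_371_000.0


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two lat/lon points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def initial_bearing_deg(lat1, lon1, lat2, lon2):
    """Initial bearing from point 1 to point 2, in degrees within [0, 360)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dlam = math.radians(lon2 - lon1)
    y = math.sin(dlam) * math.cos(phi2)
    x = (math.cos(phi1) * math.sin(phi2)
         - math.sin(phi1) * math.cos(phi2) * math.cos(dlam))
    return (math.degrees(math.atan2(y, x)) + 360.0) % 360.0


def point_features(track):
    """track: list of (lat, lon, unix_seconds) sorted by time."""
    rows = []
    prev = None  # previous row, used for the difference features
    for (lat1, lon1, t1), (lat2, lon2, t2) in zip(track, track[1:]):
        dt = t2 - t1
        if dt <= 0:
            continue  # skip duplicate or out-of-order fixes
        speed = haversine_m(lat1, lon1, lat2, lon2) / dt
        bearing = initial_bearing_deg(lat1, lon1, lat2, lon2)
        speed_diff = speed - prev["speed"] if prev else 0.0
        accel = speed_diff / dt
        accel_diff = accel - prev["acceleration"] if prev else 0.0
        # Wrap the bearing change into [-180, 180).
        bearing_diff = (
            ((bearing - prev["bearing"] + 180.0) % 360.0) - 180.0
            if prev else 0.0
        )
        prev = {
            "speed": speed, "speed_diff": speed_diff,
            "acceleration": accel, "acceleration_diff": accel_diff,
            "bearing": bearing, "bearing_diff": bearing_diff,
        }
        rows.append(prev)
    return rows


def trajectory_features(rows):
    """Trajectory-level averages over the point-level rows."""
    if not rows:
        return {"avg_speed": 0.0, "avg_acceleration": 0.0}
    n = len(rows)
    return {
        "avg_speed": sum(r["speed"] for r in rows) / n,
        "avg_acceleration": sum(r["acceleration"] for r in rows) / n,
    }
```

For example, `point_features([(0.0, 0.0, 0), (0.0, 0.001, 10), (0.0, 0.003, 20)])` describes a track heading due east along the equator that doubles its speed between the first and second segment.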

The transportation network-based soft labels and the other mobility features are used to define labeling functions in Snorkel. These labeling functions are used to create ground-truth labels with a generative machine learning model, using a portion of the GPS data. The generated labels (walk, cycle, bus, car, train, boat/ship) were then used to train the proposed deep learning model. The model has two branches: raw GPS latitude and longitude values are used in one, and the derived mobility features in the other. We used 3 fully connected hidden layers for the raw GPS data (lat/lon) and 4 fully connected hidden layers for the mobility features. The features derived from the two branches are concatenated and passed through 3 further fully connected hidden layers, with softmax cross-entropy as the loss function. The proposed deep learning model has 108,614 trainable parameters and uses Adam as the optimizer. This two-branch structure achieves better accuracy because it combines the raw data with the derived mobility features in the network. One example of the benefit of this approach is the network's ability to relate GPS coordinates to road driving classes, thus inherently inferring that a location is on a road. Note that many of the inferences that improve classification accuracy could also be obtained through dramatically more advanced pre-processing that builds additional features. However, that approach is more time-consuming and could never capture all the potential inferences that an unbiased set of deep learning layers can inherently extract.
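
The two-branch layout described above can be sketched as a dependency-free forward pass. This is only an illustration of the structure, not the trained model: the layer widths, the choice of 8 input features (6 point-level plus 2 trajectory-level), the ReLU activations, the random initialization, and the assumption that coordinates are pre-scaled to roughly [0, 1] are all hypothetical, and the widths do not reproduce the 108,614 parameters reported. The original work used TensorFlow and Keras, with Adam for training, which is omitted here.

```python
import math
import random


def init_stack(rng, sizes):
    """Random weights and zero biases for consecutive dense layers."""
    stack = []
    for n_in, n_out in zip(sizes, sizes[1:]):
        scale = 1.0 / math.sqrt(n_in)
        weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
                   for _ in range(n_out)]
        stack.append((weights, [0.0] * n_out))
    return stack


def run_stack(x, stack, relu_last=True):
    """Apply dense layers in order; ReLU after each (optionally not the last)."""
    for i, (weights, biases) in enumerate(stack):
        x = [sum(w * xi for w, xi in zip(row, x)) + b
             for row, b in zip(weights, biases)]
        if relu_last or i < len(stack) - 1:
            x = [max(0.0, v) for v in x]
    return x


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Softmax cross-entropy loss for one example with an integer label."""
    return -math.log(probs[label])


CLASSES = ["walk", "cycle", "bus", "car", "train", "boat/ship"]

rng = random.Random(0)
coord_branch = init_stack(rng, [2, 16, 16, 16])         # 3 hidden layers
feature_branch = init_stack(rng, [8, 32, 32, 32, 32])   # 4 hidden layers
head = init_stack(rng, [16 + 32, 32, 32, 32, len(CLASSES)])  # 3 hidden + output


def forward(latlon, features):
    """Concatenate both branch outputs and classify into CLASSES."""
    merged = run_stack(latlon, coord_branch) + run_stack(features, feature_branch)
    logits = run_stack(merged, head, relu_last=False)
    return softmax(logits)


# Usage with made-up, pre-scaled inputs.
probs = forward([0.36, 0.14], [0.5] * 8)
loss = cross_entropy(probs, CLASSES.index("car"))
```

Keeping the coordinate branch separate from the feature branch lets each input type pass through its own layers before the concatenated representation reaches the classification head.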

We evaluated the trained model's effectiveness in two ways. First, we compared the results against the popular XGBoost classifier; our model achieved over 5% higher accuracy on the benchmark Geolife dataset [3]. Second, we collected smartphone-based GPS trajectories for multiple modes of transportation, recorded by testers in Bengaluru, India, and Tokyo, Japan. With this new absolute ground-truth data, we compared the predicted classes from the operating system-provided activity classifications, the above XGBoost model, and our own model. Our experiments show promising results, with improved accuracy and an increase in the number of labeled data points. Notably, the iOS [4] and Android [5] built-in activity recognition tools provide 'automotive' as a single class, while our proposed model distinguishes the automotive classes car, bus, and train with improved accuracy. This work depends entirely on Free and Open Source Solutions (FOSS) for data preparation, mobility feature generation, deep learning model training, and big data computing. These include geospatial libraries such as geopandas, shapely, and rtree; the weak-label generation platform snorkel; the deep learning platforms tensorflow and keras; and big-data computing platforms such as pyspark, hadoop, and hive.

Room Hall 3A