Detecting and reducing biases in cellular-based mobility data sets Articles uri icon

publication date

  • September 2018

issue

  • 10

volume

  • 20

international standard serial number (ISSN)

  • 1099-4300

abstract

  • Correctly estimating the features characterizing human mobility from mobile phone traces is a key factor to improve the performance of mobile networks, as well as for mobility model design and urban planning. Most related works found their conclusions on location data based on the cells where each user sends or receives calls or messages, data known as Call Detail Records (CDRs). In this work, we test if such data sets provide enough detail on users' movements so as to accurately estimate some of the most studied mobility features. We perform the analysis using two different data sets, comparing CDRs with respect to an alternative data collection approach. Furthermore, we propose three filtering techniques to reduce the biases detected in the fraction of visits per cell, entropy and entropy rate distributions, and predictability. The analysis highlights the need for contextualizing mobility results with respect to the data used, since the conclusions are biased by the mobile phone traces collection approach.

keywords

  • human mobility; cell-based location; ping-pong effect; mobility data sets entropy;
    mobility data sets predictability