UCL  IRIS
Institutional Research Information Service
Publication Detail
The Impact of Biases in the Crowdsourced Trajectories on the Output of Data Mining Processes
  • Publication Type:
    Conference
  • Authors:
    Basiri A, Haklay M, Gardner Z
  • Publisher:
    Association of Geographic Information Laboratories in Europe (AGILE)
  • Publication date:
    11/07/2018
  • Published proceedings:
    Proceedings of VGI-ALIVE - AnaLysis, Integration, Vision, Engagement
  • Name of conference:
    Association of Geographic Information Laboratories in Europe (AGILE) 2018
  • Conference place:
    Lund, Sweden
  • Conference start date:
    12/06/2018
  • Conference finish date:
    15/06/2018
Abstract
The emergence of the Geoweb has provided an unprecedented capacity for professional and non-professional participants to generate and share digital content through crowdsourcing projects such as OpenStreetMap (OSM) or Wikimapia. Despite the success of such projects, the impact of the biases inherent in the ‘crowd’ and/or in the ‘crowdsourced’ data it produces is not well explored. In this paper we examine the impact of biased trajectory data on the output of a spatio-temporal data mining process. To do so, we conducted an experiment in which biases were intentionally added to the input data: the input trajectories were divided into training and control datasets, but not randomly (as they would be in standard data mining procedures). Instead, they were divided by time of day and week, weather conditions, contributors’ gender, and the spatial and temporal density of trajectories in 1 km grids. The accuracy of the predictive models was then measured (for both training and control data), and the biases were gradually moderated to see how the accuracy of the very same model changes with respect to the biased input data. We show that the same data mining technique yields different results in terms of the nature of the clusters and the identified attributes.
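The non-random split described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`biased_split`, `group_share`, `is_daytime`), the `hour` field, and the `bias` parameter are all hypothetical, chosen only to show how a training/control split might be skewed by one attribute (here, time of day) and then gradually moderated towards a random split.

```python
import random


def biased_split(records, in_group, bias, train_fraction=0.5, seed=0):
    """Split records into (training, control) with a tunable bias.

    bias=1.0 puts in-group records into the training set first;
    bias=0.0 reduces to an ordinary random split.
    All names and parameters here are illustrative assumptions.
    """
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)

    def rank(rec):
        # An in-group record is favoured with probability `bias`.
        favoured = in_group(rec) and rng.random() < bias
        return 0 if favoured else 1

    # sorted() is stable, so the shuffled order is kept within each rank.
    ranked = sorted(shuffled, key=rank)
    cut = int(len(ranked) * train_fraction)
    return ranked[:cut], ranked[cut:]


def group_share(subset, in_group):
    """Fraction of a subset that belongs to the favoured group."""
    if not subset:
        return 0.0
    return sum(1 for rec in subset if in_group(rec)) / len(subset)


def is_daytime(rec):
    # Hypothetical attribute: trajectory start hour, 0-23.
    return 6 <= rec["hour"] < 18


if __name__ == "__main__":
    # Synthetic stand-in for trajectory records; real data would carry
    # geometry, weather, contributor attributes, and so on.
    trajectories = [{"id": i, "hour": i % 24} for i in range(96)]
    for bias in (1.0, 0.75, 0.5, 0.25, 0.0):
        train, control = biased_split(trajectories, is_daytime, bias)
        print(bias, group_share(train, is_daytime), group_share(control, is_daytime))
```

In an experiment of the kind the abstract outlines, a model would be fitted on the training set at each bias level and its accuracy compared on the training and control sets; the model and accuracy measure are left out here because the abstract does not specify them.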
UCL Researchers
Author
Centre for Advanced Spatial Analysis
Author
Dept of Geography