
The Library
Abundant topological outliers in social media data and their effect on spatial analysis
Tools
Westerholt, René, Steiger, Enrico, Resch, Bernd and Zipf, Alexander (2016) Abundant topological outliers in social media data and their effect on spatial analysis. PLoS One, 11 (9). e0162360. doi:10.1371/journal.pone.0162360 ISSN 1932-6203.
|
PDF
WRAP-abundant-topological-outliers-social-media-Westerholt-2016.PDF - Published Version - Requires a PDF viewer. Available under License Creative Commons Attribution 4.0. Download (8Mb) | Preview |
Official URL: http://dx.doi.org/10.1371/journal.pone.0162360
Abstract
Twitter and related social media feeds have become valuable data sources to many fields of research. Numerous researchers have thereby used social media posts for spatial analysis, since many of them contain explicit geographic locations. However, despite its widespread use within applied research, a thorough understanding of the underlying spatial characteristics of these data is still lacking. In this paper, we investigate how topological outliers influence the outcomes of spatial analyses of social media data. These outliers appear when different users contribute heterogeneous information about different phenomena simultaneously from similar locations. As a consequence, various messages representing different spatial phenomena are captured closely to each other, and are at risk to be falsely related in a spatial analysis. Our results reveal indications for corresponding spurious effects when analyzing Twitter data. Further, we show how the outliers distort the range of outcomes of spatial analysis methods. This has significant influence on the power of spatial inferential techniques, and, more generally, on the validity and interpretability of spatial analysis results. We further investigate how the issues caused by topological outliers are composed in detail. We unveil that multiple disturbing effects are acting simultaneously and that these are related to the geographic scales of the involved overlapping patterns. Our results show that at some scale configurations, the disturbances added through overlap are more severe than at others. Further, their behavior turns into a volatile and almost chaotic fluctuation when the scales of the involved patterns become too different. Overall, our results highlight the critical importance of thoroughly considering the specific characteristics of social media data when analyzing them spatially.
Item Type: | Journal Article | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
Subjects: | G Geography. Anthropology. Recreation > G Geography (General) H Social Sciences > HM Sociology |
|||||||||
Divisions: | Faculty of Social Sciences > Centre for Interdisciplinary Methodologies | |||||||||
Library of Congress Subject Headings (LCSH): | Geospatial data -- Research, Twitter (Firm) -- Sociological aspects -- Methodology | |||||||||
Journal or Publication Title: | PLoS One | |||||||||
Publisher: | Public Library of Science | |||||||||
ISSN: | 1932-6203 | |||||||||
Official Date: | 9 September 2016 | |||||||||
Dates: |
|
|||||||||
Volume: | 11 | |||||||||
Number: | 9 | |||||||||
Article Number: | e0162360 | |||||||||
DOI: | 10.1371/journal.pone.0162360 | |||||||||
Status: | Peer Reviewed | |||||||||
Publication Status: | Published | |||||||||
Access rights to Published version: | Open Access (Creative Commons) | |||||||||
Date of first compliant deposit: | 4 March 2019 | |||||||||
Date of first compliant Open Access: | 4 March 2019 | |||||||||
RIOXX Funder/Project Grant: |
|
|||||||||
Related URLs: |
Request changes or add full text files to a record
Repository staff actions (login required)
![]() |
View Item |
Downloads
Downloads per month over past year