Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Comparing methods for creating a national random sample of twitter users

Alizadeh, Meysam; Zare, Darya; Samei, Zeynab; Alizadeh, Mohammadamin; Kubli, Maël; Aliahmadi, Mohammadhadi; Ebrahimi, Sarvenaz; Gilardi, Fabrizio (2024). Comparing methods for creating a national random sample of twitter users. Social Network Analysis and Mining, 14:160.

Abstract

Twitter data has been widely used by researchers across various social and computer science disciplines. A common aim when working with Twitter data is the construction of a random sample of users from a given country. However, while several methods have been proposed in the literature, their comparative performance is mostly unexplored. In this paper, we implement four common methods to create a random sample of Twitter users in the US: 1% Stream, Bounding Box, Location Query, and Language Query. Then, we compare these methods according to their tweet- and user-level metrics as well as their accuracy in estimating the US population. Our results show that users collected by the 1% Stream method tend to have more tweets, tweets per day, followers, and friends, a fewer number of likes, are younger accounts, and include more male users compared to the other three methods. Moreover, it achieves the minimum error in estimating the US population. However, the 1% Stream method is time-consuming, cannot be used for the past time frames, and is not suitable when user engagement is part of the study. In situation where these three drawbacks are important, our results support the Bounding Box method as the second-best method.

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Political Science
08 Research Priority Programs > Digital Society Initiative
Dewey Decimal Classification:320 Political science
Scopus Subject Areas:Physical Sciences > Information Systems
Social Sciences & Humanities > Communication
Physical Sciences > Media Technology
Physical Sciences > Human-Computer Interaction
Physical Sciences > Computer Science Applications
Language:English
Date:14 August 2024
Deposited On:09 Dec 2024 10:05
Last Modified:30 Apr 2025 01:36
Publisher:Springer
ISSN:1869-5450
OA Status:Hybrid
Free access at:Publisher DOI. An embargo period may apply.
Publisher DOI:https://doi.org/10.1007/s13278-024-01327-5
Project Information:
  • Funder: H2020
  • Grant ID: 883121
  • Project Title: PRODIGI - Problem Definition in the Digital Democracy
  • Funder: University of Zurich
  • Grant ID:
  • Project Title:
Download PDF  'Comparing methods for creating a national random sample of twitter users'.
Preview
  • Content: Published Version
  • Language: English
  • Licence: Creative Commons: Attribution 4.0 International (CC BY 4.0)

Metadata Export

Statistics

Citations

Altmetrics

Downloads

2 downloads since deposited on 09 Dec 2024
2 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications