Javier Pastor-Galindo, Mattia Zago, Pantaleone Nespoli, Sergio López Bernal, José A. Ruipérez Valiente, Alberto Huertas Celdrán, Manuel Gil Pérez, Gregorio Martínez Pérez, Félix Gómez Mármol
Data in Brief, vol. 32, pp. 1-10
Publication year: 2020

Abstract

The term social bots refer to software-controlled accounts that actively participate in the social platforms to influence public opinion toward desired directions. To this extent, this data descriptor presents a Twitter dataset collected from October 4th to November 11th, 2019, within the context of the Spanish general election. Starting from 46 hashtags, the collection contains almost eight hundred thousand users involved in political discussions, with a total of 5.8 million tweets.
The proposed data descriptor is related to the research article available at [1].

Its main objectives are: i) to enable worldwide researchers to improve the data gathering, organization, and preprocessing phases; ii) to test machine-learning-powered proposals; and, finally, iii) to improve state-of-the-art solutions on social bots detection, analysis, and classification.

Note that the data are anonymized to preserve the privacy of the users.

Throughout our analysis, we enriched the collected data with meaningful features in addition to the ones provided by Twitter. In particular, the tweets collection presents the tweets’ topic mentions and keywords (in the form of political bag-of-words), and the sentiment score. The users’ collection includes one field indicating the likelihood of one account being a bot. Furthermore, for those accounts classified as bots, it also includes a score that indicates the affinity to a political party and the followers/followings list.

Related Publications


The not yet exploited goldmine of OSINT: Opportunities, open challenges and future trends

JournalQ2
Javier Pastor-Galindo, Pantaleone Nespoli, Félix Gómez Mármol, Gregorio Martínez Pérez
IEEE Access, vol. 8, no. 1, pp. 10282-10304
Publication year: 2020

OSINT is the next Internet goldmine: Spain as an unexplored territory

Conference
Javier Pastor-Galindo, Pantaleone Nespoli, Félix Gómez Mármol, Gregorio Martínez Pérez
Fifth National Conference on Cybersecurity (JNIC 2019), ISBN 978-84-09-12121-2, pp. 102-109, Cáceres, Spain
Publication year: 2019

Co-Authors

This work would not have been possible without the inestimable contribution of:

  • Javier Pastor-Galindo
  • Mattia Zago
  • Pantaleone Nespoli
  • Sergio López Bernal
  • José A. Ruipérez Valiente
  • Alberto Huertas Celdrán
  • Manuel Gil Pérez
  • Gregorio Martínez Pérez

Javier Pastor-Galindo

Javier Pastor-Galindo

University of Murcia

Web

Mattia Zago

Mattia Zago

University of Murcia

Web

Pantaleone Nespoli

Pantaleone Nespoli

University of Murcia

Web

Sergio López Bernal

Sergio López Bernal

University of Murcia

Web

José A. Ruipérez Valiente

José A. Ruipérez Valiente

University of Murcia

Web

Alberto Huertas Celdrán

Alberto Huertas Celdrán

University of Murcia

Web

Manuel Gil Pérez

Manuel Gil Pérez

University of Murcia

Web

Gregorio Martínez Pérez

Gregorio Martínez Pérez

University of Murcia

Web

Citation

Javier Pastor-Galindo, Mattia Zago, Pantaleone Nespoli, Sergio López Bernal, José A. Ruipérez Valiente, Alberto Huertas Celdrán, Manuel Gil Pérez, Gregorio Martínez Pérez, Félix Gómez Mármol, «Twitter social bots: the 2019 Spanish general election data«, Data in Brief, vol. 32, pp 1-10, 2020

Journal Info & Impact Metrics

  • Journal: Data in Brief
  • ISSN: 2352-3409
  • Journal Citation Index (JCI): 0.2
  • Category: Multidisciplinary Sciences
  • Rank: 58/128
  • Quartile: Q2