/r/The_Donald Dataset
Dataset: The dataset contains 16,349,287 comments posted to /r/The_Donald by 342,731 participants between June 30th, 2015 and February 28th, 2017. The data is in BJSON format. The total size of the dataset is 5.7 GB uncompressed (2 GB compressed). You can use the dataset, provided you include a reference to our paper.
Paper: Mobilizing the Trump Train: Understanding Collective Action in a Political Trolling Community
Authors: C. Flores-Saviaga, B. Keegan, S. Savage
Proceedings: The International AAAI Conference on Web and Social Media (ICWSM) 2018
Dataset: http://bit.ly/2H64Lpc
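
One way to sanity-check a download against the comment and participant counts stated above is to stream the records and tally them. The sketch below is only an illustration under stated assumptions: it assumes the records can be read as one JSON object per line and that each record carries an "author" field, as is common for Reddit comment dumps. Neither assumption is confirmed by the description here, and a file in BJSON format may need a different decoder first. The function name summarize_comments and the author_field parameter are hypothetical.

```python
import json
from typing import Iterable, Tuple


def summarize_comments(lines: Iterable[str], author_field: str = "author") -> Tuple[int, int]:
    """Return (number of comments, number of distinct authors).

    Assumes one JSON object per line; the field name is an assumption.
    """
    n_comments = 0
    authors = set()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        n_comments += 1
        author = record.get(author_field)
        if author is not None:
            authors.add(author)
    return n_comments, len(authors)


if __name__ == "__main__":
    # Tiny in-memory sample standing in for the real file.
    sample = [
        '{"author": "user_a", "body": "first"}',
        '{"author": "user_b", "body": "second"}',
        '{"author": "user_a", "body": "third"}',
    ]
    print(summarize_comments(sample))
```

Because the function accepts any iterable of lines, an open file handle can be passed directly, so the full dataset is processed one line at a time rather than loaded into memory.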