All You Need To Know About Gab Data
The Gab data is a significant but difficult dataset. It comprises each private post and numerous private communications, in addition to being a corpus of Gab’s public discourse.
It would be a valuable social resource in simpler or more ordinary times. It’s also a record of the culture and exact remarks surrounding not only an uptick in extreme ideas and activities but also an attempted coup in 2021.
While the dataset is critical for comprehending recent and current events and serves as a significant historical archive, it also raises privacy concerns. Because of these concerns, as well as the existence of passwords and other personally identifiable information (PII), this dataset is now only available to media and researchers.
Description
In SQL format, there are 70 GB of Gab online posts, private posts, user profiles, hashed passwords for individuals, DMs, and raw passwords for groups, as well as over 70,000 text messages in over 19,000 chats with over 15,000 customers.
A series of Gab scrapes will be provided as a supplement during 2017 and 2018. The scrapes, which follow the same style as GabLeaks and date back to Gab’s beta, include posts that were later removed. We’re making them available to the public, journalists, and researchers as-is because they were all previously public posts with no private information. The scrapes are available for download via a torrent file or a magnet link.