Social Media Data
The Interdisciplinary Research Collaboratory supports a number of tools for acquiring and analyzing social media data. Feel free to schedule a consultation if these guides do not answer your questions.
A browser-based tool for harvesting and analyzing Tweets, as well as other social media sources such as Reddit, WordPress.com blogs, and select public Facebook posts. Request access to BrandWatch
A Free and Open Source Python script that uses Twitter's public API to harvest Tweets and their associated metadata
A Chrome browser plug-in that captures Tweets in a format usable in NVivo qualitative data analysis software. Download NCapture from the Chrome store
- Everything we know about Twitter
An off-site guide that compiles what we know about Twitter.
- Text Data Mining
Social media data analysis can be considered a type of text data mining. The Library has a variety of other sources of texts that can be mined, and APIs that can be used to compile your own full-text datasets.
There are also online sources of pre-existing datasets:
Harvard CGA Geotweet Archive v2.0
Extending back to 2010 and updated daily, the Harvard Center for Geographic Analysis (CGA) Geotweet Archive contains more than 10 billion tweets. Extractions require reimbursement to CGA.
Copyright © 2008-2019 The Regents of the University of California, All Rights Reserved.