• Skip to primary navigation
  • Skip to main content

Social Metrics

Social Media, Organizations, and Academic Research

  • Home
  • Blog
  • Publications
  • Teaching
  • Python
  • About Me
  • Show Search
Hide Search

Python Data Analytics Tutorials

The bulk of my research involves some degree of ‘Big Data’ — such as datasets with a million or more tweets. Getting these data prepped for analysis can involve massive amounts of data manipulation — anything from aggregating data to the daily or organizational level, to merging in additional variables, to generating data required for social network analysis. For all such steps I now almost exclusively use Python’s PANDAS library (‘Python Data Analysis Library’). In conjunction with the Jupyter Notebook interactive computing framework and packages such as NetworkX, you will have a powerful set of analysis tools at your disposal. This page contains links to the tutorials I have created to help you learn data analytics in Python. I also have a page with shorter (typically one-liner) data analytic code bytes.

Data Collection

  • Python: Where to Start?
  • Running your first code
  • Four ways to run your code
  • Setting up Your Computer to Use My Python Code
  • Setting up Access to the Twitter API
  • Using Your Twitter API Key
  • Using Python to Grab Twitter User Data
  • Tag Cloud Tutorial
  • Downloading Hashtag Tweets
  • Downloading Tweets by a List of Users
  • Downloading Tweets-Take II
  • SQLite vs. MongoDB for Big Data
  • Downloading Tweets, Take III – MongoDB

Data Analysis

  • Analyzing Big Data with Python PANDAS (Overview)
  • Set up Jupyter, Import Twitter Data and Select Cases
  • Aggregating and Analyzing Data by Twitter Account (coming soon)
  • Analyzing Twitter Data by Time Period (coming soon)
  • Analyzing Hashtags (coming soon)
  • Generating New Variables (coming soon)
  • Producing a Summary Statistics Table for Publication (coming soon)
  • Analyzing Audience Reaction on Twitter (coming soon)
  • Running, Interpreting, and Outputting Logistic Regression (coming soon)

I hope you have found this helpful. If so, please spread the word, and happy coding!

Share Button

Written by:
Gregory Saxton
Published on:
March 20, 2018
Thoughts:
No comments yet

Categories: Big Data, Featured, jupyter_notebook, pandas, research, Twitter

Explore more

Contact Information

Gregory D. Saxton
Schulich School of Business
York University
Toronto, ON
gsaxton@yorku.ca

Footer

Copyright © 2025 · Navigation Pro on Genesis Framework · WordPress · Log in

Keep in Touch

  • Email
  • Facebook
  • GitHub
  • LinkedIn
  • Twitter
  • About Me
  • Media Mentions
  • Contact