Topics: Introduction to R and RStudio. Packages. Working directory. Data frames and functions.

Registration: Sign-up Link.

Reference material

Code

Topics: Importing data (CSV, Stata, SAS). Data wrangling, aggregation, and filtering (dplyr).

Registration: Sign-up Link.

Reference material

Code

Optional Readings

Topics: Overview of the tidyverse: tidyr, dplyr, ggplot2, readr, purrr.

Registration: Sign-up Link.

Reference material

Code

  • Grolemund, G., & Wickham, H. (2017). Tidy Data. R for Data Science.

  • Soltoff, B. (2017). dplyr in brief. Computing for the Social Sciences.

Optional Readings

Topics: R Markdown and bookdown. Introduction to Git.

Registration: Sign-up Link.

Reference material

Optional Readings

Topics: Introduction to visualization. ggplot2. Interactive visualizations using htmlwidgets.

Reference material

Code

Topics: Defining APIs. Twitter’s public APIs (REST and Streaming). Facebook API.

Reference material

Optional Readings

Topics: Basic concepts in social network analysis. Centrality measures. Network visualization.

Reference material

Optional Readings

Code

Topics: Predictive versus explanatory analysis. Evaluation: holdout and cross-validation. Supervised and unsupervised machine learning. Regularization.

Reference material

Code

Topics: tidytext. Bag of words. Word frequencies, TF-IDF and Zipf’s Law. Dictionary-based methods.

Reference material

Optional Readings

Topics: Text as data. Supervised models (text classification) and unsupervised models (topic models, word scaling, and word embeddings).

Reference material

Optional Readings

Code