LATIS offers a series of workshops that are free and open to all faculty and graduate students. Join our LATIS Research Workshops Google Group to be the first to learn about workshops. You can view the slides and materials from past workshops at the LATIS Workshop Materials website.

Summer 2020 R Series

R is a popular tool for data analysis and statistical computing, and is a great alternative to tools like SPSS, Stata, or Excel. Additionally, R is free and designed for reproducible research. This workshop series will teach you how to get started using R to clean, manipulate, summarize, and visualize data. We will not cover statistical analysis. Rather, this series will focus on all the steps that come *before* you run statistics, because getting your data into the right format is often the hardest part of data analysis. 

While these workshops are open to participants from all disciplines, we will focus on issues social and behavioral scientists often encounter when using data in R.

Register here for all three workshops in the series or to attend any single workshop. 




Workshop Format

  • Asynchronous materials to review before the workshop on Canvas
  • 10:00am - 10:30am: Open help session/time to review materials on Zoom
  • 10:30am - 11:15am: Live Demonstration on Zoom
  • 11:15am - Noon: Open help session/time to work on activities on Zoom



Workshop descriptions


Introduction to R

This workshop will teach you how to get started using R to explore and clean your data. 

This workshop will cover how to:

  • Create an R script (syntax/command file) to capture data cleaning steps in a reproducible way
  • Load a comma-delimited spreadsheet (.csv) into R as a dataset
  • View and examine data in R 
  • Check and correct missing values, rename variables, create new variables, and recode values in the data 
  • Save cleaned data file in formats for later use in R or other applications


Manipulating data using dplyr

This workshop will introduce you to the dplyr package designed for data manipulation in R. 

This workshop will cover how to:

  • Subset a dataset to select the column/variables you need
  • Filter rows of the dataset to include only certain cases
  • Sort data by values in a column/variable
  • Chain together multiple R functions in a single command
  • Group and summarize data using descriptive statistics


Visualizing data with Ggplot2

Ggplot2 is a popular package that extends R’s capability for data visualization, allowing users to produce attractive and complex graphics in a relatively simple way. This workshop will introduce the logic behind ggplot2 and demonstrate how to create data visualizations using this package. 

This workshop will cover how to:

  • Understand the basics of the "grammar of graphics" underlying ggplot2's functionality
  • Create a variety of reproducible data visualizations in R, such as histograms, line charts, scatter plots, heatmaps, and density plots
  • Visualize data by groups in multiple ways, including color labeling and faceting