Software Tools

R Data Wrangling and Manipulation: Parts 1-2

November 1, 2022, 2:00pm
It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and other useful tasks.

Python Data Visualization

February 16, 2023, 10:00am
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

R Deep Learning: Parts 1-2

April 19, 2023, 9:00am
This workshop introduces the basic concepts of Deep Learning — the training and performance evaluation of large neural networks, especially for image classification, natural language processing, and time-series data. Like many other machine learning algorithms, we will use deep learning algorithms to map input data to their appropriately classified outcome labels.

Python Visualization

November 3, 2021, 9:00am
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

R Data Visualization

October 4, 2023, 2:30pm
This workshop will provide an introduction to graphics in R with ggplot2. Participants will learn how to construct, customize, and export a variety of plot types in order to visualize relationships in data. We will also explore the basic grammar of graphics, including the aesthetics and geometry layers, adding statistics, transforming scales, and coloring or panelling by groups. You will learn how to make histograms, boxplots, scatterplots, lineplots, and heatmaps as well as how to make compound figures.

R Introduction to Machine Learning with tidymodels: Parts 1-2

March 1, 2022, 9:00am
Machine learning often evokes images of Skynet, self-driving cars, and computerized homes. However, these ideas are less science fiction as they are tangible phenomena that are predicated on description, classification, prediction, and pattern recognition in data. During this two part workshop, we will discuss basic features of supervised machine learning algorithms including k-nearest neighbor, linear regression, decision tree, random forest, boosting, and ensembling using the tidymodels framework. To social scientists, such methods might be critical for investigating evolutionary relationships, global health patterns, voter turnout in local elections, or individual psychological diagnoses.

R Data Visualization

February 22, 2024, 10:00am
This workshop will provide an introduction to graphics in R with ggplot2. Participants will learn how to construct, customize, and export a variety of plot types in order to visualize relationships in data. We will also explore the basic grammar of graphics, including the aesthetics and geometry layers, adding statistics, transforming scales, and coloring or panelling by groups. You will learn how to make histograms, boxplots, scatterplots, lineplots, and heatmaps as well as how to make compound figures.

Python Visualization

June 2, 2022, 1:00pm
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

Python Data Visualization

January 25, 2023, 2:00pm
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

R Advanced Data Wrangling: Parts 1-2

October 5, 2021, 2:00pm
Advanced Data Wrangling aims to help students to learn powerful data wrangling tools and techniques in R to wrangle data with less pain and more fun. This workshop will show how R can make your data wrangling process faster, more reliable, and interpretable.