Software Tools

Introduction to Bash + Git

March 15, 2022, 9:00am
This workshop starts by introducing you to navigating your computer’s file system and to basic Bash commands, to remove the fear of working with the command line and give you the confidence to use it to increase your productivity. It then moves on to Git, a powerful tool for keeping track of changes you make to the files in a project.

R Data Wrangling and Manipulation: Parts 1-2

March 1, 2022, 5:00pm
Evening workshop 5-8pm. It is said that 80% of the time in data analysis is spent cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and perform other useful tasks.

R Introduction to Machine Learning with tidymodels: Parts 1-2

March 1, 2022, 9:00am
Machine learning often evokes images of Skynet, self-driving cars, and computerized homes. However, these ideas are less science fiction than tangible phenomena predicated on description, classification, prediction, and pattern recognition in data. During this two-part workshop, we will discuss basic features of supervised machine learning algorithms including k-nearest neighbor, linear regression, decision tree, random forest, boosting, and ensembling using the tidymodels framework. To social scientists, such methods might be critical for investigating evolutionary relationships, global health patterns, voter turnout in local elections, or individual psychological diagnoses.

R Census Data Wrangling and Mapping

March 15, 2022, 12:00pm
Since 1790, the US Census has been THE source of data about American people, providing valuable insights to social scientists and humanists. Mapping these data by census geographies adds more value by allowing researchers to explore spatial trends and outliers. This workshop will introduce three key packages for streamlining census data workflows in R: tigris, tidycensus and tmap. Participants will learn how to download census tabular data for one or more geographic aggregation units or years, download the associated census geographic data and then join these data for analysis and mapping.

Eileen Cahill

D-Lab Alumni
School of Information

Eileen is currently a first-year Information Management and Systems student committed to studying human-centered design for the utility and usability of healthcare systems. She spent the last few years working in genomic research program analysis and management at the National Human Genome Research Institute. Prior to that, Eileen attended Georgetown University, where she studied biology and studio art. During this time, she performed research on water contaminants in an analytical chemistry lab as well as research on the effects of estrogen-mimicking compounds on zebrafish in a brain...

R Geospatial Fundamentals: Raster Data

March 16, 2022, 10:00am
Geospatial data are an important component of data visualization and analysis in the social sciences, humanities, and elsewhere. The R programming language is a great platform for exploring these data and integrating them into your research. This workshop focuses on fundamental operations for reading, writing, manipulating and mapping raster data, which typically represent geographic information as a grid of regularly sized cells.
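
As a rough illustration of the "grid of cells" idea, the sketch below models a tiny raster in plain Python. It is not the R tooling the workshop uses; the `Raster` class, its field names, and the elevation values are all hypothetical and chosen only to show how a map coordinate resolves to a cell.

```python
from dataclasses import dataclass


@dataclass
class Raster:
    """Hypothetical minimal raster: a grid of values plus where it sits on the map."""
    values: list      # rows of cell values; row 0 is the northern edge
    x_min: float      # x coordinate of the western edge
    y_max: float      # y coordinate of the northern edge
    cell_size: float  # width and height of each square cell

    def cell_at(self, x, y):
        """Return the value of the cell that contains the point (x, y)."""
        col = int((x - self.x_min) // self.cell_size)
        row = int((self.y_max - y) // self.cell_size)
        if not (0 <= row < len(self.values) and 0 <= col < len(self.values[0])):
            raise ValueError("point lies outside the raster extent")
        return self.values[row][col]


# Made-up elevation values on a 3 x 4 grid of 10-unit cells.
grid = Raster(
    values=[[10, 12, 15, 11],
            [9, 14, 18, 13],
            [8, 10, 12, 9]],
    x_min=100.0,
    y_max=50.0,
    cell_size=10.0,
)

value = grid.cell_at(125.0, 38.0)  # falls in row 1, column 2
print(value)
```

The point of the sketch is that a raster stores no coordinates per cell: location is implied by the grid origin and the cell size.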

R Geospatial Fundamentals: Vector Data, Parts 1-2

March 7, 2022, 10:00am
Geospatial data are an important component of data visualization and analysis in the social sciences, humanities, and elsewhere. The R programming language is a great platform for exploring these data and integrating them into your research. This workshop focuses on fundamental operations for reading, writing, manipulating and mapping vector data, which encodes location as points, lines and polygons.
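
To make the points, lines and polygons distinction concrete, here is a minimal plain-Python sketch. It is only an analogue of the idea, not the R packages covered in the workshop; the coordinates and the helper names `line_length` and `polygon_area` are invented for illustration, and the geometry is treated as planar.

```python
import math

# Hypothetical features, each encoded purely as coordinates.
point = (2.0, 3.0)                                            # a single location
line = [(0.0, 0.0), (3.0, 4.0), (3.0, 10.0)]                  # an ordered path
polygon = [(0.0, 0.0), (4.0, 0.0), (4.0, 3.0), (0.0, 3.0)]    # an implicitly closed ring


def line_length(coords):
    """Sum the straight-line distances between consecutive vertices."""
    return sum(math.dist(a, b) for a, b in zip(coords, coords[1:]))


def polygon_area(ring):
    """Planar area of a simple polygon using the shoelace formula."""
    total = 0.0
    # Pair each vertex with the next one, wrapping back to the first.
    for (x1, y1), (x2, y2) in zip(ring, ring[1:] + ring[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


length = line_length(line)
area = polygon_area(polygon)
print(length, area)
```

Unlike a raster, every vector feature carries its own explicit coordinates, which is why measurements such as length and area can be computed directly from the geometry.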

Python Visualization

February 17, 2022, 9:00am
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

R Visualization

February 10, 2022, 5:00pm
This workshop will provide an introduction to graphics in R with ggplot2. Participants will learn how to construct, customize, and export a variety of plot types in order to visualize relationships in data. We will also explore the basic grammar of graphics, including the aesthetics and geometry layers, adding statistics, transforming scales, and coloring or panelling by groups. You will learn how to make histograms, boxplots, scatterplots, line plots, and heatmaps, as well as how to make compound figures.

R Data Wrangling and Manipulation: Parts 1-2

February 1, 2022, 9:00am
It is said that 80% of the time in data analysis is spent cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and perform other useful tasks.