Python

Python Web Scraping

June 26, 2023, 2:00pm
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.
See event details for participation information.

Python Text Analysis Fundamentals: Parts 1-2

June 20, 2023, 9:00am
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.
See event details for participation information.

Python Data Wrangling and Manipulation with Pandas

June 21, 2023, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
See event details for participation information.

Python Fundamentals: Parts 1-3

June 13, 2023, 11:00am
This three-part interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.
See event details for participation information.

Acquiring Genomic Data from NCBI

April 4, 2023
Acquiring Genomic Data from NCBI An Introduction to Genomic Data and NCBI Databases

Genomic data is essential for studying evolutionary biology, human health, and epidemiology. For example, genomic data is used to track the evolution of various strains of the SARS-CoV-2 virus (see here for some great visualizations). But first–what is the difference between genomic and ...

Python Data Wrangling and Manipulation with Pandas

May 4, 2023, 1:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Python Fundamentals Pilot: Parts 1-3

May 1, 2023, 2:00pm
This three-part interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Understanding Rock Climbing using Python & SQL

March 22, 2022
Understanding Rock Climbing using Python & SQL

The Rise of Climbing

As an avid rock climber, I’ve been curious about how climbing became so popular in such a brief time, and what these climbers look like. Unlike other well established sports such as tennis, football, or basketball, climbing has only recently gained attention on the public stage, and little data is available about this burgeoning sport.For context, back in the day, climbing was a serious commitment! You had to find a buddy to learn how to use climbing equipment...

Twitter Text Analysis: A Friendly Introduction, Part 2

March 7, 2023

The code for this blog post is available in this GitHub repository. You can also follow along in this Collab notebook!

Introduction

This blog post follows up on an ...

Can Machine Learning Models Predict Reality TV Winners? The Case of Survivor

March 14, 2023

Since its creation in 2000, the show Survivor has dropped approximately 20 contestants in a remote location and forced them to hunt for their own food and build their own shelter. While a test of physical endurance, the social dynamics and strategy are what make the show stand out from other reality television shows. In this post, I lay out the rules of the show and explore ways to leverage existing data to predict how far players will make it into the game.

The R and Python files used to clean and analyze data are available on my ...