Data Sources

Python Web Scraping

March 5, 2025, 10:00am
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.

Python Web APIs

March 3, 2025, 10:00am
In this workshop, we cover how to extract data from the web with APIs using Python. APIs are often official services offered by companies and other entities, which allow you to directly query their servers in order to retrieve their data. Platforms like The New York Times, Twitter and Reddit offer APIs to retrieve data.

Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP

March 7, 2025, 1:00pm
This three-hour workshop will cover charts in more detail, review pivot tables, and the widely-used VLOOKUP function. We recommend first taking the introductory workshop Excel Data Analysis: Introduction.

R Census Data Fundamentals

March 10, 2025, 2:00pm
In this workshop, we provide an overview of conducting U.S. Census data analysis and visualization in R. First, we’ll cover the basic concepts of U.S. Census Data. Then, we’ll demonstrate how to call the census data API directly from R by using the R tidycensus package.

Excel Data Analysis: Introduction

March 3, 2025, 1:00pm
This is a three-hour introductory workshop that will provide an overview of Excel, with no prior experience assumed. Attendees will learn how to use functions for handling data and making calculations, how to build charts and pivot tables, and more.

Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP

February 10, 2025, 2:00pm
This three-hour workshop will cover charts in more detail, review pivot tables, and the widely-used VLOOKUP function. We recommend first taking the introductory workshop Excel Data Analysis: Introduction.

Excel Data Analysis: Introduction

February 3, 2025, 2:00pm
This is a three-hour introductory workshop that will provide an overview of Excel, with no prior experience assumed. Attendees will learn how to use functions for handling data and making calculations, how to build charts and pivot tables, and more.

A Recipe for Reliable Discoveries: Ensuring Stability Throughout Your Data Work

November 19, 2024
by Jaewon Saw. Imagine perfecting a favorite recipe, then sharing it with others, only to find their results differ because of small changes in tools or ingredients. How do you ensure the dish still reflects your original vision? This challenge captures the principle of stability in data science: achieving acceptable consistency in outcomes relative to reasonable perturbations of conditions and methods. In this blog post, I reflect on my research journey and share why grounding data work in stability is essential for reproducibility, adaptability, and trust in the final results.

R Census Data Fundamentals

November 25, 2024, 2:00pm
In this workshop, we provide an overview of conducting U.S. Census data analysis and visualization in R. First, we’ll cover the basic concepts of U.S. Census Data. Then, we’ll demonstrate how to call the census data API directly from R by using the R tidycensus package.

Exploring Rental Affordability in the San Francisco Bay Area Neighborhoods with R

November 5, 2024
by Taesoo Song. Many American cities continue to face severe rental burdens. However, we rarely examine rental affordability through the lens of quantitative data. In this blog post, I demonstrate how to download and visualize rental affordability data for the San Francisco Bay Area using R packages like `tidycensus` and `sf`. This exercise shows that mapping census data can be a straightforward and powerful way to understand the spatial patterns of housing dynamics and can offer valuable insights for research, policy, and advocacy.