Data Sources

CANCELED: Python Data Wrangling and Manipulation with Pandas

November 29, 2022, 3:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Exploring Population Data with IPUMS

November 8, 2022
Last month, demographer and historian Steve Ruggles was awarded a prestigious MacArthur Foundation Fellowship for his work developing IPUMS—a harmonized database of individual and family...

Finding Health Statistics and Data

November 2, 2022, 1:00pm
Participants in this workshop will learn about some of the issues surrounding the collection of health statistics, and will also learn about authoritative sources of health statistics and data. We will look at tools that let you create custom tables of vital statistics (birth, death, etc.), disease statistics, health behavior statistics, and more.

Python Web Scraping & APIs

November 2, 2022, 3:00pm
In this workshop, we cover how to extract data from the web using Python, focusing on two approaches: calling application programming interfaces (APIs) and web scraping.
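As a rough illustration of the difference between the two approaches, the sketch below parses a JSON string of the kind an API might return and then pulls the same sort of information out of raw HTML. It is not taken from the workshop materials: the JSON payload, the HTML snippet, the `title` class, and the `TitleParser` helper are all made up for this example, and it uses only the Python standard library so that no network access is needed.

```python
import json
from html.parser import HTMLParser

# Approach 1: an API usually returns structured data such as JSON.
# This string stands in for a response body (hypothetical payload).
api_response = '{"workshops": [{"title": "Python Web Scraping & APIs", "time": "3:00pm"}]}'
api_titles = [w["title"] for w in json.loads(api_response)["workshops"]]

# Approach 2: scraping means extracting data from HTML meant for browsers.
# This snippet stands in for a downloaded page (hypothetical markup).
page_html = """
<html><body>
  <h2 class="title">Python Web Scraping &amp; APIs</h2>
  <p>November 2, 2022, 3:00pm</p>
  <h2 class="title">Finding Health Statistics and Data</h2>
</body></html>
"""


class TitleParser(HTMLParser):
    """Collect the text inside every <h2 class="title"> element."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.titles = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs.
        if tag == "h2" and ("class", "title") in attrs:
            self.in_title = True
            self.titles.append("")

    def handle_endtag(self, tag):
        if tag == "h2":
            self.in_title = False

    def handle_data(self, data):
        # Text may arrive in several pieces, so accumulate it.
        if self.in_title:
            self.titles[-1] += data


parser = TitleParser()
parser.feed(page_html)
scraped_titles = [t.strip() for t in parser.titles]

print(api_titles)
print(scraped_titles)
```

With an API the structure is given to you, whereas with scraping you have to locate the data in markup that can change without notice.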

Python Data Wrangling and Manipulation with Pandas

October 24, 2022, 3:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
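To give a sense of what preparing data for analysis can look like, here is a minimal sketch of a few common steps in Pandas: dropping incomplete rows, tidying text, converting types, and summarizing. The table, its column names, and its values are invented for illustration and are not the workshop's example data.

```python
import pandas as pd

# Hypothetical messy input: stray whitespace, inconsistent case,
# numbers stored as text, and missing values.
raw = pd.DataFrame(
    {
        "city": [" Oakland", "Berkeley", "berkeley ", None],
        "year": ["2020", "2021", "2021", "2021"],
        "population": ["440000", "124000", None, "1000"],
    }
)

clean = (
    raw.dropna(subset=["city"])  # drop rows with no city label
    .assign(
        city=lambda d: d["city"].str.strip().str.title(),  # normalize text
        year=lambda d: d["year"].astype(int),  # text -> integer
        population=lambda d: pd.to_numeric(d["population"]),  # text -> number
    )
    .dropna(subset=["population"])  # drop rows with no population value
    .reset_index(drop=True)
)

# One row per city with its total population.
summary = clean.groupby("city", as_index=False)["population"].sum()
print(summary)
```

Chaining the steps keeps the raw table untouched, which makes it easier to rerun or adjust the cleaning later.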

Python Data Wrangling and Manipulation with Pandas

September 28, 2022, 3:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Aaron Culich

Consulting Drop-In Hours: By appointment only

Consulting Areas: Python, R, APIs, Cloud & HPC Computing, Databases & SQL, Bash or Command Line, Git or GitHub

Quick-tip: the fastest way to speak to a consultant is to first submit a request and then ...

Ella Belfer

Consultant
Energy and Resources Group

Ella is a PhD student in the Energy and Resources Group. Her research examines water governance in a changing climate, drawing on geospatial techniques. Her past work includes applications of topic modelling in climate change adaptation research and inductive coding of semi-structured interviews.

Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP

September 14, 2022, 4:00pm
This three-hour workshop will cover charts in more detail, review pivot tables, and introduce the widely used VLOOKUP function. We recommend first taking the introductory workshop Excel Data Analysis: Introduction.

Excel Data Analysis: Introduction

September 7, 2022, 4:00pm
This is a three-hour introductory workshop that will provide an overview of Excel, with no prior experience assumed. Attendees will learn how to use functions for handling data and making calculations, how to build charts and pivot tables, and more.