Data Sources

Python Web Scraping & APIs

November 2, 2022, 3:00pm
In this workshop, we cover how to extract data from the web using Python. We focus on two approaches to extracting data from the web: leveraging application programming interfaces (APIs) and web scraping.

Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP

June 22, 2023, 9:30am
This three-hour workshop will cover charts in more detail, review pivot tables, and the widely-used VLOOKUP function. We recommend first taking the introductory workshop Excel Data Analysis: Introduction.

Python Web Scraping

November 2, 2023, 2:00pm
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.

Democratizing Our Data

August 26, 2021, 10:00am
There is enormous interest in building a better understanding of how evidence and data can inform policy. New possibilities have opened up to enable data to be shared and used across states and agencies. One is a technical approach – the Administrative Data Research Facility – which provides a secure environment within which education, training, and workforce data can be shared across agencies and states. The other is human – the Applied Data Analytics training program – which trains government agency staff how to combine and use the data to serve their agency missions. Over 650 participants from over 150 agencies have participated and produced new products and new networks in the process. This presentation discusses the approach sponsored by the California Department of Social Services, joint with the Department of Education and the Economic Development Department. The D-Lab worked with the Coleridge Initiative to successfully combine the two approaches. The presentation will also address the broader vision of how approaches like this can serve to democratize data for the United States.

Finding Health Statistics and Data

October 21, 2021, 11:00am
Participants in this workshop will learn about some of the issues surrounding the collection of health statistics, and will also learn about authoritative sources of health statistics and data. We will look at tools that let you create custom tables of vital statistics (birth, death, etc.), disease statistics, health behavior statistics, and more.

Excel Data Analysis: Introduction

June 22, 2022, 3:00pm
This is a three-hour introductory workshop that will provide an overview of Excel, with no prior experience assumed. Attendees will learn how to use functions for handling data and making calculations, how to build charts and pivot tables, and more.

Finding Health Statistics and Data

March 15, 2023, 12:00pm
Participants in this workshop will learn about some of the issues surrounding the collection of health statistics, and will also learn about authoritative sources of health statistics and data. We will look at tools that let you create custom tables of vital statistics (birth, death, etc.), disease statistics, health behavior statistics, and more.

Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP

January 31, 2024, 9:00am
This three-hour workshop will cover charts in more detail, review pivot tables, and the widely-used VLOOKUP function. We recommend first taking the introductory workshop Excel Data Analysis: Introduction.

Python Web Scraping

March 7, 2022, 10:00am
In this workshop, we cover how to extract data from the web using Python. We focus on two approaches to extracting data from the web: leveraging application programming interfaces (APIs) and web scraping.

Python Data Wrangling and Manipulation with Pandas

October 24, 2022, 3:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.