Data Sources

Python Data Wrangling and Manipulation with Pandas

March 15, 2023, 2:00pm

Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Read more about Python Data Wrangling and Manipulation with Pandas

Python Web APIs

February 8, 2024, 10:00am

In this workshop, we cover how to extract data from the web with APIs using Python. APIs are often official services offered by companies and other entities, which allow you to directly query their servers in order to retrieve their data. Platforms like The New York Times, Twitter and Reddit offer APIs to retrieve data.

Read more about Python Web APIs

Python Data Wrangling and Manipulation with Pandas

October 19, 2021, 10:00am

Read more about Python Data Wrangling and Manipulation with Pandas

Python Web Scraping & APIs

June 29, 2022, 1:00pm

In this workshop, we cover how to extract data from the web using Python. We focus on two approaches to extracting data from the web: leveraging application programming interfaces (APIs) and web scraping.

Read more about Python Web Scraping & APIs

Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP

June 22, 2023, 9:30am

This three-hour workshop will cover charts in more detail, review pivot tables, and the widely-used VLOOKUP function. We recommend first taking the introductory workshop Excel Data Analysis: Introduction.

Read more about Excel Data Analysis: Charts, Pivot Tables, and VLOOKUP

Python Web Scraping

November 2, 2023, 2:00pm

In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.

Read more about Python Web Scraping

Democratizing Our Data

August 26, 2021, 10:00am

There is enormous interest in building a better understanding of how evidence and data can inform policy. New possibilities have opened up to enable data to be shared and used across states and agencies. One is a technical approach – the Administrative Data Research Facility – which provides a secure environment within which education, training, and workforce data can be shared across agencies and states. The other is human – the Applied Data Analytics training program – which trains government agency staff how to combine and use the data to serve their agency missions. Over 650 participants from over 150 agencies have participated and produced new products and new networks in the process. This presentation discusses the approach sponsored by the California Department of Social Services, joint with the Department of Education and the Economic Development Department. The D-Lab worked with the Coleridge Initiative to successfully combine the two approaches. The presentation will also address the broader vision of how approaches like this can serve to democratize data for the United States.

Read more about Democratizing Our Data

Python Data Wrangling and Manipulation with Pandas

April 18, 2022, 10:00am

Read more about Python Data Wrangling and Manipulation with Pandas

Python Web Scraping & APIs

November 2, 2022, 3:00pm

Read more about Python Web Scraping & APIs

Propensity Score Matching for Causal Inference: Creating Data Visualizations to Assess Covariate Balance in R

June 10, 2024

Sharon Green

by Sharon Green. Although some people consider randomized experiments the gold standard, in many cases, it would be highly unethical to assign individuals to harmful exposures to measure their effects. Modern causal inference techniques help scientists to estimate treatment effects using observational data. In particular, propensity score matching helps scientists estimate causal effects using observational data by matching individuals so that the “treatment” and “control” groups are balanced on measured covariates. After implementing propensity score matching, data visualizations make it easier to assess the quality of the matches before estimating effects. This blog post is a tutorial for implementing propensity score matching and creating data visualizations to assess covariate balance–that is, visually assessing whether the matched individuals are balanced with respect to measured covariates.

Read more about Propensity Score Matching for Causal Inference: Creating Data Visualizations to Assess Covariate Balance in R

« first View: Taxonomy term
‹ previous View: Taxonomy term
…
4 of 12 View: Taxonomy term
5 of 12 View: Taxonomy term
6 of 12 View: Taxonomy term
7 of 12 View: Taxonomy term
8 of 12 View: Taxonomy term
9 of 12 View: Taxonomy term
10 of 12 View: Taxonomy term (Current page)
11 of 12 View: Taxonomy term
12 of 12 View: Taxonomy term
next › View: Taxonomy term
last » View: Taxonomy term