Data Manipulation and Cleaning

Wadzanai Makomva

Discovery Graduate Fellow
School of Information

Wadzanai is a graduate student in the MIMS program at the School of Information. She has a strong interest in the integration of data science, technology, and developmental surveillance techniques. She has worked as a quantitative analyst in project management consulting at a professional services firm, in public health, and most recently in sustainable construction materials. Wadzanai is particularly interested in increasing access to STEM subjects and fields for underprivileged women of color on the African continent, particularly her home...

Aniket Gupta

Discovery Fellow
School of Information

I am a first-year master's student at the UC Berkeley School of Information, studying Information Management and Systems with a focus on data science and ML. I like to build simple, scalable, and optimized solutions powered by data, using emerging AI technologies.

Python Data Wrangling and Manipulation with Pandas

September 20, 2023, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
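As a rough illustration of the kind of preparation steps such a workshop might touch on, here is a minimal sketch. The data, column names, and choice of steps are hypothetical and are not taken from the workshop materials; it assumes pandas is installed.

```python
import pandas as pd

# Hypothetical example data with common problems: stray whitespace,
# numbers stored as text, a duplicated row, and a missing value.
raw = pd.DataFrame({
    "name": [" Ana", "Ben ", "Ben ", "Cy"],
    "score": ["90", "85", "85", None],
    "group": ["a", "b", "b", "a"],
})

clean = (
    raw.assign(
        # Trim whitespace around names
        name=raw["name"].str.strip(),
        # Convert text to numbers; unparseable or missing values become NaN
        score=pd.to_numeric(raw["score"], errors="coerce"),
    )
    .drop_duplicates()            # remove exact duplicate rows
    .dropna(subset=["score"])     # drop rows with no usable score
    .reset_index(drop=True)
)

# A simple grouped summary on the cleaned data
mean_by_group = clean.groupby("group")["score"].mean()

print(clean)
print(mean_by_group)
```

Whether to drop or fill missing values depends on the analysis; dropping is used here only to keep the sketch short.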

Python Data Wrangling and Manipulation with Pandas

August 17, 2023, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Python Web Scraping

June 26, 2023, 2:00pm
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.
See event details for participation information.
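The idea of sifting through a page's source to extract data can be sketched with Python's standard library alone. The HTML string, class name, and function name below are hypothetical stand-ins; in practice the source would first be downloaded (for example with `urllib.request`), and the workshop itself may use different tools.

```python
from html.parser import HTMLParser

# Hypothetical page source. A real scraper would download this string,
# e.g. urllib.request.urlopen(url).read().decode().
SAMPLE_HTML = """
<html><body>
  <h1>Workshops</h1>
  <a href="/pandas">Pandas</a>
  <a href="/scraping">Web Scraping</a>
</body></html>
"""


class LinkExtractor(HTMLParser):
    """Collect (href, link text) pairs from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._current_href = None
        self._text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # attrs is a list of (name, value) pairs
            self._current_href = dict(attrs).get("href")
            self._text_parts = []

    def handle_data(self, data):
        # Only keep text that appears inside a link with an href
        if self._current_href is not None:
            self._text_parts.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._current_href is not None:
            text = "".join(self._text_parts).strip()
            self.links.append((self._current_href, text))
            self._current_href = None


def extract_links(html):
    parser = LinkExtractor()
    parser.feed(html)
    return parser.links


links = extract_links(SAMPLE_HTML)
print(links)
```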

Python Data Wrangling and Manipulation with Pandas

June 21, 2023, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
See event details for participation information.

R Data Wrangling and Manipulation: Parts 1-2

May 1, 2023, 10:00am
It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and other useful tasks.

Python Data Wrangling and Manipulation with Pandas

May 4, 2023, 1:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Exploring Population Data with IPUMS

November 8, 2022
Last month, demographer and historian Steve Ruggles was awarded a prestigious MacArthur Foundation Fellowship for his work developing IPUMS—a harmonized database of individual and family responses to large-scale domestic and international surveys. With some samples going as far back as the 18th century, IPUMS can offer key insights into changing demographics, norms, and decision-making over...

Python Web Scraping

March 28, 2023, 2:00pm
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.