Data Manipulation and Cleaning

Farnam Mohebi

Data Science Fellow
Haas School of Business

I am a PhD student at the Haas School of Business, University of California, Berkeley, and a researcher in the Department of Radiation Oncology at the University of California, San Francisco, having previously earned my MD and MPH degrees. My research focuses on the intersection of professionals and emerging technologies, drawing from the fields of medical sociology, organizational theory, and science and technology studies. I am particularly fascinated by the evolving relationship between physicians and artificial intelligence, the phenomenon of physician influencers, and the social...

Mingyu Yuan

Data Science for Social Justice Senior Fellow 2024
Linguistics

I am a Ph.D. candidate in Linguistics, with a focus on phonetics and phonology, specifically speech production in neuro-atypical populations. I use methods from Natural Language Processing in my day-to-day research.

Stephanie Andrews

Data Science for Social Justice Senior Fellow 2024
Info & Data Science MIDS

Stephanie Andrews is currently studying data science in the MIDS program, having previously majored in Social Welfare as an undergraduate at Cal. After graduating, she worked as an advocate for survivors of gender-based violence, as a public policy analyst focusing on anti-trafficking initiatives, and as a software engineer for progressive and social impact organizations. She is now conducting research with the Human Rights Center's Investigations Lab, using OSINT and data science methods to investigate human rights violations.

Violet Davis

Data Science for Social Justice Senior Fellow 2024
MIDS

I am a Master's student studying Data Science with the School of Information. My research involves computational social science projects focused on social justice and equity.

Sahiba Chopra

Data Science Fellow 2024
Haas

I'm a PhD student in the Management and Organizations (Macro) group at Berkeley Haas. I have a diverse professional background, primarily as a data scientist across numerous industries, including fintech, cleantech, and media. I hold a BA in Economics from the University of Maryland, an MS in Applied Economics from the University of San Francisco, and an MS in Business Administration from UC Berkeley.

My research focuses on the intersection of inequality, technology, and the labor market. I am particularly interested in understanding how to reduce inequality in...

Jaewon Saw

Data Science Fellow 2024
Civil and Environmental Engineering

I am a PhD candidate in Systems Engineering. My current research focuses on distributed acoustic sensing (DAS), a cutting-edge technology with diverse applications. I have used DAS to detect whale vocalizations in Monterey Bay, California, and to monitor roadways, water pipelines, and energy infrastructure.

I enjoy identifying and mitigating challenges that arise when applying new technologies by developing data tools, pipelines, and frameworks for real-world deployments. My work is driven by a keen interest in exploring and refining innovative...

R Advanced Data Wrangling: Parts 1-2

October 5, 2021, 2:00pm
Advanced Data Wrangling aims to help students learn powerful tools and techniques for wrangling data in R with less pain and more fun. This workshop will show how R can make your data wrangling process faster, more reliable, and more interpretable.

Python Data Wrangling and Manipulation with Pandas

November 15, 2023, 9:00am
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
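
As a rough illustration of the kind of preparation steps such a workshop might cover, here is a minimal sketch using pandas. The table, its column names, and the `clean` helper are hypothetical and are not taken from the workshop materials; the steps shown (dropping missing values, trimming whitespace, converting types, removing duplicates) are just one plausible cleaning sequence.

```python
import pandas as pd

# Hypothetical example data; not from the workshop materials.
raw = pd.DataFrame({
    "name": [" Ada ", "Grace", "Grace", None],
    "score": ["90", "85", "85", "70"],
})


def clean(df):
    """Return a cleaned copy of df (illustrative helper, assumed for this sketch)."""
    # Drop rows with a missing name.
    out = df.dropna(subset=["name"]).copy()
    # Trim stray whitespace around names.
    out["name"] = out["name"].str.strip()
    # Convert scores from strings to numbers.
    out["score"] = pd.to_numeric(out["score"])
    # Remove exact duplicate rows and renumber the index.
    return out.drop_duplicates().reset_index(drop=True)


cleaned = clean(raw)
print(cleaned)
```

Working on a copy leaves the original table untouched, which makes it easier to compare the data before and after cleaning.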

Python Web Scraping

March 7, 2022, 10:00am
In this workshop, we cover how to extract data from the web using Python. We focus on two approaches to extracting data from the web: leveraging application programming interfaces (APIs) and web scraping.

Python Web Scraping

June 26, 2024, 10:00am
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.
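
The "sifting" step described above can be sketched with Python's standard library alone. This is a minimal sketch under stated assumptions, not the workshop's own method: the page source is a hypothetical inline string standing in for a downloaded page, and the `TitleExtractor` class, the `extract_titles` helper, and the `class="title"` markup are all invented for illustration.

```python
from html.parser import HTMLParser

# Hypothetical page source; in practice this string would be downloaded first.
PAGE_SOURCE = """
<html><body>
  <h2 class="title">Python Web Scraping</h2>
  <p>Intro text</p>
  <h2 class="title">R Advanced Data Wrangling</h2>
</body></html>
"""


class TitleExtractor(HTMLParser):
    """Collect the text of every <h2 class="title"> element (illustrative)."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.titles = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag.
        if tag == "h2" and ("class", "title") in attrs:
            self.in_title = True

    def handle_data(self, data):
        # Keep text only while inside a matching heading.
        if self.in_title:
            self.titles.append(data.strip())

    def handle_endtag(self, tag):
        if tag == "h2":
            self.in_title = False


def extract_titles(source):
    """Parse HTML source and return the heading texts found."""
    parser = TitleExtractor()
    parser.feed(source)
    parser.close()
    return parser.titles


print(extract_titles(PAGE_SOURCE))
```

Parsing a fixed string keeps the example runnable offline; a real scraper would first fetch the source over HTTP and should respect the site's terms of use.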