Data Manipulation and Cleaning

Python Data Wrangling and Manipulation with Pandas

August 22, 2024, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

TEST: Python Data Wrangling and Manipulation with Pandas

August 22, 2024, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Alex Ramiller

Senior Data Science Fellow 2024-2025, Data Science Fellow 2023-2024
City and Regional Planning

I am a PhD Candidate in City and Regional Planning. My research focuses on the use of large administrative datasets to study residential mobility, neighborhood change, and housing access. I received a Master in Geography from the University of Washington and a Bachelor's in Economics and Geography from Macalester College. I have also consulted on analytical projects for several organizations including the San Francisco Federal Reserve Bank, PolicyLink, and the City of Seattle.

Kamya Yadav

Senior Data Science Fellow 2024-2025, Data Science Fellow 2023-2024
Political Science

Kamya is a third year PhD student in the Department of Political Science. Using multimethod research, she studies gender, representation, and political parties in India to understand the barriers and pathways to women's political participation and representation. She has a BA in Politics from Princeton University.

Farnam Mohebi

Data Science Fellow
Haas School of Business

I am a PhD student at the Haas School of Business, University of California, Berkeley, and a researcher in the Department of Radiation Oncology at the University of California, San Francisco, having previously earned my MD and MPH degrees. My research focuses on the intersection of professionals and emerging technologies, drawing from the fields of medical sociology, organizational theory, and science and technology studies. I am particularly fascinated by the evolving relationship between physicians and artificial intelligence, the phenomenon of physician influencers, and the social...

Taesoo Song

Data Science Fellow 2024-2025
City and Regional Planning

Taesoo is a Ph.D. candidate in the City and Regional Planning department at the University of California, Berkeley. He studies the nexus of housing policy, neighborhood change, and residential outcomes for low-income and minority households.

His dissertation aims to reassess the prevailing narrative that Asian Americans face minimal barriers in the housing market using quantitative and qualitative methods. Taesoo has worked with the Terner Center for Housing Innovation and the Urban Displacement Project at UC Berkeley, as well as the Seoul Institute in South Korea.

Christian Caballero

Data Science Fellow 2024-2025
Political Science

Christian Caballero is a Political Science PhD student at the University of California, Berkeley. His research focuses on American politics and political behavior. In particular, he studies the ways in which social networks influence processes of political persuasion and democratic deliberation, as well as how political ideologies develop within subcultures.

He holds a B.A. in Politics and Sociology from New York University and an M.A. in Political Science from the University of California, Berkeley.

Jane (Mango) Angar

Data Science Fellow 2024-2025
Political Science

Hi! I am a PhD candidate in the Political Science Department at UC Berkeley. My dissertation traces the emergence of disability rights groups in Africa, focusing on Zambia and Malawi, and examines factors influencing their effectiveness. I use mixed methods, including archival work, field interviews, participant observation, and surveys for data collection.

My data analysis techniques include text analysis, social network analysis, means tests, and regressions. In my free time, I enjoy moderately difficult hikes, walks along the beach with my dog, Princess, and...

Yue Lin

Data Science Fellow 2024-2025
Political Science

Yue is a Ph.D. student in Political Science at the University of California, Berkeley, with a Designated Emphasis on Political Economy. Using mixed methods, she studies foreign lobbying, geopolitical risk, and economic security to understand when, how, and why multinational corporations become the targets and weapons of state power rivalry.

Jaewon Saw

Data Science Fellow 2024-2025
Civil and Enviromental Engineering

I am a PhD candidate in Systems Engineering. My current research focuses on distributed acoustic sensing (DAS), a cutting-edge technology with diverse applications. I have used DAS to detect whale vocalizations in Monterey Bay, California, and to monitor roadways, water pipelines, and energy infrastructure.

I enjoy identifying and mitigating challenges that arise when applying new technologies by developing data tools, pipelines, and frameworks for real-world deployments. My work is driven by a keen interest in exploring and refining innovative...