Data Manipulation and Cleaning

Gaby May Lagunes


Hello! I’m Gaby (she/her). I am PhD student at the ESPM department, I hold a masters in Data Science and Information from the Berkeley ISchool and I have 5+ years of industrial experience in different data roles. Before that I got a masters in Engineering for International Development and an undergraduate degree in Physics from University College London. And somewhere between all that I got married, survived the pandemic, and had two awesome boys. I’m very excited to help you use data to enhance your work and your experience here at Berkeley!

Leah Lee

Data Science Fellow
Integrative Biology

I am a PhD candidate in the department of Integrative Biology. My research interest is at the intersection of biomechanics, entomology, and physiology. Currently I am studying how beetles use their shield-like forewings called elytra for flight, thermoregulation, and protection. Prior to UC Berkeley, I worked as a research assistant at Korea Institute of Ocean Science and Technology (KIOST), studying algae phylogenetics. I received my B.A. in Biology and Mathematics from Swarthmore College.

Farnam Mohebi

Data Science Fellow
Haas School of Business

I am currently a PhD student at the Haas School of Business, having previously completed my MD-MPH. I focus on the intersection of healthcare and management, driven by a deep interest in understanding the embedded meanings in medical literature. I am also interested in physician-scientists' narratives of clinical artificial intelligence. Additionally, I study the impact of management practices on physicians. My work is guided by my background in healthcare and a commitment to improving organizational practices within the field.

R Data Wrangling and Manipulation: Parts 1-2

September 25, 2023, 10:00am
It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and other useful tasks.

Kamya Yadav

Data Science Fellow
Political Science

Kamya is a third year PhD student in the Department of Political Science. Using multimethod research, she studies gender, representation, and political parties in India to understand the barriers and pathways to women's political participation and representation. She has a BA in Politics from Princeton University.

Enrique Valencia López

Data Science Fellow
Graduate School of Education

Enrique Valencia López is a PhD student in the Policy, Politics and Leadership cluster at the Graduate School of Education.His research interests relate to three broad areas: the stratification of education by gender, immigration status and ethnicity; the measurement of teacher working conditions and well-being; and education in Latin America.

Before coming to Berkeley, Enrique worked for Mexico’s National Institute for Educational Evaluation and Assessment (INEE) in both the Policy and Indicators area. During that time, he co-authored Mexico’s first report on the educational...

Wadzanai Makomva

Discovery Graduate Fellow
School of Information

Wadzanai is a graduate student at the School of Information and she is a part of the MIMS program. She has a vested interest in the integration between data science, technology and developmental surveillance techniques. She has prior experience working as a quantitative analyst in project management consulting within a professional services firm, public health, and most recently in sustainable construction materials. Wadzanai is particularly interested in increasing access of STEM subjects and fields to under-privileged women of color in the African continent, particularly her home...

Aniket Gupta

Discovery Fellow
School of Information

I am a first year masters student at UC Berkeley school of Information majoring in Information Management and Systems with a focus on Data Science and ML. I like to build optimized yet simple and scalable solutions powered by data using emerging AI technologies.

Python Data Wrangling and Manipulation with Pandas

September 20, 2023, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Finley Golightly

D-Lab Staff
Applied Mathematics

Finley joined D-Lab as full-time staff launching their career in Data Science after graduating with a Bachelor's degree in Applied Math from UC Berkeley.

They have been with D-Lab since Fall 2020 and formerly as part of the UTech Management team before joining as full-time staff. They love the learning environment of D-Lab and their favorite part of the job is their co-workers! In their free time, they enjoy reading, boxing, listening to music, and playing Dungeons & Dragons. Feel free to stop by the front desk to ask them any questions or just to chat...