Python

Monica Donegan

Data Science Fellow 2022-2023
Environmental Science, Policy, and Management

Monica is a third-year Ph.D. candidate in the Environmental Science, Policy, and Management program. She uses computational tools to study the evolution and ecology of agricultural plant pathogens. Previously, she worked on a data science team at a biotech company in Boston.

Sahiba Chopra

Data Science Fellow 2024-2025
Haas School of Business

I'm a PhD student in the Management and Organizations (Macro) group at Berkeley Haas. I have a diverse professional background, primarily as a data scientist across numerous industries, including fintech, cleantech, and media. I hold a BA in Economics from the University of Maryland, an MS in Applied Economics from the University of San Francisco, and an MS in Business Administration from UC Berkeley.

My research focuses on the intersection of inequality, technology, and the labor market. I am particularly interested in understanding how to reduce inequality in...

Ruiji Sun

Data Science Fellow 2024-2025
Center for the Built Environment

Ruiji Sun is currently a Ph.D. candidate in Building Science at UC Berkeley. He is also a GSR at the Center for the Built Environment (CBE). His dissertation focuses on causal inference in the built environment. Other areas of his research include indoor environmental quality, personalized environmental control systems, and building energy modeling.

He obtained his M.S. degree from Carnegie Mellon University and double-majored in Mechanical Engineering (HVAC) and Architecture at Xi’an University of Architecture and Technology, China. Ruiji also served as a board...

Nanqin Ying

Data Science Fellow 2024-2025
Goldman School of Public Policy

Nanqin Ying, a second-year graduate student at the Goldman School of Public Policy specializing in Development Practices, combines a robust nonprofit background with advanced data science techniques. She focuses on leveraging machine learning and big data to drive significant social change, aiming to transform insights into actionable, positive impacts on communities.

Jaewon Saw

Data Science Fellow 2024-2025
Civil and Environmental Engineering

I am a PhD candidate in Systems Engineering. My current research focuses on distributed acoustic sensing (DAS), a cutting-edge technology with diverse applications. I have used DAS to detect whale vocalizations in Monterey Bay, California, and to monitor roadways, water pipelines, and energy infrastructure.

I enjoy identifying and mitigating challenges that arise when applying new technologies by developing data tools, pipelines, and frameworks for real-world deployments. My work is driven by a keen interest in exploring and refining innovative...

Christian Caballero

Data Science Fellow 2024-2025
Political Science

Christian Caballero is a Political Science PhD student at the University of California, Berkeley. His research focuses on American politics and political behavior. In particular, he studies the ways in which social networks influence processes of political persuasion and democratic deliberation, as well as how political ideologies develop within subcultures.

He holds a B.A. in Politics and Sociology from New York University and an M.A. in Political Science from the University of California, Berkeley.

Bruno Smaniotto

Data Science Fellow 2024-2025
Economics

I'm originally from Brazil, but I have been living in Berkeley for the last 5 years working towards my PhD in Economics. My main areas of interest are Behavioral Economics and Macroeconomics, especially their intersection, but I'm excited about learning and working on empirical applications in different fields.

Measuring Vowels Without Relying on Sex-Based Assumptions

April 8, 2025
by Amber Galvano. This tutorial builds on my previous post on Python for acoustic analysis, this time focusing on measuring vocal tract resonances without relying on sex-based assumptions. I demonstrate how to process audio files and vowel annotations using an adaptive method that optimizes the acoustic analysis across a recording. Instead of fixing parameters based on generalized vocal tract length correlations, this approach varies them within a defined range for greater accuracy. This not only enhances measurement precision but also avoids requiring (or assuming) speakers’ sex in data collection. Finally, I show how to filter for outliers and create high-quality vowel space visualizations.
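As a rough illustration of the adaptive idea described above, the sketch below tries several candidate analysis ceilings and keeps the one that gives the steadiest formant tracks. It is not taken from the tutorial: the selection criterion (lowest summed variance of F1 and F2), the candidate range, and the names `track_variability`, `pick_ceiling`, and `toy_measure` are all assumptions made for this example, and `toy_measure` is a stand-in that returns made-up values where a real formant tracker would be called.

```python
from statistics import pvariance
from typing import Callable, Iterable, Sequence, Tuple

# One (F1, F2) pair in Hz per analysis frame.
Track = Sequence[Tuple[float, float]]


def track_variability(track: Track) -> float:
    """Summed population variance of F1 and F2 across frames (assumed criterion)."""
    f1 = [frame[0] for frame in track]
    f2 = [frame[1] for frame in track]
    return pvariance(f1) + pvariance(f2)


def pick_ceiling(measure: Callable[[float], Track], ceilings: Iterable[float]) -> float:
    """Return the candidate ceiling whose measured track varies the least."""
    return min(ceilings, key=lambda c: track_variability(measure(c)))


def toy_measure(ceiling_hz: float) -> Track:
    """Hypothetical stand-in for a formant tracker, using invented values.

    The fake tracks get jumpier the further the ceiling is from 5250 Hz.
    """
    jitter = abs(ceiling_hz - 5250) / 10
    return [(500 + jitter * s, 1500 - jitter * s) for s in (-1, 0, 1, 0, -1)]


# Candidate ceilings from 4500 Hz to 6500 Hz in 250 Hz steps (assumed range).
candidates = range(4500, 6501, 250)
best = pick_ceiling(toy_measure, candidates)
print(best)
```

In practice `toy_measure` would be replaced by a function that runs the actual formant analysis on one vowel or one recording at the given ceiling, so no speaker sex needs to be supplied to choose the setting.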

Suraj Nair

Data Science Fellow 2023-2024
School of Information

I am a PhD student at the School of Information. My research interests lie at the intersection of development economics and machine learning, with a focus on the use of large-scale digital data and new computational tools to study pressing issues in global development.

Melike Sümertaş

Data Science Fellow 2023-2024
History

I hold a PhD in History from Boğaziçi University, Istanbul, and B.A. and M.A. degrees from Middle East Technical University in Ankara, Department of Architecture and Program in Architectural History. My research focuses on the urban/architectural/visual culture of the late Ottoman Empire and its capital city Istanbul, with a particular interest in the Greek-Orthodox community. My current project in the History Department of UC Berkeley, under the umbrella of the Istanpolis collaboration led by Prof. Christine Philliou, focuses on utilizing digital humanities tools for urban/...