Software Output Interpretation

Sahiba Chopra

Data Science Fellow 2024-2025
Haas School of Business

I'm a PhD student in the Management and Organizations (Macro) group at Berkeley Haas. I have a diverse professional background, primarily as a data scientist across numerous industries, including fintech, cleantech, and media. I hold a BA in Economics from the University of Maryland, an MS in Applied Economics from the University of San Francisco, and an MS in Business Administration from UC Berkeley.

My research focuses on the intersection of inequality, technology, and the labor market. I am particularly interested in understanding how to reduce inequality in...

Measuring Vowels Without Relying on Sex-Based Assumptions

April 8, 2025
by Amber Galvano. This tutorial builds on my previous post on Python for acoustic analysis, this time focusing on measuring vocal tract resonances without relying on sex-based assumptions. I demonstrate how to process audio files and vowel annotations using an adaptive method that optimizes the acoustic analysis across a recording. Instead of fixing parameters based on generalized vocal tract length correlations, this approach varies them within a defined range for greater accuracy. This not only enhances measurement precision but also avoids requiring (or assuming) speakers’ sex in data collection. Finally, I show how to filter for outliers and create high-quality vowel space visualizations.

María Martín López

Data Science Fellow 2023-2024
Psychology

María Martín López is a PhD student in the Cognition area within the Department of Psychology. Her research relates to cognitive computational and quantitative models of individual differences in behaviors, thoughts, and emotions. She is particularly interested in how we can create and leverage novel algorithms to understand, measure, and predict processes relating to externalizing psychopathology (e.g. impulsivity, aggression, substance use). She answers these questions using a range of computational and quantitive models including AI, NLP, SEM, time series analysis, multi-level...

Lauren Chambers

Consultant
School of Information

Lauren Chambers is a Ph.D. student at the Berkeley School of Information, where she studies the intersection of data, technology, and sociopolitical advocacy with Prof. Deirdre Mulligan. Previously Lauren was the staff technologist at the ACLU of Massachusetts, where she explored government data in order to inform citizens and lawmakers about the effects of legislation and political leadership on our civil liberties. Lauren received her Bachelor's from Yale in 2017, where she double-majored in astrophysics and African American studies, and she spent two years after graduation in...

Nicolas Nunez-Sahr

Consultant
Statistics

I lived in Santiago, Chile until I graduated from high school, and then moved to the US for undergrad at Stanford, where I obtained a Bachelor’s degree from the Statistics Department. I then worked as a Data Scientist in an NLP startup that was based in Bend, OR, which analyzed news articles. I love playing soccer, volleyball, table tennis, flute, guitar, latin music, and meeting new people. I want to get better at mountain biking, whitewater kayaking, chess and computer vision. I find nature astounding, and love finding sources of inspiration.

Python Data Processing Basics for Acoustic Analysis

November 12, 2024
by Amber Galvano. Interested in learning how to merge data and metadata from multiple sources into a consolidated dataset? Dealing with annotated audio and want to automate your workflow? Tried Praat scripting but want something more streamlined? This blog post will walk through some key domain-specific Python-based tools you will need in order to take your audio data, annotations, and speaker metadata and come away with a tabular dataset containing acoustic measures, ready to visualize and submit to statistical analysis. This tutorial uses acoustic phonetics data, but can be adapted to a range of projects involving repeated measures data and/or work with audio files.

Valeria Ramírez Castañeda

Data Science for Social Justice Fellow (2024-2025)
Integrative Biology

Valeria Ramírez Castañeda is a Colombian biologist currently pursuing a PhD in the Department of Integrative Biology at the University of California, Berkeley. I completed my undergraduate degree in Biology at the National University of Colombia and earned a master's degree in Ecology and Evolution, as well as another in Science Communication. During her PhD, she is studying the interactions between snakes and frogs and how this influences the evolution of toxin resistance in snakes. She is also collaborating and leading projects regarding the consequences of English in science and the...

Theo Snow

Availability: By appointment only

Consulting Areas: Python, R, SQL, SAS, Databases & SQL, Data Manipulation and Cleaning, Data Science, Data Visualization, Geospatial Data, Maps & Spatial Analysis, Machine Learning, Mixed Methods, Qualitative methods, Surveys, Sampling & Interviews, Regression Analysis, Means Tests, Software Output Interpretation, Other, Excel, Git or Github, RStudio, RStudio Cloud, SAS, Tableau

Anusha Bishop

Availability: By appointment only

Consulting Areas: Python, R, Cloud & HPC Computing, Data Sources, Data Visualization, Geospatial Data, Maps & Analysis, Machine Learning, Research Design, Cluster analysis, Experimental design, Hierarchical Models, High dimensional statistics, Means Tests, Nonparametric methods, Regression Analysis, Software Output Interpretation, Spatial statistics, Bash or Command Line, Excel, Git or Github, RStudio

Nikita Samarin

Instructor
Electrical Engineering and Computer Science (EECS)

Nikita Samarin is a doctoral student in Computer Science in the Department of Electrical Engineering and Computer Sciences (EECS) at the University of California, Berkeley advised by Serge Egelman and David Wagner. His research focuses on computer security and privacy from an interdisciplinary perspective, combining approaches from human-computer interaction, behavioral sciences, and legal studies. Samarin is a member of the Berkeley Lab for Usable and Experimental Security (BLUES) and an affiliated graduate researcher at the Center for Long-Term Cybersecurity (CLTC) and the...