Data Visualization

Why Data Disaggregation Matters: Exploring the Diversity of Asian American Economic Outcomes Using Public Use Microdata Sample (PUMS) Data

February 11, 2025
by Taesoo Song. Asian Americans are often overlooked in discussions of racial inequality due to their high average socioeconomic attainment. Many academic and policy researchers treat Asians as a single racial category in their analysis. However, this broad categorization can mask significant within-group disparities, leaving many disadvantaged individuals without access to vital resources and policy support. Song emphasizes the importance of data disaggregation in revealing Asian American inequalities, particularly in areas like income and homeownership, and demonstrates how breaking down these categories can lead to more targeted and effective policy solutions.

Measuring Vowels Without Relying on Sex-Based Assumptions

April 8, 2025
by Amber Galvano. This tutorial builds on my previous post on Python for acoustic analysis, this time focusing on measuring vocal tract resonances without relying on sex-based assumptions. I demonstrate how to process audio files and vowel annotations using an adaptive method that optimizes the acoustic analysis across a recording. Instead of fixing parameters based on generalized vocal tract length correlations, this approach varies them within a defined range for greater accuracy. This not only enhances measurement precision but also avoids requiring (or assuming) speakers’ sex in data collection. Finally, I show how to filter for outliers and create high-quality vowel space visualizations.

Melike Sümertaş

Data Science Fellow 2023-2024
History

I hold a PhD in History from Boğaziçi University, Istanbul and B.A and M.A degrees from Middle East Technical University in Ankara, Department of Architecture, and Program in Architectural History. My research focuses on the urban/architectural/visual culture of the late Ottoman Empire and its capital city Istanbul, with a particular interest in the Greek-Orthodox community. My current project in the History Department of UC Berkeley under the umbrella of the Istanpolis collaboration led by Prof. Christine Philliou, focuses on utilizing digital humanities tools for urban/...

Suraj Nair

Data Science Fellow 2023-2024
School of Information

I am a PhD Student at the School of Information. My research interests lie at the intersection of development economics and machine learning, with a focus on the use of large scale digital data and new computational tools to study pressing issues in global development.

María Martín López

Data Science Fellow 2023-2024
Psychology

María Martín López is a PhD student in the Cognition area within the Department of Psychology. Her research relates to cognitive computational and quantitative models of individual differences in behaviors, thoughts, and emotions. She is particularly interested in how we can create and leverage novel algorithms to understand, measure, and predict processes relating to externalizing psychopathology (e.g. impulsivity, aggression, substance use). She answers these questions using a range of computational and quantitive models including AI, NLP, SEM, time series analysis, multi-level...

Kamya Yadav

Senior Data Science Fellow 2024-2025, Data Science Fellow 2023-2024
Political Science

Kamya is a third year PhD student in the Department of Political Science. Using multimethod research, she studies gender, representation, and political parties in India to understand the barriers and pathways to women's political participation and representation. She has a BA in Politics from Princeton University.

Emma Turtelboom

Data Science Fellow 2023-2024
Astronomy

I am a PhD student in the Astronomy department, and I study planets outside our own solar system. I'm interested in learning how the properties of host stars affect planetary systems. In my free time, I love swimming, hiking, reading, and baking.

Andrea Lukas

UTech
Computer Science
Data Science

Hi everyone! I'm Andrea Lukas, a 3rd-year student majoring in Computer Science and Data Science at UC Berkeley. I'm passionate about UI/UX design and AI-centered human-computer interaction, and I'm actively involved in Computational Cognition research using Large Language Models (LLMs). As the Manager at D-Lab, I'm excited to contribute to the team by optimizing operations and fostering collaboration.

Outside of my academic and professional work, I’m an active member of Berkeley's Dance Community, where I participate in various teams. I also enjoy discovering new matcha spots and...

Lauren Chambers

Consultant
School of Information

Lauren Chambers is a Ph.D. student at the Berkeley School of Information, where she studies the intersection of data, technology, and sociopolitical advocacy with Prof. Deirdre Mulligan. Previously Lauren was the staff technologist at the ACLU of Massachusetts, where she explored government data in order to inform citizens and lawmakers about the effects of legislation and political leadership on our civil liberties. Lauren received her Bachelor's from Yale in 2017, where she double-majored in astrophysics and African American studies, and she spent two years after graduation in...

Jailynne Estevez

Consultant
Info & Data Science MIDS

Jailynne Estevez is a Data Analyst and a prospective Masters in Information and Data Science candidate at UC Berkeley. With a bachelor's in Public Policy, she brings a diverse skill set to her pursuits, demonstrating aptitude in data analysis and programming.