Data Sources

Decision-Making Under Pressure during My PhD: Lessons from whale songs and ocean noise

May 6, 2025

Jaewon Saw

by Jaewon Saw. This blog post shares a story from a field experiment using Distributed Acoustic Sensing (DAS) to detect whale vocalizations in Monterey Bay. Most of the data got overwhelmed by noise from boat engines, wave motion, and cable instability. On the final day, a spur-of-the-moment decision to add loops to the fiber optic cable dramatically improved signal quality.

Read more about Decision-Making Under Pressure during My PhD: Lessons from whale songs and ocean noise

Predicting the Future: Harnessing the Power of Probabilistic Judgements Through Forecasting Tournaments

April 29, 2025

Christian Caballero

by Christian Caballero. From the threat of nuclear war to rogue superintelligent AI to future pandemics and climate catastrophes, the world faces risks that are both urgent and deeply uncertain. These risks are where traditional data-driven models fall short—there’s often no historical precedent, no baseline data, and no clear way to simulate a future world. In cases like this, how can we anticipate the future? Forecasting tournaments offer one answer, harnessing the wisdom of crowds to generate probabilistic estimates of uncertain future events. By incentivizing accuracy through structured competition and deliberation, these tournaments have produced aggregate predictions of future events that outperform well-calibrated statistical models and teams of experts. As they continue to develop and expand into more domains, they also raise urgent questions about bias, access, and whose knowledge gets to shape our collective sensemaking of the future.

Read more about Predicting the Future: Harnessing the Power of Probabilistic Judgements Through Forecasting Tournaments

Frances Leung

Data Science Fellow 2021-2022

School of Information

Frances Leung is a master’s student at UC Berkeley School of Information where she focuses her studies in information and data science. She has a keen interest in leveraging data-driven insights to better understand consumer behaviors and the world around us. In her professional work as a management consultant, she advises retailers and consumer businesses on digital transformation and creating web/mobile experiences that delight consumers through a human-centered approach. Frances holds a Master in Business Administration from York University, Schulich School...

Read more about Frances Leung

Enrique Valencia López

Data Science Fellow 2022-2023

Graduate School of Education

Enrique Valencia López is a PhD student in the Policy, Politics and Leadership cluster at the Graduate School of Education.His research interests relate to three broad areas: the stratification of education by gender, immigration status and ethnicity; the measurement of teacher working conditions and well-being; and education in Latin America.

Before coming to Berkeley, Enrique worked for Mexico’s National Institute for Educational Evaluation and Assessment (INEE) in both the Policy and Indicators area. During that time, he co-authored Mexico’s first report on the educational...

Read more about Enrique Valencia López

Elijah Mercer

Data Science Fellow 2024-2025

School of Information

Elijah, originally from Newark, New Jersey, now resides in San Francisco, California, dedicated to social and juvenile justice. With a Criminology degree from American University, he began as a research intern at the Investigative Reporting Workshop, focusing on the Digital Divide.

Teaching in Baltimore with Teach for America reinforced his belief in research and data for marginalized communities. In roles at the Coalition Against Insurance Fraud, New York Police Department, and San Francisco District Attorney’s Office, Elijah used data to combat crime. Now...

Read more about Elijah Mercer

Why Data Disaggregation Matters: Exploring the Diversity of Asian American Economic Outcomes Using Public Use Microdata Sample (PUMS) Data

February 11, 2025

Taesoo Song

by Taesoo Song. Asian Americans are often overlooked in discussions of racial inequality due to their high average socioeconomic attainment. Many academic and policy researchers treat Asians as a single racial category in their analysis. However, this broad categorization can mask significant within-group disparities, leaving many disadvantaged individuals without access to vital resources and policy support. Song emphasizes the importance of data disaggregation in revealing Asian American inequalities, particularly in areas like income and homeownership, and demonstrates how breaking down these categories can lead to more targeted and effective policy solutions.

Read more about Why Data Disaggregation Matters: Exploring the Diversity of Asian American Economic Outcomes Using Public Use Microdata Sample (PUMS) Data

Field Experiments in Corporations

January 28, 2025

Yue Lin

by Yue Lin. How do social science researchers conduct field experiments with private actors? Yue Lin provides a brief overview of the recent developments in political economy and management strategy, with a focus on filing field experiments within private corporations. Unlike conventional targets like individuals and government agencies, private companies are an emergent sweet spot for scholars to test for important theories, such as sustainability, censorship, and market behavior. After comparing the strengths and weaknesses of this powerful yet nascent method, Lin brainstorms some practical solutions to improve the success rate of field experimental studies. She aims to introduce a new methodological tool in a nascent research field and shed some light on improving experimental quality while adhering to ethical standards.

Read more about Field Experiments in Corporations

Measuring Vowels Without Relying on Sex-Based Assumptions

April 8, 2025

Amber Galvano

by Amber Galvano. This tutorial builds on my previous post on Python for acoustic analysis, this time focusing on measuring vocal tract resonances without relying on sex-based assumptions. I demonstrate how to process audio files and vowel annotations using an adaptive method that optimizes the acoustic analysis across a recording. Instead of fixing parameters based on generalized vocal tract length correlations, this approach varies them within a defined range for greater accuracy. This not only enhances measurement precision but also avoids requiring (or assuming) speakers’ sex in data collection. Finally, I show how to filter for outliers and create high-quality vowel space visualizations.

Read more about Measuring Vowels Without Relying on Sex-Based Assumptions

Suraj Nair

Data Science Fellow 2023-2024

School of Information

I am a PhD Student at the School of Information. My research interests lie at the intersection of development economics and machine learning, with a focus on the use of large scale digital data and new computational tools to study pressing issues in global development.

Read more about Suraj Nair

Jailynne Estevez

Consultant

Info & Data Science MIDS

Jailynne Estevez is a Data Analyst and a prospective Masters in Information and Data Science candidate at UC Berkeley. With a bachelor's in Public Policy, she brings a diverse skill set to her pursuits, demonstrating aptitude in data analysis and programming.

Read more about Jailynne Estevez

1 of 12 View: Taxonomy term (Current page)
2 of 12 View: Taxonomy term
3 of 12 View: Taxonomy term
4 of 12 View: Taxonomy term
5 of 12 View: Taxonomy term
6 of 12 View: Taxonomy term
7 of 12 View: Taxonomy term
8 of 12 View: Taxonomy term
9 of 12 View: Taxonomy term
…
next › View: Taxonomy term
last » View: Taxonomy term