Research Project

Filtering, Visualizing, and Interpreting Spatial Time Series Data

December 17, 2025
by Maksymilian Jasiak. Spatial time series (consecutive measurements across space and time) are often difficult to interpret, especially when there are many overlapping signals. However, have no fear! Filtering and visualizing can help better interpret and understand the spatial time series data.

Seeing Behavior in Everyday Data

December 10, 2025
by Skyler Chen. This post discusses how my training in data science changed the way I think about behavioral research. I share how simply exploring everyday datasets and noticing small, unexpected patterns can spark new research questions, and how archival data and experiments each offer distinct yet complementary insights into how people make judgments and decisions. I also highlight the growing set of tools that help us understand behavior in richer ways.

A Practical Guide to Shift-Share Instruments (and What I Learned Replicating the China Shock)

November 26, 2025
by Jiayu Lai. Shift-share instruments are among the most widely used tools in applied economics, appearing in labor, trade, immigration, and policy evaluation research. But despite their popularity, many researchers still use them as black boxes — and risk invalid instruments as a result. In this blog post, I unpack how shift-share IVs actually work, why their validity depends on both the “shifts” and the “shares,” and what practical steps researchers should take to check assumptions. I also walk through how I used the Borusyak–Hull–Jaravel (2022, 2025) framework to reproduce the seminal Autor, Dorn, and Hanson (2013) China shock analysis.

A Participant-Centered, GIS-Based Approach to Improving Contextual Measurement

November 19, 2025
by Sarah Daniel. Researchers increasingly recognize that neighborhoods profoundly shape life outcomes, yet measuring them remains challenging. A common approach uses administrative boundaries, such as census tracts, as proxies for neighborhoods, but this method presents three key challenges. First, administrative boundaries may fail to capture residents’ lived experiences, a limitation that is particularly concerning in marginalized communities; second, they can misrepresent contextual effects; and third, they may produce inconsistent findings. To address these issues, I advocate for the use of self-defined neighborhood boundaries as an alternative measure. I compare GIS- and non-GIS-based methods and propose that GIS-based methods offer the strongest potential for more valid measurement.

Beyond the Hype: How We Built AI Tools That Actually Support Learning

November 12, 2025
by Weiying Li. What does genuine partnership look like when building AI for education? Working with middle school teachers and computer scientists, we co-designed AI dialogs where teachers are valuable contributors to refine what the AI understands as valuable thinking. Through iterative refinement, teachers identified precursor ideas and observations that predicted future learning, and refined guidance design in the dialog. Our AI dialog sees learning the way teachers do, built through genuine collaboration where both model development, learning sciences theories, and teachers' classroom expertise work together from the start, not just at the end.

How to Get Involved in Computing Research as a Undergrad at UC Berkeley

October 15, 2025
by Abby O'Neill. Are you an undergrad interested in getting involved in CS/DS research? This blog post gives some advice for navigating the Berkeley research landscape. It includes mentions of structured programs like DARE, URAP, and Data Science Discovery, as well as cold emailing strategies and using office hours effectively. The main takeaway: Know your why, don't filter yourself out, and focus on finding people and projects that align with your goals.

Decision-Making Under Pressure during My PhD: Lessons from whale songs and ocean noise

May 6, 2025
by Jaewon Saw. This blog post shares a story from a field experiment using Distributed Acoustic Sensing (DAS) to detect whale vocalizations in Monterey Bay. Most of the data got overwhelmed by noise from boat engines, wave motion, and cable instability. On the final day, a spur-of-the-moment decision to add loops to the fiber optic cable dramatically improved signal quality.

Field Experiments in Corporations

January 28, 2025
by Yue Lin. How do social science researchers conduct field experiments with private actors? Yue Lin provides a brief overview of the recent developments in political economy and management strategy, with a focus on filing field experiments within private corporations. Unlike conventional targets like individuals and government agencies, private companies are an emergent sweet spot for scholars to test for important theories, such as sustainability, censorship, and market behavior. After comparing the strengths and weaknesses of this powerful yet nascent method, Lin brainstorms some practical solutions to improve the success rate of field experimental studies. She aims to introduce a new methodological tool in a nascent research field and shed some light on improving experimental quality while adhering to ethical standards.

Why Data Disaggregation Matters: Exploring the Diversity of Asian American Economic Outcomes Using Public Use Microdata Sample (PUMS) Data

February 11, 2025
by Taesoo Song. Asian Americans are often overlooked in discussions of racial inequality due to their high average socioeconomic attainment. Many academic and policy researchers treat Asians as a single racial category in their analysis. However, this broad categorization can mask significant within-group disparities, leaving many disadvantaged individuals without access to vital resources and policy support. Song emphasizes the importance of data disaggregation in revealing Asian American inequalities, particularly in areas like income and homeownership, and demonstrates how breaking down these categories can lead to more targeted and effective policy solutions.

Measuring Vowels Without Relying on Sex-Based Assumptions

April 8, 2025
by Amber Galvano. This tutorial builds on my previous post on Python for acoustic analysis, this time focusing on measuring vocal tract resonances without relying on sex-based assumptions. I demonstrate how to process audio files and vowel annotations using an adaptive method that optimizes the acoustic analysis across a recording. Instead of fixing parameters based on generalized vocal tract length correlations, this approach varies them within a defined range for greater accuracy. This not only enhances measurement precision but also avoids requiring (or assuming) speakers’ sex in data collection. Finally, I show how to filter for outliers and create high-quality vowel space visualizations.