Research Project

Exploring Rental Affordability in the San Francisco Bay Area Neighborhoods with R

November 5, 2024
by Taesoo Song. Many American cities continue to face severe rental burdens. However, we rarely examine rental affordability through the lens of quantitative data. In this blog post, I demonstrate how to download and visualize rental affordability data for the San Francisco Bay Area using R packages like `tidycensus` and `sf`. This exercise shows that mapping census data can be a straightforward and powerful way to understand the spatial patterns of housing dynamics and can offer valuable insights for research, policy, and advocacy.

Human-Centered Design for Migrant Rights

October 29, 2024
by Victoria Hollingshead. In honor of the 2024 International Day of Care and Support, Victoria Hollingshead shares her recent work with the Center for Migrant Advocacy’s Direct Assistance Program and their innovative approach to supporting Overseas Filipino Workers (OFWs) using generative AI. OFWs, especially female domestic workers in the Gulf Cooperation Council (GCC), are vulnerable to exploitation from foreign employers and recruitment agencies while having limited access to legal support. Using a design thinking framework, Victoria and CMA’s Direct Assistance team co-designed a proof of concept to enhance the legal and contract literacy among OFWs in the Kingdom of Saudi Arabia, a top destination country. This project shows promise in leveraging emerging technologies to empower OFWs, enhancing the Philippines' reputation as a migrant champion and supporting the nation's broader push for digital transformation.

Claudia von Vacano, Ph.D.

Founding Executive Director, P.I., Research Director, FSRDC

Dr. Claudia von Vacano is the Founding Executive Director and Senior Research Associate of D-Lab and Digital Humanities at Berkeley and is on the boards of the Social Science Matrix and Berkeley Center for New Media. She has worked in policy and educational administration since 2000, and at the UC Office of the President and UC Berkeley since 2008. She received a Master’s degree from Stanford University in Learning, Design, and Technology. Her doctorate is in Policy, Organizations, Measurement, and Evaluation from UC Berkeley. Her expertise is in organizational theory and...

Concepts and Measurements in Social Network Analysis

October 22, 2024
by Christian Caballero. We live in an interconnected world, more so now than ever. Social Network Analysis (SNA) provides a toolkit to study the influence of this interconnectivity. This blog post introduces some key theoretical concepts behind SNA, as well as a family of metrics for measuring influence in a network, known as centrality. These concepts and measurements help form the basis for a theoretically informed study of social relationships in an era where the availability of relational data has dramatically increased thanks to technological advances.

Institutional Review Board (IRB) Fundamentals

October 17, 2024, 3:00pm
Are you starting a research project at UC Berkeley that involves human subjects? If so, one of the first steps you will need to take is getting IRB approval.

Tom van Nuenen, Ph.D.

Data/Research Scientist, Senior Consultant, and Senior Instructor
D-Lab
Social Sciences
Digital Humanities

I work as a Lecturer, Data Scientist, and Senior Consultant at UC Berkeley's D-Lab. I lead the curriculum design for D-Lab’s data science workshop portfolio, as well as the Digital Humanities Summer Program at Berkeley.

Former research projects include a Research Associate position in the ‘Discovering and Attesting Digital Discrimination’ project at King’s College London (2019-2022) and a researcher-in-residence role for the UK’s National Research Centre on Privacy, Harm Reduction, and Adversarial Influence Online (2022). My research uses Natural Language Processing methods to
...

Leveraging Large Language Models for Analyzing Judicial Disparities in China

October 8, 2024
by Nanqin Ying. This study analyzes over 50 million judicial decisions from China’s Supreme People’s Court to examine disparities in legal representation and their impact on sentencing across provinces. Focusing on 290 000 drug-related cases, it employs large language models to differentiate between private attorneys and public defenders and assess their sentencing outcomes. The methodology combines advanced text processing with statistical analysis, using clustering to categorize cases by province and representation, and regression models to isolate the effect of legal representation from factors like drug quantity and regional policies. Findings reveal significant regional disparities in legal access driven by economic conditions, highlighting the need for reforms in China’s legal aid system to ensure equitable representation for marginalized groups and promote transparent judicial data for systemic improvements.

Causal Thinking in Thermal Comfort

September 17, 2024
by Ruiji Sun. We demonstrate the importance of causal thinking by comparing two linear regression approaches used in thermal comfort research: Approach (a), which regresses thermal sensation votes (y-axis) on indoor temperature (x-axis); Approach (b), which does the reverse, regressing indoor temperature (y-axis) on thermal sensation votes (x-axis). From a correlational perspective, they may appear interchangeable, but causal thinking reveals substantial and practical differences between them. Using the same data, we found Approach (b) leads to a 10 °C narrower than the conventionally derived comfort zone using Approach (a). This finding has important implications for occupant comfort and building energy efficiency. We highlight the importance of integrating causal thinking into correlation-based statistical methods, especially given the increasing volume of data in the built environment.

Data for a Just U.S. - Using Data Science to Empower Marginalized Communities

September 3, 2024
by Elijah Mercer. In this blog post, I share how working with marginalized communities through data science has transformed my understanding of the field. My journey from crime analysis to founding Data for Just US reveals the profound impact data can have when used to empower and uplift underserved populations. I explore the challenges and rewards of this work, illustrating how data science can drive social change and foster a more equitable future.

Minding the Gaps: Pay Equity in California

July 9, 2024
by Tonya D. Lindsey, Ph.D. The gender pay gap continues to reflect that, on average, men outearn women. California is among the states with the smallest pay gaps (outpacing the national number at 13%) and is unique in that it enacted legislation aimed at eliminating pay gaps by sex and race categories. This blog post reflects on California’s pay gap as students study it in an undergraduate social statistics course. Independent variables indicate three theoretical frameworks: 1) human capital, 2) occupational segregation, and 3) discrimination. While the work students do is rigorous using a representative sample of full-time year-round California workers, there remains work to be done and caveats to the data and analyses.