DSAI Cohorts

Beyond the Hype: How We Built AI Tools That Actually Support Learning

November 12, 2025
by Weiying Li. What does genuine partnership look like when building AI for education? Working with middle school teachers and computer scientists, we co-designed AI dialogs where teachers are valuable contributors to refine what the AI understands as valuable thinking. Through iterative refinement, teachers identified precursor ideas and observations that predicted future learning, and refined guidance design in the dialog. Our AI dialog sees learning the way teachers do, built through genuine collaboration where both model development, learning sciences theories, and teachers' classroom expertise work together from the start, not just at the end.

In Silico Approach to Mining Viral Sequences from Bulk RNA-Seq Data

October 28, 2025
by Carly Karrick. Viruses play important roles in evolution and influence ecosystems and host health. However, isolating and studying them can be difficult. In lieu of using resource-intensive methods to concentrate viruses into a “virome,” bulk sequencing methods include data from all biological entities present in a sample. In this tutorial, we explore an approach to mine viral sequences from publicly available bulk RNA-Seq data. The output from this analysis paves the way for future statistical analyses comparing viral communities in different contexts. This approach can be applied to other datasets, including studies of human health.

Forecasting Social Outcomes with Deep Neural Networks

October 7, 2025
by Paige Park. Our capacity to accurately predict social outcomes is increasing. Deep neural networks and artificial intelligence are crucial technologies pushing this progress along. As these tools reshape how social prediction is done, social scientists should feel comfortable engaging with them and meaningfully contributing to the conversation. But many social scientists are still unfamiliar with and sometimes even skeptical of deep learning. This tutorial is designed to help close that knowledge gap. We’ll walk step-by-step through training a simple neural network for a social prediction task: forecasting population-level mortality rates.

Why I Don’t Call Myself a Data Scientist: A Researcher's Journey

October 1, 2025
by Jose Aguilar. I reflect on my uneasy relationship with being called a data scientist. Despite training in computer science and utilizing computational tools in education policy, I struggle with how data science often strips away human narratives and reinforces existing inequities. My identity as a first-generation, queer, Latinx scholar deepens these tensions, prompting me to explore frameworks such as QuantCrit and critical data science. Ultimately, I utilize research that bridges computation and critique, advocating for more human-centered, politically aware approaches to data that integrate lived experiences alongside data findings.

Decision-Making Under Pressure during My PhD: Lessons from whale songs and ocean noise

May 6, 2025
by Jaewon Saw. This blog post shares a story from a field experiment using Distributed Acoustic Sensing (DAS) to detect whale vocalizations in Monterey Bay. Most of the data got overwhelmed by noise from boat engines, wave motion, and cable instability. On the final day, a spur-of-the-moment decision to add loops to the fiber optic cable dramatically improved signal quality.

Predicting the Future: Harnessing the Power of Probabilistic Judgements Through Forecasting Tournaments

April 29, 2025
by Christian Caballero. From the threat of nuclear war to rogue superintelligent AI to future pandemics and climate catastrophes, the world faces risks that are both urgent and deeply uncertain. These risks are where traditional data-driven models fall short—there’s often no historical precedent, no baseline data, and no clear way to simulate a future world. In cases like this, how can we anticipate the future? Forecasting tournaments offer one answer, harnessing the wisdom of crowds to generate probabilistic estimates of uncertain future events. By incentivizing accuracy through structured competition and deliberation, these tournaments have produced aggregate predictions of future events that outperform well-calibrated statistical models and teams of experts. As they continue to develop and expand into more domains, they also raise urgent questions about bias, access, and whose knowledge gets to shape our collective sensemaking of the future.