Text Analysis

Farnam Mohebi

Data Science Fellow
Haas School of Business

I am a PhD student at the Haas School of Business, University of California, Berkeley, and a researcher in the Department of Radiation Oncology at the University of California, San Francisco, having previously earned my MD and MPH degrees. My research focuses on the intersection of professionals and emerging technologies, drawing from the fields of medical sociology, organizational theory, and science and technology studies. I am particularly fascinated by the evolving relationship between physicians and artificial intelligence, the phenomenon of physician influencers, and the social...

Christian Caballero

Data Science Fellow 2024-2025
Political Science

Christian Caballero is a Political Science PhD student at the University of California, Berkeley. His research focuses on American politics and political behavior. In particular, he studies the ways in which social networks influence processes of political persuasion and democratic deliberation, as well as how political ideologies develop within subcultures.

He holds a B.A. in Politics and Sociology from New York University and an M.A. in Political Science from the University of California, Berkeley.

Jane (Mango) Angar

Data Science Fellow 2024-2025
Political Science

Hi! I am a PhD candidate in the Political Science Department at UC Berkeley. My dissertation traces the emergence of disability rights groups in Africa, focusing on Zambia and Malawi, and examines factors influencing their effectiveness. I use mixed methods, including archival work, field interviews, participant observation, and surveys for data collection.

My data analysis techniques include text analysis, social network analysis, means tests, and regressions. In my free time, I enjoy moderately difficult hikes, walks along the beach with my dog, Princess, and...

Sahiba Chopra

Data Science Fellow 2024-2025
Haas

I'm a PhD student in the Management and Organizations (Macro) group at Berkeley Haas. I have a diverse professional background, primarily as a data scientist across numerous industries, including fintech, cleantech, and media. I hold a BA in Economics from the University of Maryland, an MS in Applied Economics from the University of San Francisco, and an MS in Business Administration from UC Berkeley.

My research focuses on the intersection of inequality, technology, and the labor market. I am particularly interested in understanding how to reduce inequality in...

Mingyu Yuan

Data Science for Social Justice Senior Fellow 2024
Linguistics

I am a Ph.D. candidate in Linguistics, with a focus on phonetics and phonology, specifically speech production in neuro-atypical populations. I use methods from Natural Language Processing in my day-to-day research.

Stephanie Andrews

Data Science for Social Justice Senior Fellow 2024
Info & Data Science MIDS

Stephanie Andrews is currently studying data science in the MIDS program, having previously majored in Social Welfare as an undergraduate at Cal. After graduating, she worked as an advocate for survivors of gender-based violence, as a public policy analyst focusing on anti-trafficking initiatives, and as a software engineer for progressive and social impact organizations. She is now conducting research with the Human Rights Center's Investigations Lab, using OSINT and data science methods to investigate human rights violations.

Hellina Hailu Nigatu

Data Science for Social Justice Senior Fellow 2024
Electrical Engineering and Computer Science (EECS)

I am a PhD student at UC Berkeley in the EECS department, co-advised by Prof. Sarah Chasins and Prof. John Canny. My research interests lie broadly at the intersection of AI and HCI, with a focus on making usable AI tools accessible to end users.

I am currently looking into making NLP tools usable and accessible for low-resourced languages. I am also interested in the impact of AI on society, specifically in how it affects Global Majority countries and communities. Outside of research, I like to read books, make and drink traditional Ethiopian coffee, knit,...

Python Text Analysis: Topic Modeling

April 4, 2024, 10:00am
In this part, we study unsupervised learning of text data. This is a standalone workshop that builds on the two-part text analysis series.

Python Text Analysis: Topic Modeling

April 13, 2022, 3:00pm
In this part, we study unsupervised learning of text data. This is a standalone workshop that builds on the two-part text analysis series.

Python Text Analysis Fundamentals: Parts 1-3

September 21, 2021, 10:00am
This three-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.