Natural Language Processing (NLP)

Gaby May Lagunes

Consultant
ESPM

Hello! I’m Gaby (she/her). I am PhD student at the ESPM department, I hold a masters in Data Science and Information from the Berkeley ISchool and I have 5+ years of industrial experience in different data roles. Before that I got a masters in Engineering for International Development and an undergraduate degree in Physics from University College London. And somewhere between all that I got married, survived the pandemic, and had two awesome boys. I’m very excited to help you use data to enhance your work and your experience here at Berkeley!

Nicolas Nunez-Sahr

Consultant
Statistics

I lived in Santiago, Chile until I graduated from high school, and then moved to the US for undergrad at Stanford, where I obtained a Bachelor’s degree from the Statistics Department. I then worked as a Data Scientist in an NLP startup that was based in Bend, OR, which analyzed news articles. I love playing soccer, volleyball, table tennis, flute, guitar, latin music, and meeting new people. I want to get better at mountain biking, whitewater kayaking, chess and computer vision. I find nature astounding, and love finding sources of inspiration.

Python Text Analysis: Parts 1-3

March 17, 2025, 2:00pm
This three-part workshop will prepare participants to move forward with research that uses text analysis, with a special focus on social science applications. We explore fundamental approaches to applying computational methods to text in Python. We cover some of the major packages used in natural language processing, including scikit-learn, NLTK, spaCy, and Gensim.

Language Models in Mental Health Conversations – How Empathetic Are They Really?

December 3, 2024
by Sohail Khan. Language models are becoming integral to daily life as trusted sources of advice. While their utility has expanded from simple tasks like text summarization to more complex interactions, the empathetic quality of their responses is crucial. This article explores methods to assess the emotional appropriateness of these models, using metrics such as BLEU, ROUGE, and Sentence Transformers. By analyzing models like LLaMA in mental health dialogues, we learn that while they suffer through traditional word-based metrics, LLaMA's performance in capturing empathy through semantic similarity is promising. In addition, we must advocate for continuous monitoring to ensure these models support their users' mental well-being effectively.

Stephanie Andrews

Availability: By appointment only

Consulting Areas: Python, SQL, HTML / CSS, Javascript, APIs, Databases & SQL, Data Manipulation and Cleaning, Data Science, Data Sources, Data Visualization, Digital Humanities, Machine Learning, Natural Language Processing, Software Tools, Text Analysis, Web Scraping, Bash or Command Line, Excel, Git or Github, Tableau

Stephanie Andrews

Consultant
Info & Data Science MIDS

Stephanie Andrews is currently studying data science in the MIDS program, having previously majored in Social Welfare as an undergraduate at Cal. After graduating, she worked as an advocate for survivors of gender-based violence, as a public policy analyst focusing on anti-trafficking initiatives, and as a software engineer for progressive and social impact organizations. She is now conducting research with the Human Rights Center's Investigations Lab, using OSINT and data science methods to investigate human rights violations.

Amanda Glazer

Instructor
Statistics

Amanda is a PhD candidate in the statistics department at Berkeley. Her research focuses on causal inference with applications in education, political science and sports. Previously she earned her Bachelor’s degree in mathematics and statistics, with a secondary in computer science, from Harvard.

Chirag Manghani

Consultant
School of Information

Chirag is a 2nd year graduate at the I-School. Proficient in Python, Java, R, and SQL, he navigates software application development, machine learning and data science. His keen interest lies in data analysis and statistical methods, driving him to bridge theory and practice seamlessly. Chirag's dedication to excellence, adaptable mindset, and innate curiosity define him as a dynamic problem solver in the ever-evolving tech landscape.

Hellina Hailu Nigatu

Data Science for Social Justice Senior Fellow 2024
Electrical Engineering and Computer Science (EECS)

I am a PhD student at UC Berkeley in the EECS department co-advised by Prof. Sarah Chasins and Prof. John Canny. My research interest broadly lies in the intersection of AI and HCI, with a focus on making usable AI tools accessible to end users.

I am currently looking into making NLP tools usable and accessible for low-resourced languages. I am also interested in the impact of AI on society, specifically in how it affects Global Majority countries and communities. Outside of research, I like to read books, make and drink traditional Ethiopian coffee, knit,...

Python Text Analysis Fundamentals: Parts 1-2

March 8, 2023, 2:00pm
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.