Natural Language Processing (NLP)

Nicolas Nunez-Sahr

Consulting Drop-In Hours: By appointment only

Consulting Areas: Python, R, SQL, C++, APIs, Databases & SQL, Data Manipulation and Cleaning, Data Science, Data Visualization, Deep Learning, Machine Learning, Natural Language Processing, Python Programming, R Programming, Text Analysis, Regression Analysis, Software Output Interpretation, Bash or Command Line, Git or Github, RStudio, Google Cloud, PostgreSQL, Python Django

Quick-tip: the fastest way to speak to a consultant is to first ...

Sanjana Gajendran

Consulting Drop-In Hours: Thu 3pm-5pm

Consulting Areas: Python, SQL, Data Science, Machine Learning, Natural Language Processing, Text Analysis, Git or Github

Quick-tip: the fastest way to speak to a consultant is to first submit a request and then ...

Addison Pickrell

IUSE Undergraduate Advisory Board
Mathematics
Sociology

Addison is an aspiring mathematician and social scientist (Class of '27). He loves collecting books he'll never read, is an open-source and open-access advocate, and an aspiring community organizer and systems disrupter. Ask me about community-based participatory action research (CBPAR), critical pedagogy, applied mathematics, and social science.

María Martín López

Data Science Fellow
Psychology

María Martín López is a PhD student in the Cognition area within the Department of Psychology. Her research relates to cognitive computational and quantitative models of individual differences in behaviors, thoughts, and emotions. She is particularly interested in how we can create and leverage novel algorithms to understand, measure, and predict processes relating to externalizing psychopathology (e.g. impulsivity, aggression, substance use). She answers these questions using a range of computational and quantitive models including AI, NLP, SEM, time series analysis, multi-level...

Python Text Analysis: Topic Modeling

October 16, 2023, 2:00pm
In this part, we study unsupervised learning of text data. This is a stand alone work that builds from the two-part text analysis series.

Python Text Analysis: Word Embeddings

October 25, 2023, 2:00pm
How can we use neural networks to create meaningful representations of words? The bag-of-words is limited in its ability to characterize text, because it does not utilize word context.

D-Lab & Graduate Division create inclusive data science summer program

August 9, 2023
by Vanessa Navarro Rodriguez. UC Berkeley's Social Sciences D-Lab and Graduate Division created the Data Science for Social Justice Program to address underrepresentation in data science. The program teaches diverse students critical data analysis and its applications in addressing societal injustices. The 8-week free summer course for admitted University of California students focuses on Python programming, Natural Language Processing, and value-informed data practices. It aims to empower students from underrepresented backgrounds and to bridge STEM with social justice. This blog post elaborates on the program's creation and features one of the DSSJ students, Robin López, and his reasons for participating.

Unlock the Joy and Power of Reading in Language Learning

August 21, 2023
by Bowen Wang-Kildegaard. I share my story of how reading for pleasure transformed my English speaking and writing skills. This experience inspired my passion to promote the joy and power of reading to all language learners. Using natural language processing techniques, I dive into the Language Learning subreddit, revealing a trend: Learners are often highly anxious about output practices, but are generally positive about input methods like reading and listening. I then distill complex language learning theories into actionable language learning tips, emphasizing the value of extensive reading for pleasure, pointing to potential methods like using ChatGPT for customization of reading materials, and advocating for joy in the learning journey.

My Summer Exploring Data Science for Social Justice: Learnings, Tensions & Recommendations

September 5, 2023
by Genevieve Smith. This summer I joined the D-Lab hosted Data Science for Social Justice workshop at UC Berkeley diving into Python – including TF-IDF, sentiment analysis, word embeddings, and more – with a lens towards leveraging data science for social justice. My team explored a Reddit channel on abortion and used computational analysis to answer key questions related to abortion access from before versus after Roe vs. Wade was overturned. Computational social science is incredibly powerful, but I continue to grapple with tensions particularly as it relates to employing machine learning and large language in international research, and end with key recommendations for CSS practitioners.

Claudia von Vacano, Ph.D.

Consulting Drop-In Hours: By appointment only

Consulting Areas: Python, R,Digital Humanities, Mixed Methods, Natural Language Processing, Qualitative methods, Social Network Analysis, Surveys, Sampling & Interviews, Text Analysis,MaxQDA, Qualtrics

Quick-tip: the fastest way to speak to a consultant is to first ...