Natural Language Processing (NLP)

Python Text Analysis: Word Embeddings

October 25, 2023, 2:00pm
How can we use neural networks to create meaningful representations of words? The bag-of-words is limited in its ability to characterize text, because it does not utilize word context.

D-Lab & Graduate Division create inclusive data science summer program

August 9, 2023
by Vanessa Navarro Rodriguez. UC Berkeley's Social Sciences D-Lab and Graduate Division created the Data Science for Social Justice Program to address underrepresentation in data science. The program teaches diverse students critical data analysis and its applications in addressing societal injustices. The 8-week free summer course for admitted University of California students focuses on Python programming, Natural Language Processing, and value-informed data practices. It aims to empower students from underrepresented backgrounds and to bridge STEM with social justice. This blog post elaborates on the program's creation and features one of the DSSJ students, Robin López, and his reasons for participating.

Unlock the Joy and Power of Reading in Language Learning

August 21, 2023
by Bowen Wang-Kildegaard. I share my story of how reading for pleasure transformed my English speaking and writing skills. This experience inspired my passion to promote the joy and power of reading to all language learners. Using natural language processing techniques, I dive into the Language Learning subreddit, revealing a trend: Learners are often highly anxious about output practices, but are generally positive about input methods like reading and listening. I then distill complex language learning theories into actionable language learning tips, emphasizing the value of extensive reading for pleasure, pointing to potential methods like using ChatGPT for customization of reading materials, and advocating for joy in the learning journey.

My Summer Exploring Data Science for Social Justice: Learnings, Tensions & Recommendations

September 5, 2023
by Genevieve Smith. This summer I joined the D-Lab hosted Data Science for Social Justice workshop at UC Berkeley diving into Python – including TF-IDF, sentiment analysis, word embeddings, and more – with a lens towards leveraging data science for social justice. My team explored a Reddit channel on abortion and used computational analysis to answer key questions related to abortion access from before versus after Roe vs. Wade was overturned. Computational social science is incredibly powerful, but I continue to grapple with tensions particularly as it relates to employing machine learning and large language in international research, and end with key recommendations for CSS practitioners.

Claudia von Vacano, Ph.D.

Consulting Drop-In Hours: By appointment only

Consulting Areas: Python, R,Digital Humanities, Mixed Methods, Natural Language Processing, Qualitative methods, Social Network Analysis, Surveys, Sampling & Interviews, Text Analysis,MaxQDA, Qualtrics

Quick-tip: the fastest way to speak to a consultant is to first ...

Nicolas Nunez-Sahr

Consultant
Statistics

I lived in Santiago, Chile until I graduated from high school, and then moved to the US for undergrad at Stanford, where I obtained a Bachelor’s degree from the Statistics Department. I then worked as a Data Scientist in an NLP startup that was based in Bend, OR, which analyzed news articles. I love playing soccer, volleyball, table tennis, flute, guitar, latin music, and meeting new people. I want to get better at mountain biking, whitewater kayaking, chess and computer vision. I find nature astounding, and love finding sources of inspiration.

Chirag Manghani

Consultant
School of Information

Chirag is a 2nd year graduate at the I-School. Proficient in Python, Java, R, and SQL, he navigates software application development, machine learning and data science. His keen interest lies in data analysis and statistical methods, driving him to bridge theory and practice seamlessly. Chirag's dedication to excellence, adaptable mindset, and innate curiosity define him as a dynamic problem solver in the ever-evolving tech landscape.

Gaby May Lagunes

Consultant
ESPM

Hello! I’m Gaby (she/her). I am PhD student at the ESPM department, I hold a masters in Data Science and Information from the Berkeley ISchool and I have 5+ years of industrial experience in different data roles. Before that I got a masters in Engineering for International Development and an undergraduate degree in Physics from University College London. And somewhere between all that I got married, survived the pandemic, and had two awesome boys. I’m very excited to help you use data to enhance your work and your experience here at Berkeley!

Python Text Analysis Fundamentals: Parts 1-2

September 25, 2023, 2:00pm
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.

Sanjana Gajendran

Consultant
MIMS

I'm a second year MIMS Student with a focus on Data Science and Natural Language Processing. During the Summer 2023, I interned at Genentech as a Data Science Intern.