Text Analysis

Mingyu Yuan

Data Science for Social Justice Senior Fellow 2024
Linguistics

I am a Ph.D. candidate in Linguistics, with a focus on phonetics and phonology, specifically speech production in neuro-atypical populations. I use methods from Natural Language Processing in my day-to-day research.

Hellina Hailu Nigatu

Data Science for Social Justice Senior Fellow 2024
Electrical Engineering and Computer Science (EECS)

I am a PhD student at UC Berkeley in the EECS department co-advised by Prof. Sarah Chasins and Prof. John Canny. My research interest broadly lies in the intersection of AI and HCI, with a focus on making usable AI tools accessible to end users.

I am currently looking into making NLP tools usable and accessible for low-resourced languages. I am also interested in the impact of AI on society, specifically in how it affects Global Majority countries and communities. Outside of research, I like to read books, make and drink traditional Ethiopian coffee, knit,...

Python Text Analysis: Topic Modeling

April 13, 2022, 3:00pm
In this part, we study unsupervised learning of text data. This is a stand alone work that builds from the two-part text analysis series.

Python Text Analysis Fundamentals: Parts 1-3

September 21, 2021, 10:00am
This three-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.

Python Text Analysis Fundamentals: Parts 1-2

March 8, 2023, 2:00pm
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.

Python Text Analysis Fundamentals: Parts 1-2

March 28, 2022, 3:00pm
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.

CANCELED: Python Text Analysis: Word Embeddings

November 17, 2022, 12:00pm
How can we use neural networks to create meaningful representations of words? The bag-of-words is limited in its ability to characterize text, because it does not utilize word context.

Python Text Analysis Fundamentals: Parts 1-2

September 25, 2023, 2:00pm
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.

Python Text Analysis Fundamentals: Parts 1-2

June 20, 2023, 9:00am
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.

Python Text Analysis: Topic Modeling

April 4, 2024, 10:00am
In this part, we study unsupervised learning of text data. This is a stand alone work that builds from the two-part text analysis series.