Text Analysis

Abhishek Roy

IUSE Undergraduate Advisory Board
Economics
Data Science

I'm Abhishek Roy and I'm double majoring in Economics and Data Science. I've been a part of D-Lab's IUSE project since Spring 2020 and have truly found an organization that is not only passionate about Data Science but also strives to expand its reach equitably to all communities. I am involved in Research and Project Management roles in various departments and labs at Berkeley and I'm an Editor at the Berkeley Economic Review. I love diving into anything at the intersection of Data Science, Economics, Business, and Computational Social Science. Whenever I'm free, I love writing...

Frances Leung

Data Science Fellow
School of Information

Frances Leung is a master’s student at UC Berkeley School of Information where she focuses her studies in information and data science. She has a keen interest in leveraging data-driven insights to better understand consumer behaviors and the world around us. In her professional work as a management consultant, she advises retailers and consumer businesses on digital transformation and creating web/mobile experiences that delight consumers through a human-centered approach. Frances holds a Master in Business Administration from York University, Schulich School...

Marina Blum

Data Science Fellow
School of Public Health

Marina is a master's student in the Health and Social Behavior division of the School of Public Health. She has extensive experience in ATLAS.ti and can help you get the most out of the program. She is passionate about data visualization, and is happy to help with related questions and questions on qualitative methods.

Racism Narratives in Medical Literature

Systemic racism is a driving factor in unequal health outcomes, but it is rarely the subject of study in top medical journals (see a 2021 analysis by Krieger et al.). This project, a collaboration between the UC Berkeley D-Lab and the American Medical Association's Center for Health Equity, aims to measure progress in acknowledging, studying, & dismantling racism by creating tools to track racism-related narratives in influential medical research.

Erin Manalo-Pedro

Research Fellow
Community Health Sciences (UCLA)

Erin Manalo-Pedro is a Ph.D. student in the Department of Community Health Sciences at the UCLA Fielding School of Public Health with a minor in education. She focuses her racial health equity research on curriculum, the health workforce, and political interventions for communities of color. Drawing from Public Health Critical Race Praxis and Pinayism, she aims to use methods, like natural language processing and counter storytelling, to document the subtleties of structural racism and resistance from marginalized groups.

To guide her interdisciplinary approach, Erin leverages
...

Python Text Analysis Fundamentals: Parts 1-2

June 14, 2022, 1:00pm
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.
See event details for participation information.

Aaron Culich

Deputy Director of D-Lab; Cyberinfrastructure Architect and Consulting Lead

Aaron Culich is a staff member at the D-Lab with expertise in Cloud Computing, High Performance Computing (HPC), Databases (SQL and NoSQL), JupyterHub and BinderHub infrastructure, and a variety of programming languages (Python, R, Java, C, C++, and more). His ongoing mission is to explore new compute possibilities, discovering useful tools and practices, and making them more accessible to researchers on campus and beyond.

Katherine Wolf

Adjunct Fellow
Environmental Science, Policy, and Management

Doctoral student in Rachel Morello-Frosch's laboratory in the Department of Environmental Science, Policy, and Management working at the intersection of environmental epidemiology, environmental justice, and causal inference. Particularly interested in developing quantitative methods to investigate the operation of social power in environmental monitoring regimes in the United States.

Python Text Analysis: Topic Modeling

April 13, 2022, 3:00pm
In this part, we study unsupervised learning of text data. This is a stand alone work that builds from the two-part text analysis series.

Python Text Analysis: Word Embeddings

April 6, 2022, 3:00pm
How can we use neural networks to create meaningful representations of words? The bag-of-words is limited in its ability to characterize text, because it does not utilize word context.