Text Analysis

Aaron Culich

Former Deputy Director of D-Lab; Cyberinfrastructure Architect and Consulting Lead

Aaron Culich is a staff member at the D-Lab with expertise in Cloud Computing, High Performance Computing (HPC), Databases (SQL and NoSQL), JupyterHub and BinderHub infrastructure, and a variety of programming languages (Python, R, Java, C, C++, and more). His ongoing mission is to explore new compute possibilities, discover useful tools and practices, and make them more accessible to researchers on campus and beyond.

Sahiba Chopra

Data Science Fellow 2024-2025
Haas School of Business

I'm a PhD student in the Management and Organizations (Macro) group at Berkeley Haas. I have a diverse professional background, primarily as a data scientist across numerous industries, including fintech, cleantech, and media. I hold a BA in Economics from the University of Maryland, an MS in Applied Economics from the University of San Francisco, and an MS in Business Administration from UC Berkeley.

My research focuses on the intersection of inequality, technology, and the labor market. I am particularly interested in understanding how to reduce inequality in...

Lance Santana

Consulting Drop-In Hours: By appointment only

Consulting Areas: APIs, ArcGIS Desktop - Online or Pro, Bayesian Methods, Cluster Analysis, Data Visualization, Databases and SQL, Excel, Git or GitHub, Java, Machine Learning, Means Tests, Natural Language Processing (NLP), Python, Qualtrics, R, Regression Analysis, Research Planning, RStudio, Software Output Interpretation, SQL, Survey Design, Survey Sampling, Tableau, Text Analysis

Quick-tip: the fastest way to speak to a consultant is to first ...

Carl Illustrisimo

Consulting Drop-In Hours: By appointment only

Consulting Areas: Bash or Command Line, Cluster Analysis, Data Sources, Data Visualization, Digital Humanities, Excel, Git or GitHub, Javascript, LaTeX, Machine Learning, Natural Language Processing (NLP), Python, Regression Analysis, RStudio, SQL, Text Analysis

Quick-tip: the fastest way to speak to a consultant is to first ...

Alyssa Heinze

Consulting Drop-In Hours: By appointment only

Consulting Areas: Causal Inference, Data Visualization, Experimental Design, Focus Groups and Interviews, Git or GitHub, LaTeX, Machine Learning, Meta-Analysis, Mixed Methods, Qualitative Methods, Qualtrics, R, Regression Analysis, Research Design, RStudio, STATA, Survey Design, Text Analysis

Quick-tip: the fastest way to speak to a consultant is to first ...

Maksymilian Jasiak

Data Science & AI Fellow 2025-2026
Civil and Environmental Engineering

Maksymilian Jasiak is a PhD student in GeoSystems Engineering at the University of California, Berkeley. His research focuses on Distributed Fiber Optic Sensing (DFOS) for lifeline infrastructure monitoring. His work aims to advance critical infrastructure security and resilience. He holds an MS in GeoSystems Engineering from the University of California, Berkeley and a BS in Civil Engineering from the University of Illinois Urbana-Champaign.

Sohail Khan

Senior Data Science Fellow 2025-2026, Data Science Fellow 2024-2025
School of Information

Hey everyone, I’m Sohail, a first-year Master’s student studying Data Science at the I-School. I am interested in the intersection of Computer Science, Data Science, and Cognitive Psychology, and in using these tools to understand, discover, and drive the development of assistive technologies.

I have experience building with brain-computer interfaces and developing distributed data processing applications, and I am currently working on a large-scale archival project aimed at preserving the history and memory of resistance movements through an embedding-based...

Jane (Mango) Angar

Senior Data Science Fellow 2025-2026, Data Science Fellow 2024-2025
Political Science

Hi! I am a PhD candidate in the Political Science Department at UC Berkeley. My dissertation traces the emergence of disability rights groups in Africa, focusing on Zambia and Malawi, and examines factors influencing their effectiveness. I use mixed methods, including archival work, field interviews, participant observation, and surveys for data collection.

My data analysis techniques include text analysis, social network analysis, means tests, and regressions. In my free time, I enjoy moderately difficult hikes, walks along the beach with my dog, Princess, and...

Scarlet Sands-Bliss

Data Science & AI Fellow 2025-2026, Domain Consultant, Research IT
School of Public Health

Scarlet Bliss is an MS/PhD student in Epidemiology in the School of Public Health. Her work focuses on mixed methods approaches to characterizing and preventing the spread of antimicrobial resistance and other enteric pathogens via the environment. She has experience in statistical analysis and public health bioinformatics. She is interested in the ethical use of big data as it relates to epidemiologic research.

Jose Aguilar

Data Science & AI Fellow 2025-2026
Berkeley Graduate School of Education

Jose R. Aguilar is currently a PhD student in the Policy, Politics, and Leadership program at UC Berkeley’s School of Education. His research utilizes natural language processing, machine learning, and social network analysis to investigate how institutional discourse, algorithmic decision-making, and education policy influence postsecondary access and equity for marginalized students. Before Berkeley, Jose earned his M.A. in Urban Education from Loyola Marymount University and dual B.A./B.S.A. degrees in Government, Latina/o Studies, and Computer Science from the University of...