Programming Languages

Nikita Samarin

Data Science Fellow 2021-2022
Electrical Engineering and Computer Science (EECS)

Nikita Samarin is a doctoral student in Computer Science in the Department of Electrical Engineering and Computer Sciences (EECS) at the University of California, Berkeley advised by Serge Egelman and David Wagner. His research focuses on computer security and privacy from an interdisciplinary perspective, combining approaches from human-computer interaction, behavioral sciences, and legal studies. Samarin is a member of the Berkeley Lab for Usable and Experimental Security (BLUES) and an affiliated graduate researcher at the Center for Long-Term Cybersecurity (CLTC) and the...

Monica Donegan

Data Science Fellow 2022-2023
Environmental Science, Policy, and Management

Monica is a third-year Ph.D. candidate in the Environmental Science, Policy, and Management program. She uses computational tools to study the evolution and ecology of agricultural plant pathogens. Previously, she worked on a data science team at a biotech company in Boston.

Ruiji Sun

Data Science Fellow 2024-2025
Center for the Built Environment

Ruiji Sun is currently a Ph.D. candidate in Building Science at UC Berkeley. He is also a GSR at the Center for the Built Environment (CBE). His dissertation focuses on causal inference in the built environment. Other areas of his research include indoor environmental quality, personalized environmental control systems, and building energy modeling.

He obtained his M.S. degree from Carnegie Mellon University and double-majored in Mechanical Engineering (HVAC) and Architecture at Xi’an University of Architecture and Technology, China. Ruiji also served as a board...

Amber Galvano

Data Science Fellow 2024-2025
Linguistics

I am a fourth-year PhD student in Linguistics, with a focus in sociophonetics and phonology. In my research, I'm interested in how understudied speech communities (Andalusians, southern Spain; Lobi and Tonko Limba, West Africa) and often-relegated aspects of social identity (sexuality, gender normativity) can inform new approaches to theory and methodology and how we conceptualize the interfaces between linguistic subfields.

I'm also involved in language documentation/revitalization work for Lobi and the development of automated phonetic methods, particularly for...

Measuring Vowels Without Relying on Sex-Based Assumptions

April 8, 2025
by Amber Galvano. This tutorial builds on my previous post on Python for acoustic analysis, this time focusing on measuring vocal tract resonances without relying on sex-based assumptions. I demonstrate how to process audio files and vowel annotations using an adaptive method that optimizes the acoustic analysis across a recording. Instead of fixing parameters based on generalized vocal tract length correlations, this approach varies them within a defined range for greater accuracy. This not only enhances measurement precision but also avoids requiring (or assuming) speakers’ sex in data collection. Finally, I show how to filter for outliers and create high-quality vowel space visualizations.

Qualtrics Fundamentals: Parts 1-2

April 14, 2025, 1:00pm
In this two-part workshop, we provide an introduction to using Qualtrics. In the first part, we'll cover how to use the platform and its features to create, distribute, and analyze surveys. In the second part, we'll discuss best practices for survey design.

Git Fundamentals

May 8, 2025, 10:00am
This introductory workshop covers basics of Git using command line(Bash). We will cover key concepts and workflows, including version control, repository creation, branching, merging, and collaboration. You'll gain hands-on experience navigating Git, managing repositories, and contributing to projects, making it easier to streamline your work and collaborate with others.

MAXQDA Fundamentals Departmental (90m)

April 14, 2025, 12:30pm
This 90-minute introductory workshop will teach you MaxQDA from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the MaxQDA software, upload multiple forms of data then how to use manual and autocode features. We will review some of the additional analytic features including visual, memo and the Questions, Themes and Theories (QTT) tools. We will briefly touch on the MaxQDA Team cloud-based version. Instructors will share recommended resources.

R SQL Fundamentals

April 28, 2025, 3:00pm
In this workshop, we provide an introduction to using SQL to query and retrieve data from relational databases in R. First, we’ll cover what relational databases and SQL are. Then, we’ll use different packages in R to navigate relational databases using SQL.

Python Data Visualization: Parts 1-2

April 7, 2025, 8:00am
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.