Data Science

Python Fundamentals: Parts 1-3

August 19, 2024, 2:00pm
This three-part interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Deya Chic

Data Science for Social Justice Fellow 2024
Graduate School of Education

Deya is deeply committed to supporting underrepresented students and contributing to policies that address oppression in higher education. She aims to influence fellow professionals and researchers to adopt a comprehensive approach to addressing systemic issues in the higher education system.

Megumi Tanaka

Data Science for Social Justice Fellow 2024
School of Journalism

I'm currently a graduate student in the AS&T program and the School of Journalism, conducting research in the field of AI ethics education. I'm interested in the intersection of data science, journalism and education!

TEST: Python Data Wrangling and Manipulation with Pandas

August 22, 2024, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

TEST: R Fundamentals: Parts 1-4

August 20, 2024, 9:00am
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-sourced R Studio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.

TEST: Python Fundamentals: Parts 1-3

August 19, 2024, 2:00pm
This three-part interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Hellina Hailu Nigatu

Data Science for Social Justice Senior Fellow 2024
Electrical Engineering and Computer Science (EECS)

I am a PhD student at UC Berkeley in the EECS department co-advised by Prof. Sarah Chasins and Prof. John Canny. My research interest broadly lies in the intersection of AI and HCI, with a focus on making usable AI tools accessible to end users.

I am currently looking into making NLP tools usable and accessible for low-resourced languages. I am also interested in the impact of AI on society, specifically in how it affects Global Majority countries and communities. Outside of research, I like to read books, make and drink traditional Ethiopian coffee, knit,...

Violet Davis

Data Science for Social Justice Senior Fellow 2024
MIDS

I am a Masters student studying Data Science with the School of Information. My research involves computational social science projects focused on social justice and equity.

Minding the Gaps: Pay Equity in California

July 9, 2024
by Tonya D. Lindsey, Ph.D. The gender pay gap continues to reflect that, on average, men outearn women. California is among the states with the smallest pay gaps (outpacing the national number at 13%) and is unique in that it enacted legislation aimed at eliminating pay gaps by sex and race categories. This blog post reflects on California’s pay gap as students study it in an undergraduate social statistics course. Independent variables indicate three theoretical frameworks: 1) human capital, 2) occupational segregation, and 3) discrimination. While the work students do is rigorous using a representative sample of full-time year-round California workers, there remains work to be done and caveats to the data and analyses.

Python Data Wrangling and Manipulation with Pandas

February 15, 2022, 9:00am
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.