Data Science

Python Data Wrangling and Manipulation with Pandas

May 4, 2023, 1:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

R Fundamentals: Parts 1-4

May 1, 2023, 10:00am
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-source RStudio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completing this workshop, you will have the foundational understanding needed to create, organize, and use workflows for your own research.

Exploring Population Data with IPUMS

November 8, 2022
Last month, demographer and historian Steve Ruggles was awarded a prestigious MacArthur Foundation Fellowship for his work developing IPUMS—a harmonized database of individual and family responses to large-scale domestic and international surveys. With some samples going as far back as the 18th century, IPUMS can offer key insights into changing demographics, norms, and decision-making over...

Data Science for Social Justice Workshop 2023

March 1, 2023, 12:00pm
This 8-week online workshop for currently enrolled UC Berkeley graduate students will give you the opportunity to learn the essential tools and methods for data science analysis and be introduced to critical frameworks that will enable you to create a project of your own design and to tell stories that can counter the market-first mentality of data science.
See event details for participation information.

Data Science + Social Justice Workshop (Apply by May 1)

May 1, 2022, 10:00am
This 6-week workshop will give you the opportunity to learn the essential tools and methods for data science analysis and be introduced to critical frameworks that will enable you to create a project of your own design and to tell stories that can counter the market-first mentality of data science.
See event details for participation information.

Python Web Scraping

March 28, 2023, 2:00pm
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.
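As a minimal sketch of the two steps described above (obtain a page's source, then sift through it for the desired data), the following uses only Python's standard library. The inline HTML string and the `LinkExtractor` class are hypothetical stand-ins for illustration and are not taken from the workshop materials; a real scraper would first download the page rather than use a hard-coded string.

```python
from html.parser import HTMLParser

# Hypothetical page source; in practice this would be downloaded from a URL.
PAGE_SOURCE = """
<html><body>
  <h1>Workshops</h1>
  <a href="/pandas">Pandas</a>
  <a href="/scraping">Web Scraping</a>
</body></html>
"""


class LinkExtractor(HTMLParser):
    """Collects (href, link text) pairs from anchor tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._current_href = None  # set while we are inside an <a> tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._current_href = dict(attrs).get("href")

    def handle_data(self, data):
        # Only keep text that appears inside an anchor tag.
        if self._current_href is not None and data.strip():
            self.links.append((self._current_href, data.strip()))

    def handle_endtag(self, tag):
        if tag == "a":
            self._current_href = None


parser = LinkExtractor()
parser.feed(PAGE_SOURCE)
print(parser.links)
```

The parser ignores everything except anchor tags, which is the "sifting" step: the heading text is present in the source but never collected.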

Python Intermediate: Parts 1-3

April 6, 2023, 2:00pm
This three-part interactive workshop series teaches intermediate Python programming to people with previous programming experience equivalent to our Python Fundamentals workshop. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Python Machine Learning for Data Science Discovery

March 22, 2023, 7:00pm
This workshop gives an overview of machine learning, covering linear regression, logistic regression (classification), and data preprocessing. It consists of a live coding demo with a live question-and-answer session.

Python Web APIs

March 14, 2023, 2:00pm
In this workshop, we cover how to extract data from the web with APIs using Python. APIs are often official services offered by companies and other entities, which allow you to directly query their servers in order to retrieve their data. Platforms like The New York Times, Twitter and Reddit offer APIs to retrieve data.
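The query-and-retrieve pattern described above can be sketched roughly as follows, using only Python's standard library. The endpoint `https://api.example.com/v1/articles`, the parameter names, and the response shape are all hypothetical and do not correspond to any real service's API; the response is a canned string, so no network request is made.

```python
import json
from urllib.parse import urlencode

# Hypothetical endpoint, for illustration only.
BASE_URL = "https://api.example.com/v1/articles"


def build_request_url(base_url, params):
    """Attach query parameters to the endpoint URL."""
    return f"{base_url}?{urlencode(params)}"


def extract_titles(response_text):
    """Parse a JSON response body and pull out the article titles."""
    payload = json.loads(response_text)
    return [item["title"] for item in payload["results"]]


url = build_request_url(BASE_URL, {"q": "census", "page": 1})

# Canned response standing in for what a server might return (assumed shape).
sample_response = '{"results": [{"title": "First"}, {"title": "Second"}]}'
titles = extract_titles(sample_response)

print(url)
print(titles)
```

Real APIs typically also require an access key and impose rate limits, which this sketch leaves out.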

Twitter Text Analysis: A Friendly Introduction

October 25, 2022

Read part 2 here.

Introduction

Text analysis techniques, including sentiment analysis, topic modeling, and named entity recognition, have been increasingly used to probe patterns in a variety of text-based documents, such as books, social media posts, and others. This blog post introduces Twitter text analysis, but is not intended to cover all of the aforementioned topics. The tutorial is broken down into two parts. In this very first post, I...