Data Science

Python Data Wrangling and Manipulation with Pandas

February 8, 2024, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Python Introduction to Machine Learning: Parts 1-2

February 23, 2022, 10:00am
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

R Fundamentals: Parts 1-4

April 29, 2024, 9:00am
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-sourced R Studio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.

R Bootcamp: Fall 2021

August 21, 2021, 8:30am
The workshop will be an intensive two-day introduction to R using RStudio. After the first morning session, the workshop will (staffing permitting) be split into two separate tracks. Co-sponsored by the UC Berkeley Statistics Department and the D-Lab.

Institutional Review Boards (IRB) Fundamentals

March 17, 2022, 3:00pm
Are you starting a research project at UC Berkeley that involves human subjects? If so, one of the first steps you will need to take is getting IRB approval.

R Fundamentals: Parts 1-4

September 7, 2021, 10:00am
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-sourced R Studio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.

Python Data Wrangling and Manipulation with Pandas

January 23, 2023, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Data Science for Social Justice Workshop 2023

March 1, 2023, 12:00pm
This 8-week online workshop for currently enrolled UC Berkeley graduate students will give you the opportunity to learn the essential tools and methods for data science analysis and be introduced to critical frameworks that will enable you to create a project of your own design and to tell stories that can counter the market-first mentality of data science.

Python Data Wrangling and Manipulation with Pandas

October 19, 2021, 10:00am
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

R Fundamentals: Parts 1-4

March 27, 2023, 4:00pm
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-sourced R Studio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.