Data Science

Qualtrics Fundamentals

October 5, 2023, 2:00pm
Qualtrics is a powerful online tool available to Berkeley community members that can be used for a range of data collection activities. Primarily, Qualtrics is designed to make web surveys easy to write, test, and implement, but the software can be used for data entry, training, quality control, evaluation, market research, pre/post-event feedback, and other uses with some creativity.

Python Data Wrangling and Manipulation with Pandas

November 1, 2021, 12:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Python Fundamentals: Parts 1-4

January 23, 2023, 10:00am
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Python Web Scraping

November 2, 2023, 2:00pm
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.

Stata Fundamentals: Parts 1-3

January 12, 2022, 10:00am
This workshop is a three-part introductory series that will teach you Stata from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the Stata software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.

CANCELED: Python Data Wrangling and Manipulation with Pandas

November 29, 2022, 3:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Python Machine Learning for Data Science Discovery

March 22, 2023, 7:00pm
Overview of Machine learning, Methods of Linear Regression, Logistic Regression (Classification), and Data Preprocessing. The workshop will consist of a live coding demo with a live question-answer session.

R Bootcamp: Fall 2021

August 21, 2021, 8:30am
The workshop will be an intensive two-day introduction to R using RStudio. After the first morning session, the workshop will (staffing permitting) be split into two separate tracks. Co-sponsored by the UC Berkeley Statistics Department and the D-Lab.

Infosession: D-Lab Data Science Fellowship (2024-2025)

April 11, 2024, 3:00pm
The D-Lab is seeking applications for the 2024-2025 cohort of Data Science Fellows. This infosession will give you an in-depth look at the D-Lab Data Science Fellowship and an opportunity for you to ask questions about the program that may be helpful to your application process to become a Fellow!

Python Deep Learning: Parts 1-2

March 28, 2022, 9:00am
This workshop presents a brief history of Artificial Neural Networks (ANNs) and an explanation of the intuition behind them; a step-by-step reconstruction of a very basic ANN, and then how to use the scikit-learn library to implement an ANN for solving a classification problem.