Data Science

Python Text Analysis Fundamentals: Parts 1-2

June 20, 2023, 9:00am
This two-part workshop series will prepare participants to move forward with research that uses text analysis, with a special focus on humanities and social science applications.

Institutional Review Board (IRB) Fundamentals

February 16, 2024, 9:00am
Are you starting a research project at UC Berkeley that involves human subjects? If so, one of the first steps you will need to take is getting IRB approval.

Python Introduction to Machine Learning: Parts 1-2

December 7, 2021, 1:00pm
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Python Fundamentals: Parts 1-3

September 11, 2023, 2:00pm
This three-part interactive workshop series is a complete introduction to programming in Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Python Text Analysis: Word Embeddings

April 11, 2024, 10:00am
How can we use neural networks to create meaningful representations of words? The bag-of-words model is limited in its ability to characterize text because it ignores word context.
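A minimal sketch of that limitation, using only Python's standard library. The `bag_of_words` helper and the two example sentences are illustrative assumptions, not workshop material: two sentences with opposite meanings produce identical word counts, because the counts discard word order.

```python
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Count lowercase, whitespace-separated tokens, ignoring their order."""
    return Counter(text.lower().split())


# Hypothetical example sentences: same words, different meaning.
a = bag_of_words("the dog bit the man")
b = bag_of_words("the man bit the dog")

# The two bags are equal, so the representation cannot tell the sentences apart.
print(a == b)
```

Word embeddings address this by learning a vector for each word from the words that appear around it.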

R Fundamentals: Parts 1-2 (5pm-8pm)

February 15, 2022, 5:00pm
Evening workshop 5-8pm. This workshop is a two-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-source RStudio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completing this workshop, you will have a foundational understanding of how to create, organize, and use workflows for your own research.

R Bootcamp: Fall 2021

August 21, 2021, 8:30am
The workshop will be an intensive two-day introduction to R using RStudio. After the first morning session, the workshop will (staffing permitting) be split into two separate tracks. Co-sponsored by the UC Berkeley Statistics Department and the D-Lab.

Python Deep Learning: Parts 1-2

March 28, 2022, 9:00am
This workshop presents a brief history of Artificial Neural Networks (ANNs) and the intuition behind them, a step-by-step reconstruction of a very basic ANN, and a demonstration of how to use the scikit-learn library to implement an ANN for a classification problem.

R Fundamentals: Parts 1-4

January 24, 2023, 2:00pm
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-source RStudio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completing this workshop, you will have a foundational understanding of how to create, organize, and use workflows for your own research.

Python Web Scraping

June 26, 2024, 10:00am
In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.
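The "sifting" step can be sketched with Python's standard-library `html.parser`. This is only an illustration under stated assumptions: the `LinkExtractor` class, the `extract_links` helper, and the inline `PAGE_SOURCE` string are hypothetical, and the workshop itself may use different libraries. To keep the example runnable offline, the page source is hard-coded rather than downloaded.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect the href value of every <a> tag seen while parsing."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the current tag.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


# Hypothetical page source. In practice it would be downloaded first,
# for example with urllib.request.urlopen(url).read().decode().
PAGE_SOURCE = """
<html><body>
  <h1>Workshops</h1>
  <a href="/workshops/python">Python</a>
  <a href="/workshops/r">R</a>
  <a name="no-link">anchor without href</a>
</body></html>
"""


def extract_links(source: str) -> list:
    """Parse an HTML string and return the link targets it contains."""
    parser = LinkExtractor()
    parser.feed(source)
    parser.close()
    return parser.links


links = extract_links(PAGE_SOURCE)
print(links)
```

The same pattern extends to other elements: override `handle_starttag` or `handle_data` to pick out whichever tags or text the analysis needs.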