Data Science

Python Web APIs

March 3, 2025, 10:00am

In this workshop, we cover how to extract data from the web with APIs using Python. APIs are often official services offered by companies and other entities, which allow you to directly query their servers in order to retrieve their data. Platforms like The New York Times, Twitter and Reddit offer APIs to retrieve data.

Read more about Python Web APIs

Python Text Analysis: Parts 1-3

March 17, 2025, 2:00pm

This three-part workshop will prepare participants to move forward with research that uses text analysis, with a special focus on social science applications. We explore fundamental approaches to applying computational methods to text in Python. We cover some of the major packages used in natural language processing, including scikit-learn, NLTK, spaCy, and Gensim.

Read more about Python Text Analysis: Parts 1-3

Qualtrics Fundamentals

March 5, 2025, 3:00pm

Qualtrics is a powerful online tool available to Berkeley community members that can be used for a range of data collection activities. Primarily, Qualtrics is designed to make web surveys easy to write, test, and implement, but the software can be used for data entry, training, quality control, evaluation, market research, pre/post-event feedback, and other uses with some creativity.

Read more about Qualtrics Fundamentals

Python GPT Fundamentals

February 20, 2025, 10:00am

This workshop offers a general introduction to the GPT (Generative Pretrained Transformers) model. No technical background is required. We will explore the transformer architecture upon which GPT models are built, how transformer models encode natural language into embeddings, and how GPT predicts text.

Read more about Python GPT Fundamentals

Python Data Wrangling and Manipulation with Pandas: Parts 1-2

February 10, 2025, 2:00pm

Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Read more about Python Data Wrangling and Manipulation with Pandas: Parts 1-2

Python Machine Learning Fundamentals: Parts 1-2

February 24, 2025, 2:00pm

This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Read more about Python Machine Learning Fundamentals: Parts 1-2

Qualtrics Fundamentals

February 6, 2025, 5:00pm

Read more about Qualtrics Fundamentals

Python Fundamentals: Parts 1-3

January 13, 2025, 9:00am

This three-part interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Read more about Python Fundamentals: Parts 1-3

R Fundamentals: Parts 1-4

February 3, 2025, 2:00pm

This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-sourced R Studio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.