Stata Fundamentals: Parts 1-3

February 14, 2022, 3:00pm to February 23, 2022, 6:00pm

Trying to register, but not affiliated with the UCB campus? If you are from Berkeley Lab (LBL), UCSF, or CZ Biohub, please register via our partner portals here.

If you are from the UCB campus, there is no longer a waitlist! After registering above, please fill out the affiliations form if you have not done so at least once before:

Location: Remote via Zoom. Link will be sent on the morning of the event.

Date & Time: This workshop is a 3-part series running from 3pm-6pm each day:

  • Part 1: Monday, February 14
  • Part 2: Wednesday, February 16
  • Part 3: Wednesday, February 23

Start Time: D-Lab workshops start 10 minutes after the scheduled start time (“Berkeley Time”). We will admit all participants from the waiting room at that time.


This workshop is a three-part introductory series that teaches Stata from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the Stata software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completing this workshop, you will have the foundation needed to create, organize, and use workflows for your own research.

Each part consists of a lecture-style coding walkthrough interspersed with challenge problems, discussions of the solutions, and breaks. Instructors and TAs are dedicated to engaging you in the classroom and answering questions in plain language.

Part 1: Introduction

  • Loading datasets into Stata (no previous knowledge expected)
  • Examining a dataset and finding variables of interest
  • Summarizing and tabulating variables
  • Stata-specific tools and resources (do-files, logs, help files, etc.)
  • Coding and cleaning data (making new variables from old variables; labeling variables and values, etc.)
  • Using logical operators in Stata
  • Cross-tabulations

Part 2: Data Analysis in Stata 

  • Correlation
  • T-tests
  • Ordinary Least Squares (OLS) and logistic regression (basic syntax, using interaction terms, interpreting output)
  • Visualization (histograms, bar graphs, scatter plots)
  • Regression postestimation (getting predicted values, basic graphs)
  • Merging and appending datasets

Part 3: Stata Programming

  • Local and global variables (macros)
  • Looping (foreach, forvalues)
  • Reshaping data between wide and long formats
  • Recalling and using command output
  • Generating nicely formatted journal-style tables

Workshop Materials:

Software Requirements: Installation Instructions (note: UC Berkeley students will receive an email with an instructional license for the workshop)

Feedback: After completing the workshop, please give us your feedback using this form.

Questions? Email: