Location: Remote via Zoom. Link will be sent on the morning of the event.
Date & Time: This workshop runs from 1pm-4pm on Tuesday, May 31.
Description
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
We will cover:
-
Pandas data structures
-
Loading data
-
Subsetting and filtering
-
Calculating summary statistics
-
Dealing with missing values
-
Merging data sets
-
Creating new variables
-
Basic plotting
-
Exporting data
Prerequisites: D-Lab’s Python Fundamentals introductory series or equivalent knowledge.
GitHub Repository: https://github.com/dlab-berkeley/Python-Data-Wrangling
Software Requirements:Installation Instructions for Python Anaconda
Is Python Not working on your laptop?
Attend the workshop anyway, we can provide you with a cloud-based solution until you figure out the problems with your local installation.
Feedback: After completing the workshop, please provide us feedback using this form
Registration & Logistics
Trying to register, but not affiliated with the UCB campus? If you are from Berkeley Lab (LBL), UCSF, California Research Bureau (CRB), or CZ Biohub, please register via our partner portals here.
Recordings: This D-Lab workshop will be recorded. Your registration for the event indicates your consent to having any images, comments and chat messages included as part of the video recording materials that are made available.
Start Time: D-Lab workshops start 10 minutes after the scheduled start time (“Berkeley Time”). We will admit all participants from the waiting room at that time.
Questions? Email: dlab-frontdesk@berkeley.edu