Software Tools

R Data Visualization

September 19, 2022, 2:00pm
This workshop will provide an introduction to graphics in R with ggplot2. Participants will learn how to construct, customize, and export a variety of plot types in order to visualize relationships in data. We will also explore the basic grammar of graphics, including the aesthetics and geometry layers, adding statistics, transforming scales, and coloring or panelling by groups. You will learn how to make histograms, boxplots, scatterplots, lineplots, and heatmaps as well as how to make compound figures.

R Data Wrangling and Manipulation: Parts 1-2

September 12, 2022, 2:00pm
It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and other useful tasks.

Christopher Paciorek, Ph.D.

Research Computing Consultant, Adjunct Professor
Department of Statistics
Research IT

Chris Paciorek is an adjunct professor in the Department of Statistics, as well as the Statistical Computing Consultant in the Department's Statistical Computing Facility (SCF) and in the Econometrics Laboratory (EML) of the Economics Department. He is also a user support consultant for Berkeley Research Computing. He teaches and presents workshops on statistical computing topics, with a focus on R.

Bash + Git: Introduction

June 27, 2022, 1:00pm
This workshop will start by introducing you to navigating your computer’s file system and basic Bash commands to remove the fear of working with the command line and to give you the confidence to use it to increase your productivity. And then working with Git, a powerful tool for keeping track of changes you make to the files in a project.
See event details for participation information.

Python Visualization

June 2, 2022, 1:00pm
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.
See event details for participation information.

Bash + Git: Introduction

June 1, 2022, 9:00am
This workshop will start by introducing you to navigating your computer’s file system and basic Bash commands to remove the fear of working with the command line and to give you the confidence to use it to increase your productivity. And then working with Git, a powerful tool for keeping track of changes you make to the files in a project.
See event details for participation information.

R Visualization

May 27, 2022, 9:00am
This workshop will provide an introduction to graphics in R with ggplot2. Participants will learn how to construct, customize, and export a variety of plot types in order to visualize relationships in data. We will also explore the basic grammar of graphics, including the aesthetics and geometry layers, adding statistics, transforming scales, and coloring or panelling by groups. You will learn how to make histograms, boxplots, scatterplots, lineplots, and heatmaps as well as how to make compound figures.
See event details for participation information.

R Data Wrangling and Manipulation: Parts 1-2

May 24, 2022, 1:00pm
It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and other useful tasks.
See event details for participation information.

Frank Hidalgo Ruiz

Data Science Fellow
Chemistry

I am currently a 5th-year Chemical Biology Ph.D. student. My research focuses on understanding the mechanism by which mutations in a protein called Ras lead to tumorigenesis. More specifically, I aim to integrate high-throughput mutagenesis, coevolutionary analysis, and machine learning algorithms to generate a predictive model. Over the last year, I have built a Python package to process, analyze, and visualize Next Generation Sequencing datasets. I love collaborating across research fields and sharing my passion for data science.

Introduction to Bash + Git

April 21, 2022, 10:00am
This workshop will start by introducing you to navigating your computer’s file system and basic Bash commands to remove the fear of working with the command line and to give you the confidence to use it to increase your productivity. And then working with Git, a powerful tool for keeping track of changes you make to the files in a project.