Programming Languages

Python Machine Learning Fundamentals: Parts 1-2

October 4, 2022, 2:00pm
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as TPOT, an auto-ML library built on top of scikit-learn. The focus will be on scikit-learn syntax and the tools available for applying machine learning algorithms to datasets. No theory instruction will be provided.

Python Visualization

October 5, 2022, 3:00pm
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

Bash + Git: Introduction

October 19, 2022, 2:00pm
This workshop will start by introducing you to navigating your computer’s file system and to basic Bash commands, to remove the fear of working with the command line and to give you the confidence to use it to increase your productivity. It will then move on to working with Git, a powerful tool for keeping track of changes you make to the files in a project.

Qualtrics Fundamentals

October 19, 2022, 5:00pm
Qualtrics is a powerful online tool available to Berkeley community members that can be used for a range of data collection activities. Primarily, Qualtrics is designed to make web surveys easy to write, test, and implement, but the software can be used for data entry, training, quality control, evaluation, market research, pre/post-event feedback, and other uses with some creativity.

Marcus Manos

Consulting Drop-In Hours: Fri 10:30am-12:30pm

Consulting Areas: Python, SQL, Visual Basic, Databases & SQL, Data Manipulation and Cleaning, Excel, Git or Github, Tableau

Aaron Culich

Consulting Drop-In Hours: By appointment only

Consulting Areas: Python, R, SQL, APIs, Cloud & HPC Computing, Databases & SQL, Bash or Command Line, Git or Github

Monica Donegan

Data Science Fellow
Environmental Science, Policy, and Management

Monica is a third-year Ph.D. candidate in the Environmental Science, Policy, and Management program. She uses computational tools to study the evolution and ecology of agricultural plant pathogens. Previously, she worked on a data science team at a biotech company in Boston.

R Data Visualization

September 19, 2022, 2:00pm
This workshop will provide an introduction to graphics in R with ggplot2. Participants will learn how to construct, customize, and export a variety of plot types in order to visualize relationships in data. We will also explore the basic grammar of graphics, including the aesthetics and geometry layers, adding statistics, transforming scales, and coloring or panelling by groups. You will learn how to make histograms, boxplots, scatterplots, lineplots, and heatmaps as well as how to make compound figures.

James Hall

Department of Statistics

James Hall is a graduate student in the Statistics MA program at the University of California, Berkeley. He is a husband and father to three awesome kids. Originally from Baltimore, MD, James earned his bachelor's degree in Mathematics at the United States Military Academy at West Point, NY in 2011, and served as a U.S. Army officer. He has served as a leader at multiple levels within large organizations, with a professional focus on visualizing and communicating complex analysis to decision makers. James’ experience and coursework give him expertise in navigating different statistical methods,...