R

R Data Wrangling and Manipulation

September 17, 2021, 2:00pm
It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and other useful tasks.

R Fundamentals: Parts 1-4

September 7, 2021, 10:00am
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-sourced R Studio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.
Registration is unavailable.

R Bootcamp: Fall 2021

August 21, 2021, 8:30am
The workshop will be an intensive two-day introduction to R using RStudio. After the first morning session, the workshop will (staffing permitting) be split into two separate tracks. Co-sponsored by the UC Berkeley Statistics Department and the D-Lab.
See event details for participation information.

R Fundamentals: Parts 1-4

August 19, 2021, 9:30am
Data are the foundations of the social and biological sciences and humanities. Familiarizing yourself with a programming language can help you better understand the roles that data play in your field. This workshop will teach you to use base R to build a programming vocabulary to develop and train your data skills! The D-Lab's R Fundamentals workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-sourced R Studio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.
Registration is unavailable.

Grazia Rovelli, Ph.D.

Data Science Fellow
Chemical Science Division (LBL)

Grazia is a postdoctoral scholar at the Chemical Science Division at Berkeley Lab and a Data Science Fellow at D-Lab. Her research has focused on several different aspects of atmospheric chemistry and she is now interested in data science and machine learning tools applied to atmospheric pollution problems.

Ilya Akdemir

Data Science Fellow
School of Law

Ilya is a JSD candidate at UC Berkeley School of Law. His research focuses on natural language processing and machine learning applications that are motivated by both theoretical and practical questions in the legal domain.

Amanda Glazer

Instructor
Statistics

Amanda is a PhD candidate in the statistics department at Berkeley. Her research focuses on causal inference with applications in education, political science and sports. Previously she earned her Bachelor’s degree in mathematics and statistics, with a secondary in computer science, from Harvard.

Why Teaching Social Scientists How To Code Like A Professional Is Important

September 23, 2020

I use data science to study political learning, organization, and mobilization among marginalized populations. I have always loved programming and want to serve people lacking voice and representation in a society. I am blessed to have found and chosen computational social science—a field situated between social science and data science—as my main research area.

I also love teaching people how to code, especially social scientists, and I take that mission seriously. I have taught computational tools and techniques at both graduate and undergraduate levels in semester-...

Manuscript Workflow with R Markdown and GIT

March 16, 2021

As part of my Masters of Public Health program I needed to complete a capstone. Working on a manuscript is a lot of back and forth: You need to make edits, fix your words and figures, and sometimes re-work entire sections. If you are like me, the thought of doing this process over a long period of time in Word makes me nauseous. Two main issues that cause this nausea for me are:

I frequently forget to make a record of my writing and often overwrite work

Copying and pasting figures while arguing with Word’s formatting...

Dustin Wallace

IUSE Undergraduate Advisory Board
Data Science

My name is Dustin Wallace, I am a 3rd-year transfer student from Fresno City College. I’m majoring in Data Science and plan to pursue a Master's of Financial Engineering or Master's of Information and Data Science after my time here at Cal. I'm a part of many communities on campus, such as reentry transfer students, Berkeley Underground Scholars, and EOP. My research deals with challenges faced by formerly incarcerated students within higher education, and private prisons. I am currently involved in student trading challenges where I compete against students from all over the world...