Programming Languages

Qualtrics Fundamentals

February 20, 2024, 10:00am
Qualtrics is a powerful online tool available to Berkeley community members that can be used for a range of data collection activities. Primarily, Qualtrics is designed to make web surveys easy to write, test, and implement, but the software can be used for data entry, training, quality control, evaluation, market research, pre/post-event feedback, and other uses with some creativity.

Geospatial Fundamentals with QGIS: Parts 1-2

March 1, 2022, 3:00pm
This workshop will introduce methods for working with geospatial data in QGIS, a popular open-source desktop GIS program that runs on PCs, Macs, and Linux computers. Participants will learn how to load, query and visualize point, line and polygon data. We will also introduce basic methods for processing spatial data, which are the building blocks of spatial analysis workflows. Coordinate reference systems and map projections will also be introduced.

Python Web Scraping & APIs

November 2, 2022, 3:00pm
In this workshop, we cover how to extract data from the web using Python. We focus on two approaches to extracting data from the web: leveraging application programming interfaces (APIs) and web scraping.
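To give a rough sense of how the two approaches differ, here is a minimal sketch using only the Python standard library. It is not workshop material: the JSON string, the HTML snippet, and the names `titles_from_api`, `TitleScraper`, and `titles_from_html` are all hypothetical, and inline strings stand in for real network responses so the example runs offline.

```python
import json
from html.parser import HTMLParser

# Hypothetical API response: structured JSON that can be parsed directly.
API_RESPONSE = '{"workshops": [{"title": "Python Web Scraping & APIs", "year": 2022}]}'

# Hypothetical web page: similar information, but buried in HTML markup.
HTML_PAGE = """
<html><body>
  <h2 class="title">Python Web Scraping &amp; APIs</h2>
  <h2 class="title">Python Data Visualization</h2>
</body></html>
"""


def titles_from_api(raw_json):
    """API approach: decode the JSON and read the fields we want."""
    data = json.loads(raw_json)
    return [item["title"] for item in data["workshops"]]


class TitleScraper(HTMLParser):
    """Scraping approach: walk the HTML and collect text inside <h2 class="title">."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.titles = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag.
        if tag == "h2" and ("class", "title") in attrs:
            self.in_title = True
            self.titles.append("")

    def handle_endtag(self, tag):
        if tag == "h2":
            self.in_title = False

    def handle_data(self, data):
        # Accumulate text, since a title may arrive in more than one chunk.
        if self.in_title:
            self.titles[-1] += data


def titles_from_html(html):
    scraper = TitleScraper()
    scraper.feed(html)
    scraper.close()
    return [title.strip() for title in scraper.titles]


if __name__ == "__main__":
    print(titles_from_api(API_RESPONSE))
    print(titles_from_html(HTML_PAGE))
```

The API path returns data already organized into fields, while the scraping path has to locate the data inside markup that was written for display, which is why scraping code tends to depend on the layout of a particular page.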

R Fundamentals: Parts 1-4

September 11, 2023, 10:00am
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-source RStudio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.

Python Intermediate: Parts 1-3

April 2, 2024, 10:00am
This three-part interactive workshop series teaches intermediate Python programming to people with previous programming experience equivalent to our Python Fundamentals workshop. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Python Introduction to Machine Learning: Parts 1-2

September 27, 2021, 2:00pm
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Stata Fundamentals: Parts 1-3

March 8, 2022, 1:00pm
This workshop is a three-part introductory series that will teach you Stata from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the Stata software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.

CANCELED: R Geospatial Fundamentals: Raster Data

November 29, 2022, 1:00pm
Geospatial data are an important component of data visualization and analysis in the social sciences, humanities, and elsewhere. The R programming language is a great platform for exploring these data and integrating them into your research. This workshop focuses on fundamental operations for reading, writing, manipulating and mapping raster data, which typically represent geographic information in a grid of regularly sized cells.

R Census Data Wrangling and Mapping

April 1, 2022, 10:00am
Since 1790, the US Census has been THE source of data about American people, providing valuable insights to social scientists and humanists. Mapping these data by census geographies adds more value by allowing researchers to explore spatial trends and outliers. This workshop will introduce three key packages for streamlining census data workflows in R: tigris, tidycensus and tmap. Participants will learn how to download census tabular data for one or more geographic aggregation units or years, download the associated census geographic data and then join these data for analysis and mapping.

Python Data Visualization

January 25, 2023, 2:00pm
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.