Data Science

Python Fundamentals: Parts 1-4

May 2, 2022, 9:00am
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

dbplyr: do we still need to learn SQL to create and manage databases?

April 11, 2022

How to deal with datasets that are larger than our computer’s memory? Do we still need to learn Structured Query Language (SQL) to create and manage a database?

As an incipient data analyst, one of my first major challenges was to build and manage a spatial database using PostGIS, an open-source software that adds a geographic to PostgreSQL relational databases. I was given several text files in a hard drive that weighed approximately 10 GB each! My first reaction was to double click on the first text file that I saw… but this was clearly...

Python Text Analysis: Topic Modeling

April 13, 2022, 3:00pm
In this part, we study unsupervised learning of text data. This is a stand alone work that builds from the two-part text analysis series.

Python Data Wrangling and Manipulation with Pandas

April 18, 2022, 10:00am
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Qualtrics Fundamentals

April 21, 2022, 2:00pm
Qualtrics is a powerful online tool available to Berkeley community members that can be used for a range of data collection activities. Primarily, Qualtrics is designed to make web surveys easy to write, test, and implement, but the software can be used for data entry, training, quality control, evaluation, market research, pre/post-event feedback, and other uses with some creativity.

Python Fundamentals: Parts 1-4

April 5, 2022, 9:00am
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

R Fundamentals: Parts 1-4

April 4, 2022, 9:00am
This workshop is a four-part introductory series that will teach you R from scratch with clear introductions, concise examples, and support documents. You will learn how to download and install the open-sourced R Studio software, understand data and basic manipulations, import and subset data, explore and visualize data, and understand the basics of automation in the form of loops and functions. After completion of this workshop you will have a foundational understanding to create, organize, and utilize workflows for your personal research.

What can state government do…to attract a data scientist like YOU?

March 29, 2022

What can state government do…to attract a data scientist like YOU?

By Kellie Hogue

What’s your next move? When I was in grad school, one of my professors told me that regardless of the job I am currently in, I should always be planning the next step in my career.

At the time, it made sense–academic appointments in my discipline were few and far between, and I wouldn’t get one without some major strategic networking and planning. Simply a case of too much supply, not enough demand....

Python Fundamentals: Parts 1-2 (4pm-7pm)

March 29, 2022, 4:00pm
Evening workshop 4-7pm. This two-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.