Data Manipulation and Cleaning

Python Web Scraping

March 5, 2025, 10:00am

In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.
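As a rough illustration of the "sift through the source code" step, here is a minimal sketch that uses only Python's standard-library `html.parser`. The page source, the URLs in it, and the `LinkExtractor` and `extract_links` names are hypothetical, and the workshop itself may use different libraries. A real scraper would first download the page text rather than hard-code it.

```python
from html.parser import HTMLParser

# Hypothetical page source standing in for a downloaded webpage.
PAGE_SOURCE = """
<html><body>
  <h1>Workshops</h1>
  <a href="/events/python-web-scraping">Python Web Scraping</a>
  <a href="/events/python-web-apis">Python Web APIs</a>
</body></html>
"""


class LinkExtractor(HTMLParser):
    """Collect (href, link text) pairs from every <a> tag that has an href."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(source):
    """Parse HTML source text and return the links it contains."""
    parser = LinkExtractor()
    parser.feed(source)
    parser.close()
    return parser.links


links = extract_links(PAGE_SOURCE)
for href, text in links:
    print(href, "->", text)
```

The same pattern extends to other tags: track when the tag of interest opens, collect the text inside it, and store the result when it closes.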

R Data Wrangling and Manipulation: Parts 1-2

April 22, 2025, 4:00pm

It is often said that 80% of data analysis is spent cleaning and preparing data for exploration, visualization, and analysis. This R workshop introduces the dplyr and tidyr packages, which make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, perform calculations across groups of data, clean data, and carry out other useful tasks.

R Machine Learning with tidymodels: Parts 1-2

February 24, 2025, 3:00pm

Machine learning often evokes images of Skynet, self-driving cars, and computerized homes. However, these ideas are less science fiction than tangible phenomena predicated on description, classification, prediction, and pattern recognition in data. During this two-part workshop, we will discuss basic features of supervised machine learning algorithms, including k-nearest neighbors, linear regression, decision trees, random forests, boosting, and ensembling, using the tidymodels framework. For social scientists, such methods might be critical for investigating evolutionary relationships, global health patterns, voter turnout in local elections, or individual psychological diagnoses.
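To make one of the listed algorithms concrete, here is a minimal sketch of k-nearest neighbor classification. It is written in plain Python rather than R and tidymodels, so it illustrates the idea only and is not the workshop's code. The data points, labels, and the `knn_predict` name are made up for the example.

```python
from collections import Counter
from math import dist


def knn_predict(train_points, train_labels, query, k=3):
    """Label a query point by majority vote among its k nearest training points."""
    # Sort the labeled training points by Euclidean distance to the query.
    by_distance = sorted(
        zip(train_points, train_labels),
        key=lambda pair: dist(pair[0], query),
    )
    # Keep the k closest and count how often each label appears.
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Hypothetical two-feature training data with two classes.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.5), (6.0, 6.0), (6.5, 7.0), (7.0, 6.5)]
labels = ["low", "low", "low", "high", "high", "high"]

prediction = knn_predict(points, labels, (2.0, 2.0), k=3)
print(prediction)
```

Because the model is just the stored training data plus a distance function, the main choices are the value of `k` and how distance is measured.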

Python Web APIs

March 3, 2025, 10:00am

In this workshop, we cover how to extract data from the web with APIs using Python. APIs are often official services offered by companies and other entities, which allow you to directly query their servers in order to retrieve their data. Platforms like The New York Times, Twitter and Reddit offer APIs to retrieve data.
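Querying an API usually comes down to two steps: building a request URL with query parameters, then parsing the structured response (often JSON). The sketch below shows both using only the standard library. The endpoint `https://api.example.com/v1/articles`, the parameter names, and the response shape are all hypothetical and do not belong to any of the platforms named above. The response is a hard-coded string so the example runs without network access.

```python
import json
from urllib.parse import urlencode

# Hypothetical endpoint; real APIs document their own URLs and parameters.
BASE_URL = "https://api.example.com/v1/articles"


def build_request_url(base_url, params):
    """Attach URL-encoded query parameters to a base endpoint."""
    return f"{base_url}?{urlencode(params)}"


def parse_titles(response_text):
    """Extract article titles from a JSON response body (assumed shape)."""
    payload = json.loads(response_text)
    return [item["title"] for item in payload["results"]]


url = build_request_url(BASE_URL, {"q": "data science", "page": 1})

# Stand-in for the text a server might return for the request above.
SAMPLE_RESPONSE = '{"results": [{"title": "First article"}, {"title": "Second article"}]}'
titles = parse_titles(SAMPLE_RESPONSE)

print(url)
print(titles)
```

In practice the request would be sent with an HTTP client, and many APIs also require an access key passed as a parameter or header.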

R Data Wrangling and Manipulation: Parts 1-2

April 7, 2025, 2:00pm

Python Data Wrangling and Manipulation with Pandas: Parts 1-2

February 10, 2025, 2:00pm

Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
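The description does not list the specific preparation steps, so the following is only a guess at the kind of thing involved: a minimal sketch that drops rows with missing values and computes a per-group mean. The column names and values are invented for the example, and it assumes pandas is installed.

```python
import pandas as pd

# Hypothetical example data with one missing score.
df = pd.DataFrame(
    {
        "group": ["a", "a", "b", "b"],
        "score": [1.0, None, 3.0, 5.0],
    }
)

# Cleaning: drop rows where the score is missing.
cleaned = df.dropna(subset=["score"])

# Grouped calculation: mean score within each group.
group_means = cleaned.groupby("group")["score"].mean()

print(cleaned)
print(group_means)
```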

R Data Wrangling and Manipulation: Parts 1-2

February 10, 2025, 10:00am

R Data Wrangling and Manipulation: Parts 1-2

November 18, 2024, 2:00pm

Python Web Scraping

October 24, 2024, 2:00pm

In this workshop, we cover how to scrape data from the web using Python. Web scraping involves downloading a webpage's source code and sifting through the material to extract desired data.

Python Web APIs

October 22, 2024, 2:00pm
