Data Manipulation and Cleaning

Python Data Wrangling and Manipulation with Pandas

September 27, 2024, 9:00am
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.
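As a rough illustration of the kind of preparation steps described above, here is a minimal sketch in pandas. The data, the column names (`name`, `group`, `score`), and the `clean` helper are all hypothetical and are not taken from the workshop materials; the actual session may cover different steps.

```python
import pandas as pd

# Hypothetical example data; not taken from the workshop materials.
raw = pd.DataFrame(
    {
        "name": [" Ana ", "Ben", "Ben", "Cleo"],
        "group": ["a", "b", "b", "a"],
        "score": ["10", "7", "7", None],
    }
)


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Return a tidied copy: trimmed names, numeric scores, no duplicates or gaps."""
    out = df.copy()
    # Remove stray whitespace around the labels.
    out["name"] = out["name"].str.strip()
    # Convert text to numbers; values that cannot be parsed become NaN.
    out["score"] = pd.to_numeric(out["score"], errors="coerce")
    # Drop exact duplicate rows, then rows with a missing score.
    out = out.drop_duplicates().dropna(subset=["score"])
    return out.reset_index(drop=True)


cleaned = clean(raw)

# A simple grouped summary on the cleaned data.
mean_by_group = cleaned.groupby("group")["score"].mean()
print(cleaned)
print(mean_by_group)
```

Working on a copy inside `clean` leaves the original `raw` table untouched, which makes it easier to compare the data before and after each step.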

R Data Wrangling and Manipulation: Parts 1-2

October 1, 2024, 1:00pm
It is said that 80% of data analysis is spent on the process of cleaning and preparing the data for exploration, visualization, and analysis. This R workshop will introduce the dplyr and tidyr packages to make data wrangling and manipulation easier. Participants will learn how to use these packages to subset and reshape data sets, do calculations across groups of data, clean data, and perform other useful tasks.

Sylvia Song

Availability: By appointment only

Consulting Areas: Python, R, LaTeX, Data Manipulation and Cleaning, Data Science, Data Visualization, Machine Learning, Regression Analysis, Excel, RStudio, RStudio Cloud

Theo Snow

Availability: By appointment only

Consulting Areas: Python, R, SQL, SAS, Databases & SQL, Data Manipulation and Cleaning, Data Science, Data Visualization, Geospatial Data, Maps & Spatial Analysis, Machine Learning, Mixed Methods, Qualitative Methods, Surveys, Sampling & Interviews, Regression Analysis, Means Tests, Software Output Interpretation, Other, Excel, Git or Github, RStudio, RStudio Cloud, Tableau

Stephanie Andrews

Availability: By appointment only

Consulting Areas: Python, SQL, HTML / CSS, Javascript, APIs, Databases & SQL, Data Manipulation and Cleaning, Data Science, Data Sources, Data Visualization, Digital Humanities, Machine Learning, Natural Language Processing, Software Tools, Text Analysis, Web Scraping, Bash or Command Line, Excel, Git or Github, Tableau

Sakina Dhorajiwala

Availability: By appointment only

Consulting Areas: Python, R, Stata, LaTeX, Data Manipulation and Cleaning, Data Visualization, Mixed Methods, Qualitative Methods, Surveys, Sampling & Interviews, Regression Analysis, Excel, Git or Github, RStudio

Manish Kumar

Availability: By appointment only

Consulting Areas: Python, R, Javascript, C, C++, APIs, Databases & SQL, Data Manipulation and Cleaning, Digital Humanities, Software Tools, Git or Github, MATLAB, RStudio

Emma Lasky

Availability: By appointment only

Consulting Areas: Python Programming, R Programming, Data Manipulation and Cleaning, Data Science, Data Sources, Data Visualization, Geospatial Data, Maps & Spatial Analysis, Mixed Methods, Regression Analysis, ArcGIS Desktop, Online or Pro, Excel, Git or Github, QGIS, RStudio, RStudio Cloud

Iñigo Parra

Availability: By appointment only

Consulting Areas: Python, R, LaTeX, Data Manipulation and Cleaning, Data Science, Data Visualization, Deep Learning, Digital Humanities, Machine Learning, Natural Language Processing, Social Network Analysis, Regression Analysis, Means Tests, Bash or Command Line, Excel, Gephi, Git or Github, Qualtrics, RStudio, Overleaf

Aaron Culich

Schedule an Appointment

Consulting Areas: Python, R, SQL, AI & LLMs, APIs, Cloud & HPC Computing, Informatics, Data Wrangling, Databases & SQL, Bash or Command Line, Git or Github, Web Scraping