Python

Twitter Text Analysis: A Friendly Introduction

October 25, 2022
Introduction

Text analysis techniques, including sentiment analysis, topic modeling, and named entity recognition, have been increasingly used to probe patterns in a variety of text-based documents, such as books, social media posts, and others. This blog post introduces Twitter text analysis, but is not intended to cover all of the aforementioned topics. The tutorial is broken down into two parts. In this first post, I will give a step-by-step guide to using Python and Pandas to explore Twitter data. In the second post, I will introduce Language...

Python Web Scraping & APIs

November 2, 2022, 3:00pm
In this workshop, we cover how to extract data from the web using Python. We focus on two approaches to extracting data from the web: leveraging application programming interfaces (APIs) and web scraping.

Python Visualization

October 26, 2022, 3:00pm
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

Python Data Wrangling and Manipulation with Pandas

October 24, 2022, 3:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Reduce, Reuse, Recycle: Practical strategies for working with large datasets

October 12, 2022

When the size of your dataset starts to approach the size of your computer’s available memory, even the simplest data wrangling tasks can become frustrating. Suddenly, reading in a .csv or calculating a simple average becomes time-consuming or impossible. For students and researchers, additional computing resources can be costly or simply unavailable. Here are some principles and strategies for reducing the overhead of your dataset while keeping the momentum going. The code mainly focuses on reading csv files - a very common data format - into Python...

Python Fundamentals: Parts 1-4

October 11, 2022, 3:00pm
This four-part, interactive workshop series is a complete introduction to programming in Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Python Machine Learning Fundamentals: Parts 1-2

October 4, 2022, 2:00pm
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Python Data Wrangling and Manipulation with Pandas

September 28, 2022, 3:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Python Visualization

October 5, 2022, 3:00pm
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

Tom van Nuenen, Ph.D.

Consulting Drop-In Hours: By appointment only

Consulting Areas: Python, R, SQL, LaTeX, HTML / CSS, JavaScript, Julia, Data Manipulation and Cleaning, Data Science, Data Visualization, Digital Humanities, Machine Learning, Mixed Methods, Natural Language Processing, Python Programming, Qualitative Methods, R Programming, Surveys, Sampling & Interviews, Text Analysis, Web Scraping, Regression Analysis, Bash or Command Line, Excel, Gephi, Git or GitHub, NVivo, Qualtrics, RStudio, Fairness, Perceptions of AI, Hermeneutics

Quick-tip:...