Log in

Sign up for our mailing list!

When & Where
Date: 
Wed, February 6, 2019 - 10:00 AM to 12:00 PM
Location: 
Barrows 356: D-Lab Convening Room
Description
Type: 

This hands-on workshop presents a broad overview of the existing methods to use text as data, with a focus on applications in social sciences and humanities. After a brief theoretical discussion, we will go through the common “preprocessing recipe” that is used as the foundation for a variety of other applications as well as some basic natural language processing techniques.  These include: a) tokenization b) removal of stopwords, numbers, punctuation, c) calculation of word frequencies / proportions, d) part of speech tagging.  

Prior knowledge: Basic familiarity with Python is required if you wish to follow along with the tutorial. Completion of D-Lab's Python FUN!damentals workshop series will be sufficient.

This workshop is one of a three-part series that will prepare participants to move forward with text analysis research. Please register for each workshop separately.

  • Text Analysis Fundamentals: Unsupervised Approaches

  • Text Analysis Fundamentals: Supervised Methods


Getting started & software prerequisites:

We will learn how to implement text analysis methods with Jupyter Notebooks.

To run the code on your computer, you will need to have Python 3 installed as well as some additional libraries. Anaconda is a free product that makes the installation process easy. It bundles together the Python language and a whole bunch of additional packages that we often rely on in our workshops. This way, you only have to download and install one thing. To use this method, visit this site and follow the instructions for your operating system to download the Python 3.x version (it might be 3.6, or 3.7, or higher). Please, please, please download the 3.x version, not the Python 2.x version. You may have a choice between using the graphical installer or the command line installer. Use whichever you're comfortable with, but the graphical one is easier.

IMPORTANT: Please download the material  for day 1 using the link below and save the folder on your desktop  . The content may change between workshops so make sure you have downloaded the most recent version before each workshop.

Details
Training Host: 
D-lab Facilitator: 
Evan Muzzall
Format Detail: 
hands-on, interactive
Participant Technology Requirement: 
Laptop
Log in to register for this training.