Log in

Sign up for our mailing list!

When & Where
Fri, April 22, 2016 - 12:00 PM to 2:00 PM
D-Lab: Convening Room (356 Barrows Hall)

This workshop addresses various topics in Natural Language Processing, primarily through the use of NLTK. We first scrape and clean a long text; create a corpus in NLTK; explore tagged corpora; build basic machine learning POS taggers; finally, we construct a grammar for chunking and tree building. This workshop concentrates of linguistic issues in NLP and text analysis.

Prerequisites: Attendees should either already have a thorough knowledge of Python, or have attended the Python for Everything series. Some familiarity with NLTK would be helpful.

Please install the following packages ahead of the workshop: 

Python 3 (https://www.continuum.io/downloads)


  • NLTK ( $ pip install nltk) 
  • Brown corpus from NLTK ( >>> nltk.download() ) 
  • BeautifulSoup ( $ pip install beautifulsoup4)
Training Host: 
D-lab Facilitator: 
Zawadi Rucks-Ahidiana
Format Detail: 
Hands-on, Follow-along, Interactive tutorial