Log in

Sign up for our mailing list!

This is an archive of our past training offerings. We are looking to include workshops on topics not yet covered here. Is there something not currently on the list? Send us a proposal.

E.g., 25-Apr-19
E.g., 25-Apr-19
April 24, 2019
Author:
Emily Ozer, PhD, Keegan Freiburger, Catherine Duarte, Brian Villa

Unfortunately, due to a family loss, Prof. Ozer is unable to be present at this event. Catherine Duarte and Brian Villa, her research team members and planned co-presenters, will incorporate a previously recorded lecture by Prof. Ozer and otherwise deliver the presentation as planned. We look forward to your attendance at this event. To Prof.

April 23, 2019
Author:
Caroline Le Pennec

In this workshop we will cover the most common CTA task: supervised classification. Using the Python library scikit-learn, we will implement Logistic Regression and Random Forest methods to perform sentiment analysis. Optional: introduction to word vector representations with Word2Vec.

April 23, 2019
Author:
Chris Kennedy, Evan Muzzall

This workshop introduces the basic concepts of Deep Learning - the training and performance evaluation of large neural networks, especially for image classification, natural language processing, and time-series data.

April 22, 2019
Author:
Lynn Cunningham

In this hands-on workshop, we will learn how to create web graphics for your digital publishing projects and websites. We will cover topics such as: image editing tools in Photoshop; image resolution for the web; sources for free public domain and Creative Commons images; and image upload to publishing tools such as WordPress. If possible, please bring a laptop with Photoshop installed.

April 22, 2019
Author:
Dr. Michael Samuel

This workshop will cover theory and techniques for maximizing the effectiveness of figures used for visualizing information. Rather than teaching any particular visualization software, this course will teach students about the "nuts and bolts" of effective data visualization.  

April 22, 2019
Author:
Josué Meléndez Rodríguez

Qualitative Data Analysis (QDA) software is used to organize and structure data, codes, memos, and other components of a qualitative study.

This workshop is a two-part series for qualitative researchers,  new and established, interested in learning about MAXQDA, a QDA software for which D-Lab provides substantive support.

April 18, 2019
Author:
Aaron Culich

Research on Data Science Career Paths

April 18, 2019
Author:
Chris Kennedy, Evan Muzzall

This is a six-hour tutorial on machine learning in R that covers data preprocessing, cross-validation, ordinary least squares regression, lasso, decision trees, random forest, xgboost, and superlearner algorithms.

April 17, 2019
Author:
Samy Abdel-Ghaffar

This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets.

April 16, 2019
Author:
Aaron Culich

Git is a powerful tool for keeping track of changes you make to the files in a project. You can use it to synchronize your work across computers, collaborate with others, and even deploy applications to the cloud. In this workshop, we'll learn the basics of understanding and using Git, including working with the popular "social coding" website, GitHub.

April 16, 2019
Author:
Caroline Le Pennec

This hands on workshop builds on part 1 by introducing the basics of Python's scikit-learn package to implement unsupervised text analysis methods. This workshop will cover a) vectorization and Document Term Matrices, b) weighting (tf-idf), and c) uncovering patterns using topic modeling.

April 16, 2019
Author:
Kate Beck

Crowdsourcing is a method increasingly used in qualitative, quantitative, and mixed-methods research. However, many researchers remain unclear about what this method is, when it may be appropriate to use, and how it could be implemented.

April 16, 2019
Author:
Evan Muzzall, Chris Kennedy

This is a six-hour tutorial on machine learning in R that covers data preprocessing, cross-validation, ordinary least squares regression, lasso, decision trees, random forest, xgboost, and superlearner algorithms.

April 12, 2019
Author:
Adam Bouyamourn

This workshop will provide a comprehensive overview of graphics in R, including base graphics and ggplot2. Participants will learn how to construct, customize, and export a variety of plot types in order to visualize relationships in data.

April 11, 2019
Author:
Evan Muzzall

R Fundamentals Part 4: Putting it all together

In the final part, we will review data importation, subsetting, and visualization. Students will then be given the majority of time to reproduce a workflow on two different datasets, ask questions, and review the solutions as a group.

April 10, 2019
Author:
Nancy Thomas

This workshop introduces the basic principles of understanding digital imagery, including the fundamentals of multi-spectral imagery. Participants will learn how to find and download satellite and aerial imagery, how to display and enhance digital imagery, and basic techniques for image interpretation and analysis.

April 10, 2019
Author:
Samy Abdel-Ghaffar

For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter (formerly IPython) notebook. The following plot types will be covered:

  • line
  • bar
  • scatter
  • boxplot

We'll also learn about styles and customizing plots.

April 9, 2019
Author:
Pelagie Elimbi Moudio

Pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It enables doing practical, real world data analysis in Python.

In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

We plan to cover:

April 9, 2019
Author:
Caroline Le Pennec

This hands on workshop goes through the common “preprocessing recipe” that is used as the foundation for a variety of other applications as well as some basic natural language processing techniques.  These include: a) digitization (utf 8), b) removal of stopwords, numbers, punctuation, c) tokenization, d) calculation of word frequencies / proportions, e) part of speech tagging, and f) concordan

April 9, 2019
Author:
Evan Muzzall

R Fundamentals Part 3: Data Exploration and Analysis

Students will be introduced to data exploration and analysis in R. You will learn how to summarize data and explore it with histograms, scatterplots, and boxplots. You will also be introduced to coding statistical data analysis via t-tests, analyses of variance, correlation, and linear regression.

Pages