Python

Python Data Wrangling and Manipulation with Pandas

January 23, 2023, 2:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

Aniket Kesari, Ph.D.

Former D-Lab Postdoc and Senior Data Science Fellow
Berkeley Law

Aniket Kesari was a postdoc and data science fellow at D-Lab. He is currently a research fellow at NYU’s Information Law Institute, and will join the faculty of Fordham Law School in 2023. His research focuses on law and data science, with particular interests in privacy, cybersecurity, and consumer protection.

Python Fundamentals: Parts 1-4

December 5, 2022, 9:00am
This four-part, interactive workshop series is a complete introduction to programming in Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.

Python Geospatial Data and Mapping: Parts 1-2

November 8, 2022, 2:00pm
Geospatial data are an important component of data visualization and analysis in the social sciences, humanities, and elsewhere. The Python programming language is a great platform for exploring these data and integrating them into your research.

CANCELED: Python Visualization

December 1, 2022, 3:00pm
For this workshop, we'll provide an introduction to visualization with Python. We'll cover visualization theory and plotting with Matplotlib and Seaborn, working through examples in a Jupyter notebook.

CANCELED: Python Data Wrangling and Manipulation with Pandas

November 29, 2022, 3:00pm
Pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with 'relational' or 'labeled' data both easy and intuitive. It supports practical, real-world data analysis in Python. In this workshop, we'll work with example data and go through the various steps you might need to prepare data for analysis.

CANCELED: Python Text Analysis: Word Embeddings

November 17, 2022, 12:00pm
How can we use neural networks to create meaningful representations of words? The bag-of-words model is limited in its ability to characterize text because it does not use word context.

CANCELED: Python Text Analysis: Topic Modeling

November 15, 2022, 12:00pm
In this part, we study unsupervised learning of text data. This is a standalone workshop that builds on the two-part text analysis series.

CANCELED: Python Machine Learning Fundamentals: Parts 1-2

November 14, 2022, 4:00pm
This workshop introduces students to scikit-learn, the popular machine learning library in Python, as well as the auto-ML library built on top of scikit-learn, TPOT. The focus will be on scikit-learn syntax and available tools to apply machine learning algorithms to datasets. No theory instruction will be provided.

Python Fundamentals: Parts 1-4

November 7, 2022, 11:00am
This four-part, interactive workshop series is a complete introduction to programming in Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.