This workshop will focus on how to create publication-ready graphs using Stata. We will begin with a short demo of how to create standard graphs in Stata, and continue on to more high-quality formats for greater legibility and a more polished display.
This is an archive of our past training offerings. We are looking to include workshops on topics not yet covered here. Is there something not currently on the list? Send us a proposal.
This workshop will cover the basic concepts involved in georeferencing/georectifiying a digital image and provide hands-on practice using a web-based interface. Georeferencing involves “spatializing” scanned maps or aerial imagery so they can be used in a geographic information system (GIS).
Ordination techniques are valuable to the social sciences. They offer useful ways to explore and distill variation that is distributed across many variables into reduced dimensional space. They are useful for exploring relationships within large multivariate datasets as site and species relationships can be plotted in two and three dimensions.
Please note: This workshop was originally scheduled for Tuesday, 3/21 at 10:00am-12:00pm. It has been rescheduled for Tuesday, 4/4 at 10:00am-12:00pm.
This three-part series will focus on how to set up database-like structures, navigate them, create models and build various types of reports in Microsoft Excel. By the end of this series, participants will be able to sort and look for information within large datasets, use character-based functions, pivot tables, and build basic financial models.
This workshop will cover OpenRefine, a very useful open source, point-and-click interface for cleaning and standardizing data. Examples of OpenRefine's capabilities include: correcting variations in spelling, standardizing date formats, and detecting duplicate values.
Prior knowledge: No prior knowledge is required, though a familiarity with spreadsheets is helpful.
This workshop covers how to organize and analyze qualitative data in Dedoose. The training will outline the key decisions the researcher must make in the coding process, as well as teach attendees how to start a new Dedoose project, add codes and set up a code system, code data, add characteristics about data, run analysis tools such as queries, and explore data for further analysis.
In this workshop we will cover two main supervised text analysis methods, the dictionary method, and supervised classification. We will use list comprehension to implement the dictionary method, using sentiment analysis as our example.
This introductory workshop is geared towards social scientists and digital humanitists who have data - spreadsheets, scanned maps, digital photos, etc - that they want to display on a map. We explore these tasks in CARTO, a versatile web mapping and geospatial analysis tool.
Data are the foundations of the social and biological sciences. Familiarizing yourself with a programming language can help you better understand the roles that data play in your field. Learn to develop and train your data skills at the free D-Lab R workshops!
This four-part, interactive workshop series is your complete introduction to programming Python for people with little or no previous programming experience. By the end of the series, you will be able to apply your knowledge of basic principles of programming and data manipulation to a real-world social science application.
This hands on workshop builds on part 2 by introducing the basics of Python's scikit-learn package to implement unsupervised text analysis methods. This workshop will cover a) vectorization and Document Term Matrices, b) weighting (tf-idf), and c) uncovering patterns using topic modeling.
**Please note: This workshop will be held at the Goldman School of Public Policy at 1893 Le Roy Avenue, Berkeley.**
This workshop will cover how to access databases via RESTful APIs using the Python programming language. Our main example will involve collecting data on news articles through the New York Times Article API.
This workshop will demonstrate the use of the ggplot2 and ggmaps packages for working with and visualizing geospatial data. Participants will learn how to plot points, line and polygon features on a map, customize symbology, and add reference data like Google Maps and OpenStreetMaps. Geocoding place names and addresses with ggmaps will also be introduced.
This workshop will provide a comprehensive overview of graphics in R, including base graphics and ggplot2. Participants will learn how to construct, customize, and export a variety of plot types in order to visualize relationships in data.
Join us as Andrew Garrett discusses his research on linguistic and ethnographic sound recordings from early 20th century California.
This three-part series will cover the following materials:
Part 1: Introduction (Wednesday, March 1)
This hands on workshop goes through the common “preprocessing recipe” that is used as the foundation for a variety of other applications as well as some basic natural language processing techniques. These include: a) digitization (utf 8), b) removal of stopwords, numbers, punctuation, c) tokenization, d) calculation of word frequencies / proportions, e) part of speech tagging, and f) concordan