The D-Lab and the Library partnered to make a set of social science and digital humanities data resources available to the UC Berkeley scholarly community. These data were acquired as part of a joint pilot program called the Data Acquisition and Access Program (DAAP). These data are now managed by the Library.
The DAAP datasets all have restrictions on their access and use. Berkeley users can gain access to these data upon satisfactory completion of a data use agreement specific to the resource.
The DAAP datasets are valuable for a wide range of research including linguistic, social science and data science (e.g., text analysis) applications. Here is a list of these datasets with their associated link to the UC Berkeley Library catalog record. For a full list of these and related resources, read the information and follow the links on the Library Guide to Text Mining and Computational Text Analysis