Python

Twitter data extraction with Selenium

March 1, 2022

Introduction

With online communities and social networks serving as important sites for computational social science research, Twitter has quickly become a popular data source for researchers (Frey et al., 2020; Kusen et al., 2017; Rao et al., 2010; Ru et al., 2021). This blog post demonstrates one way to extract Twitter data without using the Twitter API. This is especially useful for researchers who are new to exploring the use of Twitter data in their research, looking to develop a baseline corpus for a research question they are newly...

Getting Started with the NYT API

March 1, 2022

Introduction

The web is chock-full of valuable troves of data that can spawn an infinite number of social science research projects. However, not all data is easily accessible! While some data can be easily downloaded, access to some sources of data is dictated by what is known as an API. Short for application programming interface, an API is a set of defined protocols governing the terms of access to software and servers from programs created...

Eileen Cahill

D-Lab Alumni
School of Information

Eileen is currently a first-year Information Management and Systems student committed to studying human-centered design for the utility and usability of healthcare systems. She spent the last few years working in genomic research program analysis and management at the National Human Genome Research Institute. Prior to that, Eileen attended Georgetown University, where she studied biology and studio art. During this time, she performed research on water contaminants in an analytical chemistry lab as well as research on the effects of estrogen-mimicking compounds on zebrafish in a brain...

William Rathje

D-Lab Alumni
Sociology

I'm a second-year sociology PhD student interested in data science, critical theory, and culture. I work as a data science fellow, with technical interests in networks, natural language, machine learning, statistics, and social media analysis. Outside work, I enjoy reading, writing, coffee, and running!

Ian Castro

D-Lab Alumni
School of Information

Ian is a graduate student in the Master of Information Management and Systems program at the School of Information with a focus in applied data science. He earned his B.A. in Media Studies and B.S. in Microbial Biology from UC Berkeley, and his research interests and work experience are in STEM education. He focuses on building courses and academic programs to make data and computing accessible to historically marginalized students and those without prior exposure to the field.

PoliPy: A Python Library for Scraping and Analyzing Privacy Policies

February 8, 2022

In light of recent scandals involving the misuse and improper handling of personal data by large corporations, advocacy groups and regulators alike have given increased attention to the issue of consumer privacy [e.g., 1, 2, 3, 4, 5]. National and local governments have been enacting privacy legislation that requires companies to minimize the amount of data they collect, deters the collection of sensitive data, limits the purposes for which the data are used, and critically, gives users more transparency into data collection and use.

As part...

Portia Awuah

Instructor, Consultant
Energy and Resources Group

I am pursuing an MS in Energy and Resources with a focus on off-grid energy. My aim is to extend sustainable electricity supply to remote communities.

Lia Chin-Purcell

Consultant
School of Information

Hello! I am a first-year Master's student at the School of Information in the MIMS program with a focus in data science and ethics. Before joining Berkeley, I studied computer science at the University of Puget Sound.

Emily Kaner

Consultant
Public Health / City Planning

Emily Kaner is a third-year MPH/MCP student focused on the structural determinants of health, intersections between health and place, and the use of mixed methods in research. Her research explores the contexts and meanings of substance use among different communities, and she loves thinking through the challenges of mixed methods and qualitative research.

Tiffany Taylor

PhD Student
Anthropology

Tiffany Taylor is a doctoral student at the University of California, Berkeley. Previously, she received a Master of Public Health in Epidemiology from Columbia University's Mailman School of Public Health. She graduated from the University of Chicago with majors in Political Science, Sociology, and Comparative Race and Ethnic Studies (Asian American Studies). Some of her research interests include social medicine, educational sociology, and social demography. Additional interests include pilates, yoga, and fashion.