Web Scraping

Bo Yun Park, Ph.D.

Postdoc
D-Lab

I am a Postdoctoral Scholar in the D-Lab at the University of California, Berkeley. My research lies at the intersection of political, cultural, and transnational sociology. I am particularly interested in dynamics of social inclusion and exclusion, social change, technology, and digital politics. My dissertation investigated how political strategists in France and the United States craft narratives of political leadership for presidential candidates in the digital age. I received my Ph.D. in Sociology at Harvard University, where I was affiliated with the Institute for Quantitative Social...

Avery Richards

Senior Data Science Fellow
School of Public Health

Avery is an MPH graduate at the School of Public Health. With a background in literature and behavioral health, his current research focuses on innovations in applied epidemiology, including multidisciplinary approaches to health and social science data. Avery's general interests include public health surveillance, data quality assurance, and geospatial analysis.

Aaron Culich

Consulting Drop-In Hours: By appointment only

Consulting Areas: Python, R, SQL, APIs, Cloud & HPC Computing, Databases & SQL, Bash or Command Line, Git or GitHub

Quick-tip: the fastest way to speak to a consultant is to first submit a request and then ...

Abhishek Roy

IUSE Undergraduate Advisory Board
Economics
Data Science

I'm Abhishek Roy and I'm double majoring in Economics and Data Science. I've been a part of D-Lab's IUSE project since Spring 2020 and have truly found an organization that is not only passionate about Data Science but also strives to expand its reach equitably to all communities. I am involved in Research and Project Management roles in various departments and labs at Berkeley and I'm an Editor at the Berkeley Economic Review. I love diving into anything at the intersection of Data Science, Economics, Business, and Computational Social Science. Whenever I'm free, I love writing...

Frances Leung

Data Science Fellow
School of Information

Frances Leung is a master’s student at the UC Berkeley School of Information, where she focuses her studies on information and data science. She has a keen interest in leveraging data-driven insights to better understand consumer behaviors and the world around us. In her professional work as a management consultant, she advises retailers and consumer businesses on digital transformation and creating web/mobile experiences that delight consumers through a human-centered approach. Frances holds a Master of Business Administration from York University, Schulich School...

Aaron Culich

Deputy Director of D-Lab; Cyberinfrastructure Architect and Consulting Lead

Aaron Culich is a staff member at the D-Lab with expertise in Cloud Computing, High Performance Computing (HPC), Databases (SQL and NoSQL), JupyterHub and BinderHub infrastructure, and a variety of programming languages (Python, R, Java, C, C++, and more). His ongoing mission is to explore new compute possibilities, discover useful tools and practices, and make them more accessible to researchers on campus and beyond.

Aniket Kesari, Ph.D.

Research Fellow
Berkeley Law

Aniket is a postdoctoral scholar at the D-Lab. He earned his Ph.D. from Berkeley Law, where he specialized in Law & Economics. He also holds a BA from Rutgers University – New Brunswick in Political Science and History and is a JD candidate at Yale University. His research focuses on privacy and cybersecurity law, and he is generally interested in using data science to tackle public policy problems. During his graduate career, he was a Google Public Policy Fellow, a Data Science for Social Good (DSSG) Fellow at the University of Chicago, and a Technology Policy Analyst Intern at...

PoliPy: A Python Library for Scraping and Analyzing Privacy Policies

February 8, 2022

In light of recent scandals involving the misuse and improper handling of personal data by large corporations, advocacy groups and regulators alike have given increased attention to the issue of consumer privacy [e.g., 1, 2, 3, 4, 5]. National and local governments have been enacting privacy legislation that requires companies to minimize the amount of data they collect, deters the collection of sensitive data, limits the purposes for which the data are used, and, critically, gives users more transparency into data collection and use.

As part...

Lia Chin-Purcell

Consultant
School of Information

Hello! I am a first-year Master's student at the School of Information in the MIMS program with a focus in data science and ethics. Before joining Berkeley, I studied computer science at the University of Puget Sound.

Resisting our Data Doppelgangers: A Proposal for Unpacking the Dangers of Data-Driven Fertility Advertising With Data Science Tools

December 7, 2021

Introduction

When Janet Vertesi, a sociologist of technology at Princeton, learned of her pregnancy, she decided to conduct a personal experiment: she hid her pregnancy from the internet for nine months. This meant sharing her pregnancy only with close friends and family, using her own personal server while making purchases on Amazon, and even opting to use cash for many of her transactions. During this time, Amazon mistook her for a “suspicious customer” (Vertesi 2014, Gray 2014). Recall another incident of how Target found out about a...