Python

William Rathje

D-Lab Alumni
Sociology

I'm a second-year sociology PhD student interested in data science, critical theory, and culture. I work as a data science fellow, with technical interests in networks, natural language, machine learning, statistics, and social media analysis. Outside work, I enjoy reading, writing, coffee, and running!

Ian Castro

D-Lab Alumni
School of Information

Ian is a graduate student in the Master of Information Management and Systems program at the School of Information with a focus in applied data science. He earned his B.A. in Media Studies and B.S. in Microbial Biology from UC Berkeley, and his research interests and work experience are in STEM education. He focuses in building courses and academic programs to make data and computing accessible to historically marginalized students and those without prior exposure to the field.

Eileen Cahill

D-Lab Alumni
School of Information

Eileen is currently a first year Information Management and Systems student committed to studying human-centered design for the utility and usability of healthcare systems. She spent the last few years working in genomic research program analysis and management at the National Human Genome Research Institute. Prior to that, Eileen attended Georgetown University where she studied biology and studio art. During this time, she performed research on water contaminants in an analytical chemistry lab as well as research on estrogen mimicking compound effects on Zebrafish in a brain...

PoliPy: A Python Library for Scraping and Analyzing Privacy Policies

February 8, 2022

In light of recent scandals involving the misuse and improper handling of personal data by large corporations, advocacy groups and regulators alike have given increased attention to the issue of consumer privacy [e.g., 1, 2, 3, 4, 5]. National and local governments have been enacting privacy legislation that requires companies to minimize the amount of data they collect, deters the collection of sensitive data, limits the purposes for which the data are used, and critically, gives users more transparency into data collection and use.

As part...

Portia Awuah

Instructor, Consultant
Energy and Resources Group

I am pursuing an MS. Energy and Resources with a focus on offgrid energy. My aim is to extend sustainable electricity supply to remote communities.

Lia Chin-Purcell

Consultant
School of Information

Hello! I am a first-year Masters's student at the School of Information in the MIMS program with a focus is in data science and ethics. Before joining Berkeley, I studied computer science at the University of Puget Sound.

Emily Kaner

Consultant
Public Health / City Planning

Emily Kaner is a 3rd year MPH/MCP student focused on the structural determinants of health, intersections between health and place, and the use of mixed methods in research. Her research explores contexts and meanings of substance use among different communities and she loves thinking through the challenges of mixed methods and qualitative research.

Tiffany Taylor

PhD Student
Anthropology

Tiffany Taylor is a doctoral student at the University of California, Berkeley. Previously, she received a Master of Public Health in Epidemiology from Columbia University's Mailman School of Public Health. She graduated from the University of Chicago with majors in Political Science, Sociology, and Comparative Race and Ethnic Studies (Asian American Studies). Some of her research interests include social medicine, educational sociology, and social demography. Additional interests include pilates, yoga, and fashion.

Working with State-of-the-Art NLP Models: A Friendly Introduction to Hugging Face

December 13, 2021

We often read about the many new advancements being made in the field of Natural Language Processing (NLP). Each month, leading organizations release new models that seem like magic to us, such as models that can write it’s own code based on user prompts [1] or are able to help answer our queries when we use Google Search [2]. Large AI research groups like OpenAI and Google spend many years and pour millions of...

Rural vs. Urban: Using Python to Explore Legislative Data

November 8, 2021

Before COVID-19, becoming a data scientist was never on my radar. As a policy analyst for the California Research Bureau, a legislative research and reference section of the California State Library, I’ve worked on a variety of projects and requests. For the last 8 years, my work has focused on producing timely, confidential ...