Diversity in Data

Diversity in Data topic

The Geography of Cannabis: Does California’s dual licensing program (de)criminalize cannabis and drive unnecessary anthropogenic activity in remote rural environments?

August 29, 2023
by Chevon Holmes. When California voters (de)criminalized cannabis production, the state’s dual licensure requirement forced local jurisdictions to create permitting programs or uphold prohibition. Many Counties developed ersatz zoning ordinances to regulate cannabis activities and hired staff to administer local permits. As an inspector, administrator, and project planner for Mendocino County from 2017-2021, I visited hundreds of cultivation sites and production facilities where I learned first-hand how two legal pathways impacted the ways in which operators could transition their businesses. This post details a dataset created to track, aggregate, and analyze the relationship between cannabis infrastructure and licensing.

Unlock the Joy and Power of Reading in Language Learning

August 21, 2023
by Bowen Wang-Kildegaard. I share my story of how reading for pleasure transformed my English speaking and writing skills. This experience inspired my passion to promote the joy and power of reading to all language learners. Using natural language processing techniques, I dive into the Language Learning subreddit, revealing a trend: Learners are often highly anxious about output practices, but are generally positive about input methods like reading and listening. I then distill complex language learning theories into actionable language learning tips, emphasizing the value of extensive reading for pleasure, pointing to potential methods like using ChatGPT for customization of reading materials, and advocating for joy in the learning journey.

Artificial Intelligence (AI) Systems, the Poor, and Consent: A Feminist Anti-Colonial Lens to Digitalized Surveillance

September 18, 2023
By Alejandro Nuñez. Today’s digital age has created a sea of endless datafication where our everyday interactions, actions, and conversations are turned into data. The advancements of automated artificial intelligence (AI) systems, and their infrastructure in which they are created and trained on, have catapulted us into an era of consistent monitoring and surveillance.

Disaggregating Race and Ethnicity Categories in Census Data

November 1, 2022

The collection of race and ethnicity data by the United States Census Bureau has a long, complex, and problematic history. The Census claims that their racial categories generally reflect a social definition of race recognized in America, adhering to guidelines set by the U.S. Office of Management and Budget. In 1900, the Census recognized five racial categories: White, Black, Chinese, Japanese, and American Indian. Today, the Census collects more...

Spooky Microbiomes and the Curse of Dimensionality

October 25, 2021

Microbiomes are all the rage today and this trendiness is clear with the Human Microbiome Market predicted to be valued at more than 1 billion dollars by the year 2027 (up from 376 million in 2019).

With studies showing that our microbial community is associated with health outcomes, from regulating our brain chemistry and behaviors...

Working with Color Data: An Introduction to Colorspaces

October 19, 2021

For many of us, the colors a computer screen makes are of secondary importance: we would much rather have the screen adjust for the conditions — a bit less blue in the evening, a bit more brightness when it’s sunny — and have a crisp resolution. If the color is about right…it's good enough. For many professionals though, from photographers, to film producers, to Data Scientists, color can become an exacting and important field of study.

For a little backstory on color in general it is important to define why it’s so complex and how creating a...

What to do about Fairness in Machine Learning?

April 7, 2020

How many thousands of machine learning applications have been developed and gone to market in recent years? Feeding vast amounts of data into software to make decisions for us is a social paradigm the 21st century is embracing to the fullest.

I’m a graduate student of public health, but have a long history as a social worker, student of psychology, literature and the human condition. Since early childhood, one thing I have always been is a science fiction fanatic: human, and societal relationships with technology have fascinated me to the core since before I can remember.

...

Silence is Violence

June 4, 2020

Dear fellow scholars and community members,

We write this statement of solidarity, denouncing the murder of George Floyd in Minnesota and the ongoing oppression of Black Americans.

D-Lab mourns with millions across the country over the murders of George Floyd, Breonna Taylor, Ahmaud Arbery and so many other Black people killed at the hands of police. We denounce these horrendous events that are indicative of the systemic racism that is built into the fabric of our society and government. We call for the demilitarization of institutions of higher learning...

Imposter Syndrome in Data Science

January 19, 2021

I promised the D-Lab a blog post and then promptly felt unqualified to write it. (I wish that I were kidding.) Thus begins my ironic tale of imposter syndrome in data science.

Who am I?

As a new D-Lab Data Science Fellow and as a doctoral student in Rachel Morello-Frosch’s laboratory in the Department of Environmental Science, Policy, and Management at Berkeley, I work at the intersection of social theory, environmental epidemiology, and geography. Specifically, I research how institutional power manifests in the distribution of environmental monitoring...

Impactful Data Science: What I Learned Through Data Science For Social Good

November 4, 2020

It’s times like these when I like to think about how I can bring together all the technical work I like to do and the impact I want to make. As a Graduate Student in the Energy and Resources Group (ERG) focusing on the intersection of humans and climate, I thoroughly believe that the efforts to create a more just society require work across disciplines to tackle society’s most pressing issues. This led me to join the Data Science for Social Good (DSSG) program at the...