Data Science

Why Data Disaggregation Matters: Exploring the Diversity of Asian American Economic Outcomes Using Public Use Microdata Sample (PUMS) Data

February 11, 2025
by Taesoo Song. Asian Americans are often overlooked in discussions of racial inequality due to their high average socioeconomic attainment. Many academic and policy researchers treat Asians as a single racial category in their analysis. However, this broad categorization can mask significant within-group disparities, leaving many disadvantaged individuals without access to vital resources and policy support. Song emphasizes the importance of data disaggregation in revealing Asian American inequalities, particularly in areas like income and homeownership, and demonstrates how breaking down these categories can lead to more targeted and effective policy solutions.

Python Fundamentals: Brief Introduction (60 minutes)

February 14, 2025, 3:30pm
This is a lightweight module aimed to provide a brief introduction to Python using Jupyter Notebooks.

Which Coin Should I Flip? The Multi-Arm Bandit

February 4, 2025
by Bruno Smaniotto. Consider the following game: You are given the option to choose between two coins to flip. These coins are possibly biased, so the probability of getting Heads for each coin might differ from 50/50. Each time that you flip Heads, you win one dollar. There are a total of 10 rounds. Which coin should you flip at each round? In this blog post, we will analyze this problem through the lens of a famous decision-making algorithm called the Multi-Arm Bandit, exploring how to structure the problem mathematically and how it can be solved for particular examples.

Field Experiments in Corporations

January 28, 2025
by Yue Lin. How do social science researchers conduct field experiments with private actors? Yue Lin provides a brief overview of the recent developments in political economy and management strategy, with a focus on filing field experiments within private corporations. Unlike conventional targets like individuals and government agencies, private companies are an emergent sweet spot for scholars to test for important theories, such as sustainability, censorship, and market behavior. After comparing the strengths and weaknesses of this powerful yet nascent method, Lin brainstorms some practical solutions to improve the success rate of field experimental studies. She aims to introduce a new methodological tool in a nascent research field and shed some light on improving experimental quality while adhering to ethical standards.

The Creation of Bad Students: AI Detection for Non-Native English Speakers

January 21, 2025
by Valeria Ramírez Castañeda. This blog explores how AI detection tools in academia perpetuate surveillance and punishment, disproportionately penalizing non-native English speakers (NNES). It critiques the rigid, culturally biased notions of originality and intellectual property, highlighting how NNES rely on AI to navigate the dominance of English in academic settings. Current educational practices often label AI use as dishonest, ignoring its potential to reduce global inequities. The post argues for a shift from punitive measures to integrate AIs as a tool for inclusivity, fostering diverse perspectives. By embracing AI, academia can prioritize collaboration and creativity over control and discipline.

Command Line Fundamentals

February 11, 2025, 10:00am
In this workshop, we provide a basic introduction to how to interact with your computer via terminal. We are going to focus on Bash (Bourne-Again Shell) or Zsh (Z Shell), which are one of the most commonly used Unix/Linux shells.

GitHub Fundamentals

February 20, 2025, 10:00am
This introductory workshop covers basics of GitHub using GitHub Desktop, a GUI Git client. This workshop is designed for people with no previous experience with Git or GitHub. We will cover key concepts and workflows, including version control, repository creation, branching, merging, and collaboration.

R Copilot Assisted Coding Workshop

February 4, 2025, 10:00am
This workshop provides a beginner-friendly introduction to coding with GitHub Copilot, a popular AI coding assistant. We will start from the basics so you can take advantage of AI assistants to improve your coding and avoid common pitfalls. First, we’ll cover how to install and set-up Visual Studio Code, a free code editor through which we will use GitHub Copilot. Then, we will go through the different features of GitHub Copilot and how to use them to help us code in R.

Fritz_X_DargesBlue42… Who Are You?

January 14, 2025
by Jonathan Pérez. Reflecting on the complexities of the human experience is paramount to conducting research. Jonathan Pérez, through his exploration of a conspiracy subreddit, reflects on his experience trying to find the human behind the datum. Jonathan critiques the harmful effects of dehumanizing rhetoric and the researcher’s responsibility to navigate ethical implications. In doing so, he establishes three guiding rules to support researchers seeking to humanize their analysis: 1) a researcher must always find the story behind the data; 2) a researcher must protect themselves; 3) a researcher must still humanize participants (even those who perpetuate harmful narratives).

Navigating AI Tools in Open Source Contributions: A Guide to Authentic Development

December 17, 2024
by Sahiba Chopra. The rise of ChatGPT has transformed how developers approach their work - but it might be hurting your reputation in the open-source community. While AI can supercharge your productivity, knowing when not to use it is just as crucial as knowing how to use it effectively. This guide reveals the unspoken rules of AI usage in open source, helping you navigate the fine line between leveraging AI and maintaining authenticity. Learn when to embrace AI tools and when to rely on your own expertise, plus get practical tips for building trust in the open-source community.