Research Design

Institutional Review Board (IRB) Fundamentals

February 7, 2023, 10:00am
Are you starting a research project at UC Berkeley that involves human subjects? If so, one of the first steps you will need to take is getting IRB approval.

Sand Mining - Plugging a Critical Data Gap

May 14, 2024
by Suraj Nair. Excessive sand mining is causing a global ecological crisis. In this blog post, I present why sand mining is one of the most pressing challenges facing the planet, and why persistent data gaps hinder accountability and monitoring. I also discuss an ongoing research project of mine where we combine freely available satellite imagery and machine learning models to build open-source sand mine detection tools that can plug some of these data gaps.

Enhancing Research Transparency Inspired by Grounded Theory

April 30, 2024
by Farnam Mohebi. Grounded theory, a powerful tool for qualitative analysis, can enhance data science research by improving transparency and impact. Researchers can create a vivid record of their process by meticulously documenting the entire research journey, including the decisions they make and the corresponding rationale behind them, from initial data exploration to developing and refining theories. Embracing grounded theory principles, such as iterative coding and constant comparison, can help data scientists build robust, data-driven theories while ensuring transparency throughout the research process. This approach makes research more replicable and understandable and invites others to engage with the work, fostering collaboration and constructive critique, ultimately elevating the value and reach of their findings.

Transparency in Experimental Political Science Research

April 9, 2024
by Kamya Yadav. With the increase in studies with experiments in political science research, there are concerns about research transparency, particularly around reporting results from studies that contradict or do not find evidence for proposed theories (commonly called “null results”). To encourage publication of results with null results, political scientists have turned to pre-registering their experiments, be it online survey experiments or large-scale experiments conducted in the field. What does pre-registration look like and how can it help during data analysis and publication?

Design Your Observational Study with the Joint Variable Importance Plot

March 12, 2024
by Lauren Liao. When evaluating causal inference in observational studies, there often is a natural imbalance in the data. Luckily, variables are often measured alongside that can be helpful for adjustment. However, deciding which variables should be prioritized for adjustment is not trivial – since not all variables are equally important to the intervention or the outcome. I recommend using the joint variable importance plot during the observational study design phase to visualize which variables should be prioritized. This post provides a gentle guide on how to do so and why it is important.

How can we use big data from iNaturalist to address important questions in Entomology?

February 26, 2024
by Leah Lee. Large-scale geographic data over time on insect diversity can be used to answer important questions in Entomology. Open-source, open-access citizen science platforms like iNaturalist generate huge amounts of data on species diversity and distribution at accelerating rates. However, unstructured citizen science data contain inherent biases and need to be used with care. One of the efforts to validate big data from iNaturalist is to cross-check with systematically collected data, such as museum specimens.

From Ideas to Streamlined Research: The Benefits of Full-Cycle Methodology

December 5, 2023
by Farnam Mohebi. As an aspiring leading researcher, I find the full-cycle research methodology crucial for transforming initial curiosities into organized studies and research products. This approach begins with thorough observation, leads to theory and hypothesis development and experimentation, and concludes with synthesizing findings into coherent narratives. It's beneficial for researchers of all backgrounds, enhancing the depth and impact of their work. By embracing this method, researchers comprehensively understand each stage and its contribution to the broader research context and can lead the process of converting an initial unspecified research idea to a streamlined research study and product. This systematic approach is particularly effective in complex studies, fostering thorough, investigative, and innovative research processes.

From Asking Causal Questions to Making Causal Inference

December 5, 2023
by Lauren Liao. What is causality and how do we ask causal questions? It may seem like a difficult and foreign concept, but fear not, I will guide you through the basic concepts in this blog post. We will start from how to ask causal questions then more formally address how to answer these questions. You may find causality more approachable than you think. It follows the same ideas as presented by the scientific method of rigorously testing how interventions produce different outcomes in a controlled environment.

Introduction to Item Response Theory

October 24, 2023
by Mingfeng Xue. Measurements (e.g., tests, surveys, questionnaires) are inevitably involved with various sources of errors. Among many psychometric theories, item response theory stands out for its capability of detailed analyses at the item level and its potential to reduce some of the measurement errors. This post first discussed the limitations of conventional summation and average, which give rise to the IRT models, and then introduced a basic form of the Rasch model, including expressions of the model, the assumptions underlying it, some of its advantages, and software packages. Some codes are also provided.

Americanist Linguistics: on Ethics and Intent

October 17, 2023
by Anna Björklund. In this post, Anna Björklund investigates the origin of the linguistic study of indigenous American languages, its inextricable ties to settler-colonialism, and how linguistics can move forward as a field.