Statistics

Causal Thinking in Thermal Comfort

September 17, 2024

Ruiji Sun

by Ruiji Sun. We demonstrate the importance of causal thinking by comparing two linear regression approaches used in thermal comfort research: Approach (a), which regresses thermal sensation votes (y-axis) on indoor temperature (x-axis); Approach (b), which does the reverse, regressing indoor temperature (y-axis) on thermal sensation votes (x-axis). From a correlational perspective, they may appear interchangeable, but causal thinking reveals substantial and practical differences between them. Using the same data, we found Approach (b) leads to a 10 °C narrower than the conventionally derived comfort zone using Approach (a). This finding has important implications for occupant comfort and building energy efficiency. We highlight the importance of integrating causal thinking into correlation-based statistical methods, especially given the increasing volume of data in the built environment.

Read more about Causal Thinking in Thermal Comfort

Causal Inference in International Political Economy: Hurdles and Advancements

September 9, 2024

Yue Lin

by Yue Lin. What are the key challenges and opportunities of applying experiments in the International Political Economy (IPE) research? In this blog, I reviewed an enduring methodological battle between statistics and experiments, and pointed out that the difficulties of randomization and locating credible counterfactuals have served as main hurdles for IPE scholars to widely adopt experimental tools. However, I further demonstrated some new progress in applying survey, field, and lab experiments in the recent IPE scholarship. I concluded that it is crucial for future researchers to think innovatively about how to combine different research methods to make causal claims in IPE studies.

Read more about Causal Inference in International Political Economy: Hurdles and Advancements

Minding the Gaps: Pay Equity in California

July 9, 2024

Tonya D. Lindsey, Ph.D.

by Tonya D. Lindsey, Ph.D. The gender pay gap continues to reflect that, on average, men outearn women. California is among the states with the smallest pay gaps (outpacing the national number at 13%) and is unique in that it enacted legislation aimed at eliminating pay gaps by sex and race categories. This blog post reflects on California’s pay gap as students study it in an undergraduate social statistics course. Independent variables indicate three theoretical frameworks: 1) human capital, 2) occupational segregation, and 3) discrimination. While the work students do is rigorous using a representative sample of full-time year-round California workers, there remains work to be done and caveats to the data and analyses.

Read more about Minding the Gaps: Pay Equity in California

R Bootcamp: Fall 2021

August 21, 2021, 8:30am

The workshop will be an intensive two-day introduction to R using RStudio. After the first morning session, the workshop will (staffing permitting) be split into two separate tracks. Co-sponsored by the UC Berkeley Statistics Department and the D-Lab.

Read more about R Bootcamp: Fall 2021

Introduction to Propensity Score Matching with MatchIt

April 1, 2024

Alex Ramiller

by Alex Ramiller. When working with observational (i.e. non-experimental) data, it is often challenging to establish the existence of causal relationships between interventions and outcomes. Propensity Score Matching (PSM) provides a powerful tool for causal inference with observational data, enabling the creation of comparable groups that allow us to directly measure the impact of an intervention. This blog post introduces MatchIt – a software package that provides all of the necessary tools for conducting Propensity Score Matching in R – and provides step-by-step instructions on how to conduct and evaluate matches.

Read more about Introduction to Propensity Score Matching with MatchIt

Computational Social Science in a Social World: Challenges and Opportunities

March 26, 2024

José Aveldanes

by José Aveldanes. The rise of AI, Machine Learning, and Data Science are harbingers of the need for a significant shift in social science research. Computational Social Science enables us to go beyond traditional methods such as Ordinary Least Squares, which face challenges in addressing complexities of social phenomena, particularly in modeling nonlinear relationships and managing high-dimensionality data. This paradigmatic shift requires that we embrace these new tools to understand social life and necessitates understanding methodological and ethical challenges, including bias and representation. The integration of these technologies into social science research calls for a collaborative approach among social scientists, technologists, and policymakers to navigate the associated risk and possibilities of these new tools.

Read more about Computational Social Science in a Social World: Challenges and Opportunities

Design Your Observational Study with the Joint Variable Importance Plot

March 12, 2024

Lauren Liao

by Lauren Liao. When evaluating causal inference in observational studies, there often is a natural imbalance in the data. Luckily, variables are often measured alongside that can be helpful for adjustment. However, deciding which variables should be prioritized for adjustment is not trivial – since not all variables are equally important to the intervention or the outcome. I recommend using the joint variable importance plot during the observational study design phase to visualize which variables should be prioritized. This post provides a gentle guide on how to do so and why it is important.

Read more about Design Your Observational Study with the Joint Variable Importance Plot

A Basic Introduction to Hierarchical Linear Modeling

March 4, 2024

Mingfeng Xue

by Mingfeng Xue. Hierarchical Linear Modeling (HLM) is an extension of linear models, which offers an approach to analyzing data structures with nested levels. This blog elucidates HLM's significance over traditional linear regression models, particularly in handling clustered data and multilevel predictors. Illustrated with an example from educational research, the blog demonstrates model implementation and interpretation steps. It showcases how HLM accommodates both independent variables from different levels and hierarchical structure data, providing insights into their impacts on the outcome variable. Recommended resources further aid readers in mastering HLM techniques.

Read more about A Basic Introduction to Hierarchical Linear Modeling

Creating the Ultimate Sweet

January 30, 2024

Emma Turtelboom

by Emma Turtelboom. What is the best Halloween candy? In this blog post, we will identify attributes of popular sweets and create a model to understand how these attributes influence the popularity of the sweet. We’ll discuss alternative model approaches and potential drawbacks, as well as caveats to interpreting the predictions of our model.

Read more about Creating the Ultimate Sweet

Tracking Urban Expansion Through Satellite Imagery

December 12, 2023

Leïla Njee Bugha

by Leïla Njee Bugha. Among its many uses, remote sensing can prove especially useful to document changes and trends from eras or settings, where traditional sources are either inexistent or infrequently collected. This is the case when one wants to study urban expansion in sub-Saharan countries over the past 20 years. To further remedy the lack of data on land cover uses from earlier time periods, classification methods can be used as well. Using easily accessible satellite imagery from Google Earth Engine, I provide here an example combining remote sensing with classification to detect changes in the land cover in Nigeria since 2000 due to urban expansion.

Read more about Tracking Urban Expansion Through Satellite Imagery

« first View: Taxonomy term
‹ previous View: Taxonomy term
1 of 5 View: Taxonomy term
2 of 5 View: Taxonomy term
3 of 5 View: Taxonomy term (Current page)
4 of 5 View: Taxonomy term
5 of 5 View: Taxonomy term
next › View: Taxonomy term
last » View: Taxonomy term