Regional Statistics Conference 2026

Regional Statistics Conference 2026

Causal Inference for Latent Class Analysis for Complex Survey Sampling Data

Conference

Regional Statistics Conference 2026

Format: CPS Abstract - Malta 2026

Keywords: biostatistics, causal inference, complex sampling design, latent variable models

Session: CPS 03 Causal Inference and Bayesian Networks

Wednesday 3 June 10 a.m. - 11 a.m. (Europe/Malta)

Abstract

In various fields such as social sciences, health, and education, it is feasible to represent and describe phenomena that cannot be directly measured. Response patterns are examined through the observation of a set of variables, known as indicators, which are influenced by an unobservable variable referred to as a latent variable or construct. In this context, Latent Class Analysis (LCA) stands out as a significant analytical approach, serving as a statistical modeling technique that utilizes categorical indicators and constructs to define mutually exclusive subgroups of individuals with similar characteristics. The integration of Causal Inference methods with LCA enables researchers to make causal claims regarding measured relationships by analyzing the prevalence of latent classes. Additionally, many studies incorporate complex sampling processes, which include cluster sampling and stratification. This work combines different approaches for causal estimation of interventions on a categorical latent variable using propensity scores within the framework of complex sampling designs. These methodologies are illustrated through an analysis of data regarding the mental health of Brazilians, specifically examining their responses to the nine items of the PHQ-9 (Patient Health Questionnaire-9) from the 2013 National Health Survey (PNS-2013). The objective is to estimate the causal effect of lifestyle choices, such as living alone or not, on depression in adults. Simulation studies are conducted to evaluate the proposed methods.