Latent class model

Within each latent class, the observed variables are statistically independent—an essential aspect of latent class modeling. Usually, the observed variables are statistically dependent. By introducing the latent variable, independence is restored in the sense that within classes, variables are independent (local independence). Therefore, the association between the observed variables is explained by the classes of the latent variable (McCutcheon, 1987). In one form, the LCM is written as : p_{i_1, i_2, \ldots, i_N} \approx \sum_t^T p_t \, \prod_n^N p^n_{i_n, t}, where T is the number of latent classes and p_t are the so-called recruitment or unconditional probabilities that should sum to one. p^n_{i_n, t} are the marginal or conditional probabilities. For a two-way latent class model, the form is : p_{ij} \approx \sum_t^T p_t \, p_{it} \, p_{jt}. This two-way model is related to probabilistic latent semantic analysis and non-negative matrix factorization. The probability model used in LCA is closely related to the Naive Bayes classifier. The main difference is that in LCA, the class membership of an individual is a latent variable, whereas in Naive Bayes classifiers, the class membership is an observed label. == Related methods ==