Back To Index Previous Article Next Article Full Text

Statistica Sinica 33 (2023), 1319-1341

HYBRID HARD-SOFT SCREENING FOR HIGH
DIMENSIONAL LATENT CLASS ANALYSIS

Wei Dong1, Xingxiang Li2, Chen Xu3 and Niansheng Tang1

1Yunnan University, 2Xi'an Jiaotong University and 3University of Ottawa

Abstract: Latent class analysis (LCA) is a powerful tool for detecting unobservable subgroups within a population. When a large number of covariates (features) are considered, an LCA faces great challenges in terms of both classification accuracy and computational efficiency. In this paper, we propose a novel feature screening procedure that eliminates most irrelevant features before an LCA is conducted. The proposed method is built on an EM-based hybrid hard-soft thresholding update (HHS-EM) of the latent class parameters, which naturally accounts for the joint effects between features. We show that the HHS-EM enjoys the sure screening property and leads to a refined LCA that is effective and consistent for high-dimensional classification. The performance of the proposed method is illustrated by means of simulation studies and a real-data example.

Key words and phrases: Feature screening, high-dimensional classification, latent class analysis, misclassification error, sure joint screening.

Back To Index Previous Article Next Article Full Text