
Statistica Sinica 31 (2021), 223-242

ON CUMULATIVE SLICING ESTIMATION FOR HIGH DIMENSIONAL DATA

Cheng Wang, Zhou Yu and Liping Zhu

Shanghai Jiao Tong University, East China Normal University
and Renmin University of China

Abstract: In the context of sufficient dimension reduction (SDR), the sliced inverse regression (SIR) successfully reduces the covariate dimension of a high-dimensional nonlinear regression. When the covariate is low or moderate dimensional, the performance of the SIR is insensitive to the number of slices. However, our empirical studies indicate that the performance of the SIR relies heavily on the number of slices when the covariate is high or ultrahigh dimensional. Determining the optimal number of slices remains an open problem in the SDR literature, despite its importance to the effectiveness of SIR in high- and ultrahigh-dimensional regressions. Thus, we propose an improved version of the SIR, called the cumulative slicing estimation (CUME) method, that does not require selecting an optimal number of slices. We provide a general framework in which to analyze the phase transitions of the CUME method. We show that, without the sparsity assumption, the CUME method is consistent if and only if p/n → 0, where p denotes the covariate dimension, and n denotes the sample size. If we include certain sparsity assumptions, then the thresholding estimate for the CUME method is consistent as long as log(p)/n → 0. We demonstrate the superior performance of the proposed method using extensive numerical experiments.
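The abstract does not spell out the estimator, so the following is only a minimal sketch of the low-dimensional (p much smaller than n) case, assuming the commonly used form of cumulative slicing: the slice means of the SIR are replaced by cumulative means m(t) = E[(X - E X) 1{Y <= t}], evaluated at every observed response, so no slice count has to be chosen. The function name `cume_directions`, the equal weighting of the thresholds, and the assumptions of an invertible sample covariance and a response without ties are illustrative choices, not details taken from the paper. The thresholded estimate for the sparse regime is not sketched here.

```python
import numpy as np

def cume_directions(X, y, d=1):
    """Sketch of cumulative slicing estimation (hypothetical helper).

    Assumes n > p with an invertible sample covariance and a
    continuous response (no ties in y).
    Returns a (p, d) array whose columns estimate the reduction directions.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape

    # Center the covariates and form the sample covariance.
    Xc = X - X.mean(axis=0)
    sigma = Xc.T @ Xc / n

    # Row k of m is (1/n) * sum_i Xc_i * 1{y_i <= y_(k)},
    # obtained as a running sum over the covariates sorted by y.
    order = np.argsort(y, kind="stable")
    m = np.cumsum(Xc[order], axis=0) / n

    # Kernel matrix: average of m(t) m(t)^T over the observed thresholds.
    M = m.T @ m / n

    # Solve M v = lambda * sigma * v through the symmetric form
    # sigma^{-1/2} M sigma^{-1/2}, then map the eigenvectors back.
    w, V = np.linalg.eigh(sigma)
    inv_sqrt = V @ np.diag(w ** -0.5) @ V.T
    evals, evecs = np.linalg.eigh(inv_sqrt @ M @ inv_sqrt)
    top = np.argsort(evals)[::-1][:d]
    return inv_sqrt @ evecs[:, top]
```

Because the sketch inverts the p x p sample covariance, it breaks down once p is comparable to or larger than n, which is the regime where the abstract's p/n → 0 condition fails.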

Key words and phrases: Cumulative slicing estimation, dimension reduction, sliced inverse regression, sparsity, sufficient.
