Back To Index Previous Article Next Article Full Text

Statistica Sinica 31 (2021), 391-420

PENALIZED LINEAR REGRESSION WITH
HIGH-DIMENSIONAL PAIRWISE SCREENING

Siliang Gong1, Kai Zhang2 and Yufeng Liu2

1University of Pennsylvania and 2The University of North Carolina at Chapel Hill

Abstract: In relation to variable selection, most existing screening methods focus on marginal effects and ignore the dependence between covariates. To improve the performance of variable selection, we incorporate pairwise effects in covariates for screening and penalization. We achieve this by studying the asymptotic distribution of the maximal absolute pairwise sample correlation between independent covariates. The novelty of the theory is that the convergence is related to the dimensionality p, and is uniform with respect to the sample size n. Moreover, we obtain an upper bound for the maximal pairwise R squared when regressing the response onto two covariates. Based on these extreme-value results, we propose a screening procedure to detect covariates pairs that are potentially correlated and associated with the response. We further combine the pairwise screening with sure independence screening and develop a new regularized variable selection procedure. Numerical studies show that our method is competitive in terms of both prediction accuracy and variable selection accuracy.

Key words and phrases: Pairwise screening, penalized regression, sure independence screening, variable selection.

Back To Index Previous Article Next Article Full Text