Back To Index Previous Article Next Article Full Text

Statistica Sinica 24 (2014), 1365-1387

VARIABLE SELECTION FOR SPARSE HIGH-DIMENSIONAL
NONLINEAR REGRESSION MODELS BY COMBINING
NONNEGATIVE GARROTE AND SURE
INDEPENDENCE SCREENING
Shuang Wu1, Hongqi Xue1, Yichao Wu2 and Hulin Wu1
1University of Rochester and 2North Carolina State University

Abstract: In many regression problems, the relations between the covariates and the response may be nonlinear. Motivated by the application of reconstructing a gene regulatory network, we consider a sparse high-dimensional additive model with the additive components being some known nonlinear functions with unknown parameters. To identify the subset of important covariates, we propose a method for simultaneous variable selection and parameter estimation by iteratively combining a large-scale variable screening (the nonlinear independence screening, NLIS) and a moderate-scale model selection (the nonnegative garrote, NNG) for the nonlinear additive regressions. We have shown that the NLIS procedure possesses the sure screening property and is able to handle problems with non-polynomial dimensionality; for finite dimension problems, the NNG for the nonlinear additive regressions has selection consistency for the unimportant covariates and estimation consistency for the parameter estimates of the important covariates. The proposed method is applied to simulated data and to real data for identifying gene regulations to illustrate its numerical performance.

Key words and phrases: Gene regulations, independence learning, nonlinear regressions, nonnegative garrote, sigmoid function, sure screening.

Back To Index Previous Article Next Article Full Text