Back To Index Previous Article Next Article Full Text

Statistica Sinica 30 (2020), 783-807

A NEW SEMIPARAMETRIC APPROACH TO FINITE
MIXTURE OF REGRESSIONS USING PENALIZED
REGRESSION VIA FUSION
Erin Austin1 , Wei Pan2 and Xiaotong Shen2
1University of Colorado Denver and 2University of Minnesota, Minneapolis

Abstract: For some modeling problems a population may be better assessed as an aggregate of unknown subpopulations, each with a distinct relationship between a response and associated variables. The finite mixture of regressions (FMR) model, in which an outcome is derived from one of a finite number of linear regression models, is a natural tool in this setting. In this article, we first propose a new penalized regression approach. Then, we demonstrate how the proposed approach better identifies subpopulations and their corresponding models than a semiparametric FMR method does. Our new method fits models for each person via grouping pursuit, utilizing a new group-truncated L1 penalty that shrinks the differences between estimated parameter vectors. The methodology causes the individuals’ models to cluster into a few common models, in turn revealing previously unknown subpopulations. In fact, by varying the penalty strength, the new method can reveal a hierarchical structure among the subpopulations that can be useful in exploratory analyses. Simulations using FMR models and a real-data analysis show that the method performs promisingly well.

Key words and phrases: FMR, group LASSO, group TLP, grouping pursuit, penalized regression, semiparametric.

Back To Index Previous Article Next Article Full Text