Back To Index Previous Article Next Article Full Text

Statistica Sinica 33 (2023), 2463-2493

SIMPLEX-BASED MULTINOMIAL LOGISTIC
REGRESSION WITH DIVERGING NUMBERS
OF CATEGORIES AND COVARIATE

Sheng Fu1, Piao Chen2, Yufeng Liu3 and Zhisheng Ye1

1National University of Singapore, 2Delft University of Technology
and 3University of North Carolina at Chapel Hill

Abstract: Multinomial logistic regression models are popular in multicategory classification analysis, but existing models suffer several intrinsic drawbacks. In particular, the parameters cannot be determined uniquely because of the over-specification. Although additional constraints have been imposed to refine the model, such mod- ifications can be inefficient and complicated. In this paper, we propose a novel and efficient simplex-based multinomial logistic regression technique, seamlessly connecting binomial and multinomial cases under a unified framework. Compared with existing models, our model has fewer parameters, is free of any constraints, and can be solved efficiently using the Fisher scoring algorithm. In addition, the proposed model enjoys several theoretical advantages, including Fisher consistency and sharp comparison inequality. Under mild conditions, we establish the asymptotical normality and convergence for the new model, even when the numbers of categories and covariates increase with the sample size. The proposed framework is illustrated by means of extensive simulations and real applications.

Key words and phrases: Asymptotics, classification, Fisher consistency, kernel learning, MLR, simplex coding scheme.

Back To Index Previous Article Next Article Full Text