Back To Index Previous Article Next Article Full Text

Statistica Sinica 32 (2022), 939-959

ADJUSTING SYSTEMATIC BIAS IN HIGH
DIMENSIONAL PRINCIPAL COMPONENT SCORES

Sungkyu Jung

Seoul National University

Abstract: Principal component analysis continues to be a powerful tool for the dimension reduction of high-dimensional data. We assume a variance-diverging model and use the high-dimension low-sample-size asymptotics to show that even though the principal component directions are not consistent, the sample and prediction principal component scores can be useful in revealing the population structure. We further show that these scores are biased, and that the bias is asymptotically decomposed into rotation and scaling parts. We propose bias-adjustment methods that are shown to be consistent and work well in high-dimensional situations with small sample sizes. The potential advantage of the bias adjustment is demonstrated in a classification setting.

Key words and phrases: HDLSS, jackknife, principal component analysis.

Back To Index Previous Article Next Article Full Text