 
 
 
 
Statistica Sinica 28 (2018), 2541-2564
Abstract:  We study improved  approximations to the distribution of the largest eigenvalue  of the sample covariance matrix of 
𝓃 zero-mean Gaussian observations in dimension p +1. We assume that one population principal component  has variance  𝓁 > 1 and the remaining ‘noise’ components have common variance 1. In the high-dimensional  limit
of the sample covariance matrix of 
𝓃 zero-mean Gaussian observations in dimension p +1. We assume that one population principal component  has variance  𝓁 > 1 and the remaining ‘noise’ components have common variance 1. In the high-dimensional  limit  > 0, we study Edgeworth corrections to the limiting Gaussian distribution of
 > 0, we study Edgeworth corrections to the limiting Gaussian distribution of   in the  supercritical case 𝓁 > 1+
 in the  supercritical case 𝓁 > 1+ . The skewness correction involves a quadratic polynomial, as in classical settings, but the coefficients reflect the high-dimensional structure. The methods involve Edgeworth expansions for sums of independent non-identically distributed variates obtained by conditioning on the sample noise eigenvalues, and the limiting bulk properties and fluctuations of these noise eigenvalues.
. The skewness correction involves a quadratic polynomial, as in classical settings, but the coefficients reflect the high-dimensional structure. The methods involve Edgeworth expansions for sums of independent non-identically distributed variates obtained by conditioning on the sample noise eigenvalues, and the limiting bulk properties and fluctuations of these noise eigenvalues.
Key words and phrases: Edgeworth expansion, Roy’s statistic, spiked PCA model.
 
 
 
