Statistica Sinica 26 (2016), 955-978
Abstract: Let X1,…,Xn be independent observations with Xi ~ N(θi,1), where (θ1,…,θn) is an unknown vector of normal means. Let fn(x) = ∑ i=1n(d∕dx)Pn{Xi ≤ x}∕n be the average marginal density of observations. We consider the problem of testing H0:fn ∈0, where 0 is a family of mixture densities. This includes detecting nonzero normal means with 0 = {fδ0} and testing homogeneity in mixture models with 0 = {fδμ}. We study a generalized likelihood ratio test (GLRT) based on the generalized maximum likelihood estimator (GMLE, Robbins (1950); Kiefer and Wolfowitz (1956)). We establish a large deviation inequality that provides a divergence rate n of the GLRT under the null hypothesis. The inequality implies that the significance level of the test is of equal or smaller order than nn2. We show that the test can detect any alternative that is separated from the null by Hellinger distance n. For the two-component Gaussian mixture, it turns out that the GLRT has full power asymptotically throughout the same region of amplitude sparsity where the Neyman-Pearson likelihood ratio test separates the two hypotheses completely (Donoho and Jin (2004)). We demonstrate the power of the GLRT for moderate samples with numerical experiments.
Key words and phrases: Detection boundary, generalized likelihood ratio test, generalized maximum likelihood estimator, normal mixture, sparse normal means.