Abstract
In this study, we explore a robust testing procedure for the high
dimensional location parameters testing problem. Initially, we introduce a spatialsign based max-type test statistic, which exhibits excellent performance for sparse
alternatives.
Subsequently, we demonstrate the asymptotic independence between this max-type test statistic and the spatial-sign based sum-type test statis-
tic (Feng and Sun, 2016). Building on this, we propose a spatial-sign based maxsum type testing procedure, which shows remarkable performance under varying
signal sparsity. Our simulation studies underscore the superior performance of
the procedures we propose.
Information
| Preprint No. | SS-2024-0051 |
|---|---|
| Manuscript ID | SS-2024-0051 |
| Complete Authors | Jixuan Liu, Long Feng, Ping Zhao, Zhaojun Wang |
| Corresponding Authors | Long Feng |
| Emails | flnankai@nankai.edu.cn |
References
- Ayyala, D. N., J. Park, and A. Roy (2017). Mean vector testing for high-dimensional dependent observations. Journal of Multivariate Analysis 153, 136–155.
- Bai, Z. and H. Saranadasa (1996). Effect of high dimension: by an example of a two sample problem. Statistica Sinica 6, 311–329.
- Bickel, P. J. and E. Levina (2008). Covariance regularization by thresholding. The Annals of Statistics 36(6), 2577–2604.
- Cai, T., W. Liu, and Y. Xia (2013, 08). Two-sample test of high dimensional means under dependence. Journal of the Royal Statistical Society Series B: Statistical Methodology 76(2), 349–372.
- Chang, J., X. Chen, and M. Wu (2024). Central limit theorems for high dimensional dependent data. Bernoulli 30(1), 712–742.
- Chang, J., Q. Jiang, and X. Shao (2023). Testing the martingale difference hypothesis in high dimension. Journal of Econometrics 235(2), 972–1000.
- Chang, J., Q. Yao, and W. Zhou (2017). Testing for high-dimensional white noise using maximum cross-correlations. Biometrika 104(1), 111–127.
- Chen, S. X. and Y.-L. Qin (2010). A two-sample test for high-dimensional data with applications to gene-set testing. The Annals of Statistics 38(2), 808–835.
- Chen, S. X., L.-X. Zhang, and P.-S. Zhong (2010). Tests for high-dimensional covariance matrices. Journal of the American Statistical Association 105(490), 810–819.
- Cheng, G., B. Liu, L. Peng, B. Zhang, and S. Zheng (2019). Testing the equality of two highdimensional spatial sign covariance matrices. Scandinavian Journal of Statistics 46(1), 257–271.
- Cheng, G., L. Peng, and C. Zou (2023). Statistical inference for ultrahigh dimensional location parameter based on spatial median. arXiv preprint arXiv:2301.03126.
- Cutting, C., D. Paindaveine, and T. Verdebout (2017). Testing uniformity on high-dimensional spheres against monotone rotationally symmetric alternatives. The Annals of Statistics 45(3), 1024–1058.
- Fang, K. W. (2018). Symmetric multivariate and related distributions. CRC Press.
- Feng, L., T. Jiang, X. Li, and B. Liu (2022). Asymptotic independence of the sum and maximum of dependent random variables with applications to high-dimensional tests. arXiv preprint arXiv:2205.01638.
- Feng, L., T. Jiang, B. Liu, and W. Xiong (2022). Max-sum tests for cross-sectional independence of high-dimensional panel data. Annals of Statistics 50(2), 1124–1143.
- Feng, L., B. Liu, and Y. Ma (2021). An inverse norm sign test of location parameter for high-dimensional data. Journal of Business & Economic Statistics 39(3), 807–815.
- Feng, L., B. Liu, and Y. Ma (2022). Testing for high-dimensional white noise. arXiv preprint arXiv:2211.02964.
- Feng, L. and F. Sun (2016). Spatial-sign based high-dimensional location test. Electronic Journal of Statistics 10, 2420–2434.
- Feng, L., X. Zhang, and B. Liu (2020). A high-dimensional spatial rank test for two-sample location problems. Computational Statistics & Data Analysis 144, 106889.
- Feng, L., C. Zou, and Z. Wang (2016). Multivariate-sign-based high-dimensional tests for the two-sample location problem. Journal of the American Statistical Association 111(514), 721–735.
- Feng, L., C. Zou, Z. Wang, and L. Zhu (2015). Two-sample behrens-fisher problem for highdimensional data. Statistica Sinica 25, 1297–1312.
- Hallin, M. and D. Paindaveine (2006). Semiparametrically efficient rank-based inference for shape. i. optimal rank-based tests for sphericity. Annals of Statistics 34(6), 2707–2756.
- He, Y., G. Xu, C. Wu, and W. Pan (2021). Asymptotically independent u-statistics in highdimensional testing. Annals of Statistics 49(1), 151–181.
- Hettmansperger, T. P. and R. H. Randles (2002). A practical affine equivariant multivariate median. Biometrika 89(4), 851–860.
- Huang, X., B. Liu, Q. Zhou, and L. Feng (2023). A high-dimensional inverse norm sign test for two-sample location problems. Canadian Journal of Statistics 51(4), 1004–1033.
- Ilmonen, P. and D. Paindaveine (2011). Semiparametrically efficient inference based on signed ranks in symmetric independent component models. Annals of Statistics 39(5), 2448–2476.
- Li, J. and S. X. Chen (2012). Two sample tests for high-dimensional covariance matrices. Annals of Statistics 40(2), 908–940.
- Li, W. and Y. Xu (2022). Asymptotic properties of high-dimensional spatial median in elliptical distributions with application. Journal of Multivariate Analysis 190, 104975.
- Liu, Y. and J. Xie (2020). Cauchy combination test: A powerful test with analytic p-value calculation under arbitrary dependency structures. Journal of the American Statistical Association 115(529), 393–402.
- Long, M., Z. Li, W. Zhang, and Q. Li (2023). The cauchy combination test under arbitrary dependence structures. The American Statistician 77(2), 134–142.
- Ma, H., L. Feng, Z. Wang, and J. Bao (2024). Adaptive testing for alphas in conditional factor models with high dimensional assets. Journal of Business & Economic Statistics 42(4), 1356–1366.
- Ma, H., L. Feng, Z. Wang, and B. Jigang (2024). Testing alpha in high dimensional linear factor pricing models with dependent observations. arXiv preprint arXiv:2401.14052.
- Nordhausen, K., H. Oja, and D. Paindaveine (2009). Signed-rank tests for location in the symmetric independent component model. Journal of Multivariate Analysis 100(5), 821– 834.
- Oja, H. (2010). Multivariate nonparametric methods with R: an approach based on spatial signs and ranks. Springer Science & Business Media.
- Paindaveine, D. and T. Verdebout (2016). On high-dimensional sign tests. Bernoulli 22(3), 1745–1769.
- Park, J. and D. N. Ayyala (2013). A test for the mean vector in large dimension and small samples. Journal of Statistical Planning and Inference 143(5), 929–943.
- Srivastava, M. S. (2009). A test for the mean vector with fewer observations than the dimension under non-normality. Journal of Multivariate Analysis 100(3), 518–532.
- Wang, G. and L. Feng (2023). Computationally efficient and data-adaptive changepoint inference in high dimension. Journal of the Royal Statistical Society Series B: Statistical Methodology 85(3), 936–958.
- Wang, L., B. Peng, and R. Li (2015). A high-dimensional nonparametric multivariate test for mean vector. Journal of the American Statistical Association 110(512), 1658–1669.
- Wu, C., G. Xu, and W. Pan (2019). An adaptive test on high-dimensional parameters in generalized linear models. Statistica Sinica 29(4), 2163–2186.
- Wu, C., G. Xu, X. Shen, and W. Pan (2020). A regularization-based adaptive test for highdimensional generalized linear models. The Journal of Machine Learning Research 21(1), 5005–5071.
- Xu, G., L. Lin, P. Wei, and W. Pan (2016). An adaptive two-sample test for high-dimensional means. Biometrika 103(3), 609–624.
- Yao, J., S. Zheng, and Z. Bai (2015). Sample covariance matrices and high-dimensional data analysis. Cambridge University Press, New York.
- Yu, X., D. Li, and L. Xue (2024). Fisher’s combined probability test for high-dimensional covariance matrices. Journal of the American Statistical Association 119(545), 511–524.
- Yu, X., J. Yao, and L. Xue (2024). Power enhancement for testing multi-factor asset pricing models via fisher’s method. Journal of Econometrics 239(2), 105458.
- Zhang, X. and G. Cheng (2018). Gaussian approximation for high dimensional vector under physical dependence. Bernoulli 24(4A), 2640–2675.
- Zhao, P., D. Chen, and Z. Wang (2024). Spatial-sign-based high-dimensional white noises test. Statistical Theory and Related Fields 8(4), 251–261.
- Zou, C., L. Peng, L. Feng, and Z. Wang (2014). Multivariate sign-based high-dimensional tests for sphericity. Biometrika 101(1), 229–236.
Acknowledgments
The authors thank the editor, the associate editor and three anonymous
referees for helpful comments and discussions. This paper is partially supported by Shenzhen Wukong Investment Company, Tianjin Science Fund
for Outstanding Young Scholar (23JCJQJC00150), the Fundamental Research Funds for the Central Universities under Grant No. ZB22000105 and
63233075, the China National Key R&D Program (Grant Nos. 2019YFC1908502,
2022YFA1003703, 2022YFA1003802, 2022YFA1003803) and the National
Natural Science Foundation of China Grants (Nos. 12271271, 11925106,
12231011, 11931001 and 11971247).
Long Feng and Ping Zhao are the
Supplementary Materials
The online Supplementary Material provides the proofs of the lemmas, theorems given in Section 2-3 and two real data applications. The proofs for
lemmas and theorems are given in Appendix S1 and S2 respectively and
the real data applications are in Appendix S4, in the online Supplementary
Material.