Abstract

This paper is concerned with Spearman’s correlation matrices under large dimensional

regime, in which the data dimension diverges to infinity proportionally with the sample size. We establish the central limit theorem for the linear spectral statistics of Spearman’s correlation matrices,

which extends the results of Bao et al. (2015). We also study the improved Spearman’s correlation

matrices of Hoeffding (1948) which is a standard U-statistic of order 3. As applications, we propose

three new test statistics for large dimensional independent test and numerical studies demonstrate

the applicability of our proposed methods.

Information

Preprint No.SS-2024-0395
Manuscript IDSS-2024-0395
Complete AuthorsHantao Chen, Cheng Wang
Corresponding AuthorsCheng Wang
Emailschengwang@sjtu.edu.cn

References

  1. Anderson, G. W. and O. Zeitouni (2008). A CLT for regularized sample covariance matrices. Annals of Statistics 36(6), 2553–2576.
  2. Anderson, T. (2003). An Introduction to Multivariate Statistical Analysis. Wiley.
  3. Bai, Z., D. Jiang, J. Yao, and S. Zheng (2009). Corrections to lrt on large dimensional covariance matrix by rmt. Annals of Statistics 37(6B), 3822–3840.
  4. Bai, Z. and J. W. Silverstein (2004). CLT for linear spectral statistics of large-dimensional sample covariance matrices. Annals of Probability 32(1A), 553–605.
  5. Bai, Z. and J. W. Silverstein (2010). Spectral analysis of large dimensional random matrices. New York: Springer.
  6. Bai, Z. and W. Zhou (2008). Large sample covariance matrices without independence structures in columns. Statistica Sinica 18, 425–442.
  7. Bandeira, A. S., A. Lodhia, and P. Rigollet (2017). Mar˘cenko-Pastur law for Kendall’s tau. Electronic Communications in Probability 22(32), 1–7.
  8. Bao, Z. (2019). Tracy–widom limit for Kendall’s tau. Annals of Statistics 47(6), 3504–3532.
  9. Bao, Z., L.-C. Lin, G. Pan, and W. Zhou (2015). Spectral statistics of large dimensional Spearman’s rank correlation matrix and its application. Annals of Statistics 43(6), 2588–2623.
  10. Bao, Z., G. Pan, and W. Zhou (2012). Tracy-Widom law for the extreme eigenvalues of sample correlation matrices. Electronic Journal of Probability 17(88), 1–32.
  11. Chen, S. X., L.-X. Zhang, and P.-S. Zhong (2010). Tests for high-dimensional covariance matrices. Journal of the American Statistical Association 105(490), 810–819.
  12. Dobriban, E. and S. Wager (2018). High-dimensional asymptotics of prediction: Ridge regression and classification. Annals of Statistics 46(1), 247–279.
  13. El Karoui, N. (2009). Concentration of measure and spectra of random matrices: Applications to correlation matrices, elliptical distributions and beyond. Annals of Applied Probability 19(6), 2362–2405.
  14. Gao, J., X. Han, G. Pan, and Y. Yang (2017). High dimensional correlation matrices: The central limit theorem and its applications. Journal of the Royal Statistical Society, Series B 79(3), 677–693.
  15. Han, F., S. Chen, and H. Liu (2017). Distribution-free tests of independence in high dimensions. Biometrika 104(4), 813–828.
  16. Hastie, T., A. Montanari, S. Rosset, and R. J. Tibshirani (2022). Surprises in high-dimensional ridgeless least squares interpolation. Annals of Statistics 50(2), 949–986.
  17. Heiny, J. and N. Parolya (2024). Log determinant of large correlation matrices under infinite fourth moment. Annales de l’Institut Henri Poincar´e, Probabilit´es et Statistiques 60(2), 1048–1076.
  18. Heiny, J. and J. Yao (2022). Limiting distributions for eigenvalues of sample correlation matrices from heavy-tailed populations. Annals of Statistics 50(6), 3249–3280.
  19. Hoeffding, W. (1948). A class of statistics with asymptotically normal distribution. Annals of Mathematical Statistics 19(3), 293–325.
  20. Jiang, T. (2004a). The asymptotic distributions of the largest entries of sample correlation matrices. Annals of Applied Probability 14(2), 865–880.
  21. Jiang, T. (2004b). The limiting distributions of eigenvalues of sample correlation matrices. Sankhy¯a: The Indian Journal of Statistics 66(1), 35–48.
  22. Jiang, T. (2019). Determinant of sample correlation matrix with application. Annals of Applied Probability 29(3), 1356–1397.
  23. Leung, D. and M. Drton (2018). Testing independence in high dimensions with sums of rank correlations. Annals of Statistics 46(1), 280–307.
  24. Li, Z., C. Wang, and Q. Wang (2023). On eigenvalues of a high-dimensional Kendall’s rank correlation matrix with dependence. Science China Mathematics 66(11), 2615–2640.
  25. Li, Z., Q. Wang, and R. Li (2021). Central limit theorem for linear spectral statistics of large dimensional Kendall’s rank correlation matrices and its applications. Annals of Statistics 49(3), 1569–1593.
  26. Lytova, A. and L. Pastur (2009). Central limit theorem for linear eigenvalue statistics of random matrices with independent entries. Annals of Probability 37(5), 1778–1840.
  27. Mar˘cenko, V. and L. Pastur (1967). Distribution of eigenvalues for some sets of random matrices. Sbornik: Mathematics 1(4), 457–483.
  28. Mestre, X. and P. Vallet (2017). Correlation tests and linear spectral statistics of the sample correlation matrix. IEEE Transactions on Information Theory 63(7), 4585–4618.
  29. Pan, G. (2014). Comparison between two types of large sample covariance matrices. Annales de l’IHP Probabilit´es et statistiques 50(2), 655–677.
  30. Pan, G. and W. Zhou (2008). Central limit theorem for signal-to-interference ratio of reduced rank linear receiver. Annals of Applied Probability 18(3), 1232–1270.
  31. Parolya, N., J. Heiny, and D. Kurowicka (2024). Logarithmic law of large random correlation matrices. Bernoulli 30(1), 346–370.
  32. Paul, D. and A. Aue (2014). Random matrix theory in statistics: A review. Journal of Statistical Planning and Inference 150, 1–29.
  33. Pillai, N. S. and J. Yin (2012). Edge universality of correlation matrices. Annals of Statistics 40(3), 1737–1763.
  34. Wang, C. and B. Jiang (2018). On the dimension effect of regularized linear discriminant analysis. Electronic Journal of Statistics 12(2), 2709–2742.
  35. Wang, C., J. Yang, B. Miao, and L. Cao (2013). Identity tests for high dimensional data using RMT. Journal of Multivariate Analysis 118, 128–137.
  36. Wang, H., B. Liu, L. Feng, and Y. Ma (2024). Rank-based max-sum tests for mutual independence of highdimensional random vectors. Journal of Econometrics 238(1), 105578.
  37. Wang, Q. and J. Yao (2013). On the sphericity test with large-dimensional observations. Electronic Journal of Statistics 7, 2164–2192.
  38. Wu, Z. and C. Wang (2022). Limiting spectral distribution of large dimensional Spearman’s rank correlation matrices. Journal of Multivariate Analysis 191, 105011.
  39. Yao, J., S. Zheng, and Z. Bai (2015). Large sample covariance matrices and high-dimensional data analysis. Cambridge: Cambridge University Press.
  40. Zheng, S., Z. Bai, and J. Yao (2015). Substitution principle for CLT of linear spectral statistics of high-dimensional sample covariance matrices with applications to hypothesis testing. Annals of Statistics 43(2), 546–591.
  41. Zheng, S., G. Cheng, J. Guo, and H. Zhu (2019). Test for high dimensional correlation matrices. Annals of Statistics 47(5), 2887–2921.
  42. Zhou, W. (2007). Asymptotic distribution of the largest off-diagonal entry of correlation matrices. Transactions of the American Mathematical Society 359(11), 5345–5363. Hantao Chen

Acknowledgments

The authors thank the Editor, the Associate Editor, and anonymous reviewers for their

insightful comments on earlier versions of this paper. Cheng Wang’s research is partially

supported by NSFC 12031005, NSFC 72495121 and the fundamental research funds for

the central universities.

Supplementary Materials

The online Supplementary Material includes the detailed proofs of the main theorems and

additional lemmas.


Supplementary materials are available for download.