Abstract
In this study, we introduce an inferential procedure for assessing the covariance
difference between two-samples of large-scale functional data, utilizing a computationally efficient multiplier bootstrap approach. In contrast to the existing method that focuses
exclusively on a testing procedure, our approach starts by establishing a confidence region
for the covariance difference under fairly flexible conditions. This leads not only to a more
powerful test but also to an accessible estimated power function, which is shown consistent
across a broad range of alternatives. A notable characteristic of the new procedure is that it
requires less stringent distributional assumptions for large-scale functional data, and does
not impose any structural constraints on covariances or correlations. Moreover, the proposed procedure benefits from the desirable properties of being “eigenvalue-decay-free”
and “square-integrable-free”, tailored for functional data. We conduct a simulation study
and a real data application to assess the numerical performance of the proposed method.
Key words and phrases: Functional data; Ultra-high-dimensionality; Covariance function; Confidence region; Multiplier bootstrap
Information
| Preprint No. | SS-2025-0295 |
|---|---|
| Manuscript ID | SS-2025-0295 |
| Complete Authors | Kaijie Xue, Lan Xue, Riquan Zhang |
| Corresponding Authors | Riquan Zhang |
| Emails | zhangriquan@163.com |
References
- Cai, T., W. Liu, and Y. Xia (2013). Two-sample covariance matrix testing and support recovery in high-dimensional and sparse settings. Journal of the American Statistical Association 108(501), 265–277.
- Cai, T. T., W. Liu, and Y. Xia (2014). Two-sample test of high dimensional means under dependence. Journal of the Royal Statistical Society. Series B: Statistical Methodology 76(2), 349–372.
- Cardot, H., F. Ferraty, and P. Sarda (1999). Functional linear model. Statistics & Probability Letters 45(1), 11–22.
- Chang, J., W. Zhou, W.-X. Zhou, and L. Wang (2017). Comparing large covariance matrices under weak conditions on the dependence structure and its application to gene clustering. Biometrics 73(1), 31–41.
- Chang, J.-S., H.-Z. Huang, M. Yuan, Y. Zhou, D. Liu, K.-B. Zhan, and L.-Q.
- Zhu (2025). Alcohol addiction and alzheimers disease: a molecular collision course. Translational Psychiatry 15(1), 410.
- Chernozhukov, V., D. Chetverikov, and K. Kato (2013). Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors. The Annals of Statistics 41(6), 2786–2819.
- Dai, X., H.-G. Muller, and F. Yao (2017). Optimal bayes classifiers for functional data and density ratios. Biometrika 104, 545–560.
- Delaigle, A. and P. Hall (2012). Achieving near perfect classification for functional data. Journal of the Royal Statistical Society. Series B 74(2), 267–286.
- Donoho, D. and J. Jin (2004). Higher criticism for detecting sparse heterogeneous mixtures. The Annals of Statistics 32(3), 962–994.
- Engels, M. M., C. J. Stam, W. M. Van Der Flier, P. Scheltens, H. de Waal, and
- E. C. van Straaten (2015). Declining functional connectivity and changing hub locations in alzheimers disease: an eeg study. BMC neurology 15(1), 145.
- Fan, Y., G. M. James, and P. Radchenko (2015). Functional additive regression. Annals of Statistics 43(5), 2296–2325.
- Fang, Q., Q. Jiang, and X. Qiao (2024). Large-scale multiple testing of crosscovariance functions with applications to functional network models. arXiv preprint arXiv:2407.19399.
- Ferraty, F. and P. Vieu (2006). Nonparametric Functional Data Analysis. New York: Springer, New York.
- Fremdt, S., J. G. Steinebach, L. Horvath, and P. Kokoszka (2013). Testing the equality of covariance operators in functional samples. Scandinavian Journal of Statistics 40(1), 138–152.
- Guo, J., B. Zhou, and J.-T. Zhang (2019). New tests for equality of several covariance functions for functional data. Journal of the American Statistical Association 114(527), 1251–1263.
- Hall, P. and J. L. Horowitz (2007). Methodology and convergence rates for functional linear regression. Annals of Statistics 35(1), 70–91.
- Hall, P. and M. Hosseini-Nasab (2006). On properties of functional principal components analysis. Journal of the Royal Statistical Society: Series B 68(1), 109–126.
- Han, Y. and W. B. Wu (2020). Test for high dimensional covariance matrices. The Annals of Statistics 48(6), 3565–3588.
- He, J. and S. Chen (2018). High-dimensional two-sample covariance matrix testing via super-diagonals. Statistica Sinica 28(4), 2671–2696.
- Kong, D., K. Xue, F. Yao, and H. H. Zhang (2016). Partially functional linear regression in high dimensions. Biometrika 103(1), 147–159.
- Maleki, N., K. S. Sawyer, S. Levy, G. J. Harris, and M. Oscar-Berman (2022). Intrinsic brain functional connectivity patterns in alcohol use disorder. Brain Communications 4(6), fcac290.
- Neeraj, V. Singhal, J. Mathew, and R. K. Behera (2021). Detection of alcoholism using eeg signals and a cnn-lstm-attn network. Computers in Biology and Medicine 138, 104940.
- Panaretos, V. M., D. Kraus, and J. H. Maddocks (2010). Second-order comparison of gaussian random functions and the geometry of dna minicircles. Journal of the American Statistical Association 105(490), 670–682.
- Paparoditis, E. and T. Sapatinas (2016). Bootstrap-based testing of equality of mean functions or equality of covariance operators for functional data. Biometrika 103(3), 727–733.
- Pigoli, D., J. A. D. Aston, I. L. Dryden, and P. Secchi (2014). Distances and inference for covariance operators. Biometrika 101(2), 409–422.
- Poldrack, R. A., J. A. Mumford, and T. E. Nichols (2024). Handbook of functional MRI data analysis. Cambridge University Press.
- Ramsay, J. O. and C. J. Dalzell (1991). Some tools for functional data analysis. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 53(3), 539–572.
- Ramsay, J. O. and B. W. Silverman (2005). Functional Data Analysis (2nd ed.). New York: Springer.
- Rzepecki-Smith, C. I., S. A. Meda, V. D. Calhoun, M. C. Stevens, M. J. Jafri,
- R. S. Astur, and G. D. Pearlson (2010). Disruptions in functional network connectivity during alcohol intoxicated driving. Alcoholism: clinical and experimental research 34(3), 479–487.
- Schott, J. R. (2007). A test for the equality of covariance matrices when the dimension is large relative to the sample sizes. Computational Statistics and Data Analysis 51(12), 6535–6542.
- Srivastava, M. S. and H. Yanagihara (2010). Testing the equality of several covariance matrices with fewer observations than the dimension. Journal of Multivariate Analysis 101(6), 1319–1329.
- Wang, J.-L., J.-M. Chiou, and H.-G. M¨uller (2016). Functional data analysis. Annual Review of Statistics and its application 3(1), 257–295.
- Weiland, B. J., A. Sabbineni, V. D. Calhoun, R. C. Welsh, A. D. Bryan, R. E.
- Jung, A. R. Mayer, and K. E. Hutchison (2014). Reduced left executive control network functional connectivity is associated with alcohol use disorders. Alcoholism: Clinical and Experimental Research 38(9), 2445–2453.
- Wu, T.-L. and P. Li (2020). Projected tests for high-dimensional covariance matrices. Journal of Statistical Planning and Inference 207, 73–85.
- Xue, K. (2023). Distribution/correlation-free test for two-sample means in highdimensional functional data with eigenvalue decay relaxed. SCIENCE CHINA Mathematics 66, 2337–2346.
- Xue, K., J. Yang, and F. Yao (2024). Optimal linear discriminant analysis for high-dimensional functional data. Journal of the American Statistical Association 119(546), 1055–1064.
- Xue, K., J. Yang, and R. Zhang (2025). Testing the equality of covariances for large-scale functional data. SCIENCE CHINA Mathematics, DOI: 10.1007/s11425–023–2441–0.
- Xue, K. and F. Yao (2020). Distribution and correlation-free two-sample test of high-dimensional means. The Annals of Statistics 48(3), 1304–1328.
- Xue, K. and F. Yao (2021). Hypothesis testing in large-scale functional linear regression. Statistica Sinica 31, 1101–1123.
- Xue, K. and F. Yao (2024). Inference on large-scale partially functional linear model with heterogeneous errors. Statistica Sinica 34, 679–697.
- Yao, F., H.-G. M¨uller, and J.-L. Wang (2005). Functional linear regression analysis for longitudinal data. The Annals of Statistics 33(6), 2873–2903.
- Zou, T., R. Lin, S. rong Zheng, and G.-L. Tian (2021). Two-sample tests for high-dimensional covariance matrices using both difference and ratio. Electronic Journal of Statistics 15, 135–210.
Acknowledgments
Kaijie Xue and Lan Xue are the co-first authors. Riquan Zhang is the corical order, and all authors have made equal contributions. This research was
supported by the National Natural Science Foundation of China (12371268,
12531013, 12371272), the Youth Project of Shanghai Eastern Talent Program
(QNJY2024152), the Basic Research Project of Shanghai Science and Technology Commission (22JC1400800), and awards from the National Institutes of
Diabetes, Digestive, and Kidney Disease Award numbers R01DK132385.
Supplementary Materials
The auxiliary lemmas with their proofs, along with the proofs of the main
theorems, are relegated to an online Supplementary Material to save space.