Abstract
This work develops inference tools for the mean function of functional data over
a multi-dimensional domain.
A two-step mean estimator based on tensor product spline
estimates of individual trajectories is shown oracally efficient, i.e., it is asymptotically indistinguishable from the infeasible estimator using unobservable trajectories. Consistent esti-
mates of covariance function as well as exact quantile of the limiting maximal deviation are
obtained by innovative use of results on sharp comparison of Gaussian extreme distributions
and quantiles, leading to asymptotic coverage and order n−1/2 uniformly adaptive width of
data-driven simultaneous confidence regions (SCRs). Also formulated are one-sided SCRs
that can be used for testing against uniform upper and lower bound of the mean function.
Extensive Monte Carlo experiments corroborate the theory, and a satellite ocean dataset
collected by Copernicus Marine Environment Monitoring Service (CMEMS) illustrates how
the proposed SCR is used.
Information
| Preprint No. | SS-2024-0344 |
|---|---|
| Manuscript ID | SS-2024-0344 |
| Complete Authors | Qirui Hu, Lijian Yang |
| Corresponding Authors | Lijian Yang |
| Emails | yanglijian@tsinghua.edu.cn |
References
- Adler, R. J., and Taylor, J. E. (2007). Random Fields and Geometry. Springer, New York.
- Cao, G., Wang, L., Li, Y., and Yang, L. (2016). Oracle-efficient confidence envelopes for covariance functions in dense functional data. Statist. Sinica 26 359–383.
- Cao, G., Yang, L., and Todem, D. (2012). Simultaneous inference for the mean function based on dense functional data. J. Nonparametr. Stat. 24 359–377.
- Chernozhukov, V., Chetverikov, D., and Kato, K. (2015). Comparison and anticoncentration bounds for maxima of Gaussian random vectors. Probab. Theory Relat. Fields 162 47–70.
- Cs˝org˝o, M., and R´ev´esz, P. (1981). Strong Approximations in Probability and Statistics. Academic Press, New York-London.
- De Boor, C. (2001). A Practical Guide to Splines. Springer, New York.
- Degras, D. A. (2011). Simultaneous confidence bands for nonparametric regression with functional data. Statist. Sinica 21 1735–1765.
- Ferraty, F., and Vieu, P. (2006). Nonparametric Functional Data Analysis: Theory and Practice. Springer, New York.
- G¨otze, F., and Zaitsev, A. Y. (2010). Rates of approximation in the multidimensional invariance principle for sums of i.i.d. random vectors with finite moments. J. Math. Sci. 167 495–500.
- Gu, L., Wang, L., H¨ardle, W., and Yang, L. (2014). A simultaneous confidence corridor for varying coefficient regression with sparse functional data. TEST 23 806–843.
- Gu, L., and Yang, L. (2015). Oracally efficient estimation for single-index link function with simultaneous confidence band. Electron. J. Statist. 9 1540–1561.
- Hall, P., M¨uller, H. G., and Wang, J. (2006). Properties of principal component methods for functional and longitudinal data analysis. Ann. Statist. 34 1493–1517.
- Horv´ath, L., and Kokoszka, P. (2012). Inference for Functional Data with Applications. Springer, New York.
- Hsing, T., and Eubank, R. (2015). Theoretical Foundations of Functional Data Analysis, with an Introduction to Linear Operators. Wiley, Chichester.
- Hu, Q., and Li, J. (2024). Statistical inference for mean function of longitudinal imaging data over complicated domains. Statist. Sinica 34 955–982.
- Huang, K., Zheng, S., and Yang, L. (2022). Inference for dependent error functional data with application to event-related potentials. TEST 31 1100–1120.
- Klenke, A. (2014). Probability Theory, 2nd edn. Springer, London.
- Li, J., and Yang, L. (2023). Statistical inference for functional time series. Statist. Sinica 33 519–549.
- Ma, S., Yang, L., and Carroll, R. (2012). A simultaneous confidence band for sparse longitudinal regression. Statist. Sinica 22 95–122.
- Ramsay, J., and Dalzell, C. (1991). Some tools for functional data analysis. J. Roy. Statist. Soc. Ser. B 53 539–572.
- Schumaker, L. (1981). Spline Functions: Basic Theory. Wiley.
- Song, Q., and Yang, L. (2009). Spline confidence bands for variance functions. J. Nonparametr. Stat. 5 589–609.
- Tsirel’Son, V. S. (1976). The density of the distribution of the maximum of a Gaussian process. Theory Probab. Appl. 20 847–856.
- Wang, J., Cao, G., Wang, L., and Yang, L. (2020). Simultaneous confidence band for stationary covariance function of dense functional data. J. Multivariate Anal. 176 104584.
- Wang, Y., Wang, G., Wang, L., and Ogden, R. T. (2020). Simultaneous confidence corridors for mean functions in functional data analysis of imaging data. Biometrics 76 427–437.
- Wang, J., and Yang, L. (2009). Polynomial spline confidence bands for regression curves. Statist. Sinica 19 325–342.
- Yang, L. (2025a). Strict monotonicity of stochastic process extreme distributions. Statist. Probab. Lett. 217 110292.
- Yang, L. (2025b). Continuity of Gaussian extreme distributions. Statist. Probab. Lett.
- 215 110274.
- Yu, S., Wang, G., Wang, L., and Yang, L. (2021). Multivariate spline estimation and inference for image-on-scalar regression. Statist. Sinica 31 1463–1487.
- Zheng, S., Liu, R., Yang, L., and H¨ardle, W. (2016). Statistical inference for generalized additive models: Simultaneous confidence corridors and variable selection. TEST 25 607–626.
- Zheng, S., Yang, L., and H¨ardle, W. (2014). A smooth simultaneous confidence corridor for the mean of sparse functional data. J. Amer. Statist. Assoc. 109 661– 673.
- Zhong, C., and Yang, L. (2023). Statistical inference for functional time series: Autocovariance function. Statist. Sinica 33 2519–2543.
Acknowledgments
This work is partially supported by the National Natural Science Foundation of
China under award No. 12171269.
Supplementary Materials
This supplement provides tables and figures of in Section 4, additional simulations
and detailed proofs of the theoretical results with necessarily technical lemmas.