Abstract
In this paper, we propose a robust nonparametric method for learning and inferring from noisy
imaging data by modeling it as contaminated functional data, which enables accurate estimation of the
underlying signals and efficient detection and localization of significant effects. The proposed robust and
smoothed M-estimator is based on bivariate penalized splines over triangulation, effectively addressing
challenges posed by contaminated imaging data, spatial dependencies, and irregular domains commonly
encountered in imaging applications. We establish the L2 convergence of the proposed M-based mean
function estimator under certain regularity conditions and investigate its asymptotic normality. Additionally,
we present a novel approach for constructing a simultaneous confidence corridor for the mean signal of a
set of noisy imaging data. Extensive simulation studies and a real-data application with brain imaging data
illustrate the effectiveness of the proposed robust methods.
Information
| Preprint No. | SS-2024-0402 |
|---|---|
| Manuscript ID | SS-2024-0402 |
| Complete Authors | Yang Long, Guanqun Cao, David Kepplinger, Lily Wang |
| Corresponding Authors | Lily Wang |
| Emails | lwang41@gmu.edu |
References
- Bali, J. L., Boente, G., Tyler, D. E., and Wang, J.-L. (2011), “Robust functional principal components: a projection-pursuit approach,” The Annals of Statistics, 39, 2852–2882.
- Belloni, A., Chernozhukov, V., Chetverikov, D., and Fernández-Val, I. (2019), “Conditional quantile processes based on series or many regressors,” Journal of Econometrics, 213, 4–29, annals: In Honor of Roger Koenker.
- Boente, G. and Salibian-Barrera, M. (2015), “S-Estimators for functional principal component analysis,” Journal of the American Statistical Association, 110, 1100–1111.
- Chang, C., Lin, X., and Ogden, R. T. (2017), “Simultaneous confidence bands for functional regression models,” Journal of Statistical Planning and Inference, 188, 67–81.
- Chen, X. and Zhou, W.-X. (2020), “Robust inference via multiplier bootstrap,” The Annals of Statistics, 48, 1665–1691.
- Choi, H. and Reimherr, M. (2017), “A geometric approach to confidence regions and bands for functional parameters,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 80, 239–260.
- Degras, D. (2017), “Simultaneous confidence bands for the mean of functional data,” Wiley Interdisciplinary Reviews: Computational Statistics, 9, e1397.
- Eddelbuettel, D. and Sanderson, C. (2014), “RcppArmadillo: Accelerating R with highperformance C++ linear algebra,” Computational Statistics & Data Analysis, 71, 1054– 1063.
- Gervini, D. (2008), “Robust functional estimation using the median and spherical principal components,” Biometrika, 95, 587–600.
- Gu, L., Wang, L., Härdle, W. K., and Yang, L. (2014), “A simultaneous confidence corridor for varying coefficient regression with sparse functional data,” TEST, 23, 806–843.
- Gu, Z., Yu, S., Wang, G., Lai, M.-J., and and, L. W. (2025), “TSSS: a novel triangulated spherical spline smoothing for surface-based data,” Journal of Nonparametric Statistics, 37, 683–712.
- Huang, C. and Zhu, H. (2022), “Functional hybrid factor regression model for handling heterogeneity in imaging studies,” Biometrika, 109, 1133–1148.
- Huber, P. J. (1964), “Robust estimation of a location parameter,” The Annals of Mathematical Statistics, 35, 73–101.
- Huber, P. J. (1973), “Robust regression: asymptotics, conjectures and Monte Carlo,” The Annals of Statistics, 1, 799–821.
- Kalogridis, I. and Van Aelst, S. (2023a), “Robust optimal estimation of location from discretely sampled functional data,” Scandinavian Journal of Statistics, 50, 411–451.
- Kalogridis, I. and Van Aelst, S. (2023b), “Robust penalized estimators for functional linear regression,” Journal of Multivariate Analysis, 194, 105104.
- Lai, M.-J. and Schumaker, L. L. (2007), Spline Functions on Triangulations, Encyclopedia of Mathematics and Its Applications, Cambridge: Cambridge University Press.
- Lai, M.-J. and Wang, L. (2013), “Bivariate penalized splines for regression,” Statistica Sinica, 23, 1399–1417.
- Lee, S., Shin, H., and Billor, N. (2013), “M-type smoothing spline estimators for principal functions,” Computational Statistics & Data Analysis, 66, 89–100.
- Li, J. and Yang, L. (2023), “Statistical inference for functional time series,” Statistica Sinica, 33, 519–549.
- Li, X., Yu, S., Wang, Y., Wang, G., Wang, L., and Lai, M.-J. (2024), “Nonparametric regression for 3D point cloud learning,” Journal of Machine Learning Research, 25, 1–56.
- Lima, I. R., Cao, G., and Billor, N. (2019a), “M-based simultaneous inference for the mean function of functional data,” Annals of the Institute of Statistical Mathematics, 71, 577–598.
- Lima, I. R., Cao, G., and Billor, N. (2019b), “Robust simultaneous inference for the mean function of functional data,” TEST, 28, 785–803.
- Liu, Y., Li, M., and Morris, J. S. (2023), Scalable function-on-scalar quantile regression for densely sampled functional data, available at: https://arxiv.org/abs/2002. 03355.
- Maronna, R. A., Martin, R. D., Yohai, V. J., and Salibián-Barrera, M. (2019), Robust Statistics: Theory and Methods (with R), Wiley Series in Probability and Statistics, Hoboken, NJ: John Wiley & Sons, Ltd, 2nd ed.
- Sanderson, C. and Curtin, R. (2016), “Armadillo: a template-based C++ library for linear algebra,” Journal of Open Source Software, 1, 26.
- Shewchuk, J. R. (1994), “An introduction to the conjugate gradient method without the agonizing pain,” Tech. rep., Carnegie Mellon University, USA, available at: https://www. cs.cmu.edu/~quake-papers/painless-conjugate-gradient.pdf.
- Shieh, D. and Ogden, R. T. (2023), “Permutation-based inference for function-on-scalar regression with an application in PET brain imaging,” Journal of Nonparametric Statistics, 35, 820–838.
- Shin, H. and Lee, S. (2016), “An RKHS approach to robust functional linear regression,” Statistica Sinica, 26, 255–272.
- Stone, C. J. (1985), “Additive regression and other nonparametric models,” The Annals of Statistics, 13, 689–705.
- Sun, Q., Zhou, W.-X., and Fan, J. (2020), “Adaptive Huber regression,” Journal of the American Statistical Association, 115, 254–265.
- Tang, Q. and Cheng, L. (2008), “M-estimation and B-spline approximation for varying coefficient models with longitudinal data,” Journal of Nonparametric Statistics, 20, 611– 625.
- Wang, S. and Cao, G. (2022), “Robust deep neural network estimation for multi-dimensional functional data,” Electronic Journal of Statistics, 16, 6461–6488.
- Wang, Y., Wang, G., Klinedinst, B., Willette, A., and Wang, L. (2025), “Statistical inference for mean functions of complex 3D objects,” Statistica Sinica, 35, 1–27.
- Wang, Y., Wang, G., Wang, L., and Ogden, R. T. (2020), “Simultaneous confidence corridors for mean functions in functional data analysis of imaging data,” Biometrics, 76, 427–437.
- Wei, Y. and He, X. (2006), “Conditional growth charts,” The Annals of Statistics, 34, 2069– 2097.
- Wood, S. (2017), Generalized Additive Models: An Introduction with R, Chapman and Hall/CRC, 2nd ed.
- Wood, S. N. (2003), “Thin plate regression splines,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 65, 95–114.
- Wood, S. N., Bravington, M. V., and Hedley, S. L. (2008), “Soap film smoothing,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 70, 931–955.
- Yu, S., Wang, G., and Wang, L. (2025), “Distributed heterogeneity learning for generalized partially linear models with spatially varying coefficients,” Journal of the American Statistical Association, 120, 779–793.
- Yu, S., Wang, G., Wang, L., Liu, C., and Yang, L. (2020), “Estimation and inference for generalized geoadditive models,” Journal of the American Statistical Association, 115, 761–774.
- Yu, S., Wang, G., Wang, L., and Yang, L. (2021), “Multivariate spline estimation and inference for image-on-scalar regression,” Statistica Sinica, 31, 1463–1487.
- Zhong, C. and Yang, L. (2023), “Statistical inference for functional time series: autocovariance function,” Statistica Sinica, 33, 2519–2543.
- Zhou, X., Kong, D., Kashlak, A., Kong, L., Karunamuni, R., and Zhu, H. (2023), “Functional response quantile regression model,” Statistica Sinica, 33, 2643–2667.
- Zhu, H., Fan, J., and Kong, L. (2014), “Spatially varying coefficient model for neuroimaging data With jump discontinuities,” Journal of the American Statistical Association, 109, 1084–1098.
Acknowledgments
Data used in the preparation of this article were obtained from the Alzheimer’s Disease
Neuroimaging Initiative (ADNI) database (http://adni.loni.usc.edu). As such, the
investigators within the ADNI contributed to the design and implementation of ADNI and/or
provided data but did not participate in analysis or writing of this report. A complete listing of
ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/
uploads/how_to_apply/ADNI_Acknowledgement_List.pdf.
The research work of Yang Long and Lily Wang is partially supported by the National
Institutes of Health award 1R01AG085616 and the National Science Foundation grant DMS-
- Guanqun Cao’s research is partially supported by the National Science Foundation
under Grants DMS-2413301, CNS-2319342 and CNS-2319343.
This project was further supported by resources provided by the Office of Research
Computing at George Mason University (URL: https://orc.gmu.edu) and funded in
part by grants from the National Science Foundation (Award Number 2018631).
Supplementary Materials
In the Supplementary Material, we provide additional simulation studies and detailed proofs
of the theoretical results in this paper.