Abstract
We consider a large-scale network where a scalar response and a functional predictor are
observed for each individual. To incorporate the network information and to depict the dynamic impact
of the functional predictor on the response of each individual, we investigate a network functional linear
model. The model assumes that each individual’s response can be explained by a linear combination
of the responses of the neighbors and a functional regression of the individual. We first approximate
functional regression coefficient by a finite representation based on functional principal component
analysis technique (FPCA) and then introduce a novel least-squares type of procedure to estimate
the coefficients after dimension reduction. In addition, we introduce two modified BIC-type criteria
for choosing the number of principal components. We study the convergence rates of the functional
regression coefficients and establish the asymptotic normality of the network autoregression coefficients,
as well as the consistency of the model selection procedures. Extensive simulation studies are conducted
to evaluate the finite sample performance of our proposed method. Finally, we illustrate the usefulness
of our method by applying it to two applications.
Information
| Preprint No. | SS-2024-0282 |
|---|---|
| Manuscript ID | SS-2024-0282 |
| Complete Authors | Xingyu Yan, Yanyuan Ma |
| Corresponding Authors | Xingyu Yan |
| Emails | yan@jsnu.edu.cn |
References
- Barab´asi, A.-L. and Albert, R. (1999). Emergence of scaling in random networks. Science, 286(5439):509–512.
- Cai, T. T. and Yuan, M. (2012). Minimax and adaptive prediction for functional linear regression. Journal of the American Statistical Association, 107(499):1201–1216.
- Chen, C., Guo, S., and Qiao, X. (2022). Functional linear regression: dependence and error contamination. Journal of Business & Economic Statistics, 40(1):444–457.
- Clauset, A., Shalizi, C. R., and Newman, M. E. (2009). Power-law distributions in empirical data. SIAM Review, 51(4):661–703.
- Crambes, C., Kneip, A., and Sarda, P. (2009). Smoothing splines estimators for functional linear regression. The Annals of Statistics, 37(1):35–72.
- Fan, J. and Peng, H. (2004). Nonconcave penalized likelihood with diverging number of parameters. The Annals of Statistics, 32(3):928–961.
- Fan, Y., Liu, Y., and Zhu, L. (2021). Optimal subsampling for linear quantile regression models. Canadian Journal of Statistics, 49(4):1039–1057.
- Fan, Y. and Tang, C. Y. (2013). Tuning parameter selection in high dimensional penalized likelihood. Journal of the Royal Statistical Society Series B: Statistical Methodology, 75(3):531–552.
- Ferraty, F. and Vieu, P. (2006). Nonparametric Functional Data Analysis: Theory and Practice. New York: Springer.
- Hall, P. and Horowitz, J. L. (2007). Methodology and convergence rates for functional linear regression. The Annals of Statistics, 35(1):70–91.
- Holland, P. W. and Leinhardt, S. (1981). An exponential family of probability distributions for directed graphs. Journal of the American Statistical Association, 76(373):33–50.
- Hu, X. and Yao, F. (2022). Sparse functional principal component analysis in high dimensions. Statistica Sinica, 32(4):1939–1960.
- Huang, D., Lan, W., Zhang, H. H., and Wang, H. (2019). Least squares estimation of spatial autoregressive models for large-scale social networks. Electronic Journal of Statistics, 13(1):1935–7524.
- Huang, J., Ma, S., and Zhang, C.-H. (2008). Adaptive lasso for sparse high-dimensional regression models. Statistica Sinica, 18(4):1603–1618.
- Kato, K. (2012). Estimation in functional linear quantile regression. The Annals of Statistics, 40(6):3108–3136.
- Kokoszka, P. and Reimherr, M. (2017). Introduction to Functional Data Analysis. Boca Raton: CRC Press.
- Kong, D. H., Xue, K. J., Yao, F., and Zhang, H. H. (2016). Partially functional linear regression in high dimensions. Biometrika, 103(1):147–159.
- Le, C. M. and Li, T. (2022). Linear regression and its inference on noisy network-linked data. Journal of the Royal Statistical Society Series B: Statistical Methodology, 84(5):1851– cal framework for spatio-temporal pollution prediction and estimation of its long-term impact on health. Biostatistics, 18(2):370–385.
- Lee, L.-F., Liu, X., and Lin, X. (2010). Specification and estimation of social interaction models with network structures. The Econometrics Journal, 13(2):145–176.
- Lei, J. and Rinaldo, A. (2015). Consistency of spectral clustering in stochastic block models. The Annals of Statistics, 43(1):215–237.
- Li, Y. H., Wang, N., and Carroll, R. J. (2010). Generalized functional linear models with semiparametric single-index interactions. Journal of the American Statistical Association, 105(490):621–633.
- Nowicki, K. and Snijders, T. A. B. (2001). Estimation and prediction for stochastic blockstructures. Journal of the American Statistical Association, 96(455):1077–1087.
- Park, Y., Li, B., and Li, Y. (2023). Crop yield prediction using bayesian spatially varying coefficient models with functional predictors. Journal of the American Statistical Association, 118(541):70–83.
- Ren, Y., Li, Z., Zhu, X., Gao, Y., and Wang, H. (2024). Distributed estimation and inference for spatial autoregression model with large scale networks. Journal of Econometrics, 238(2):105629.
- Tho, Z. Y., Ding, D., Hui, F. K., Welsh, A., and Zou, T. (2023). On the robust estimation of spatial autoregressive models. Econometrics and Statistics, https://doi.org/10.1016/j.ecosta.2023.01.004.
- Wang, H., Li, B., and Leng, C. (2009). Shrinkage tuning parameter selection with a diverging number of parameters. Journal of the Royal Statistical Society Series B: Statistical Methodology, 71(3):671–683.
- Wang, X., Zhu, H., and Initiative, A. D. N. (2017). Generalized scalar-on-image regression models via total variation. Journal of the American Statistical Association, 112(519):1156–1168.
- Wang, Y. J. and Wong, G. Y. (1987). Stochastic blockmodels for directed graphs. Journal of the American Statistical Association, 82(397):8–19.
- Wong, R. K., Li, Y., and Zhu, Z. (2019). Partially linear functional additive models for multivariate functional data. Journal of the American Statistical Association, 114(525):406– 418.
- Xu, Y., Li, Y., and Nettleton, D. (2018). Nested hierarchical functional data modeling and inference for the analysis of functional plant phenotypes. Journal of the American Statistical Association, 113(522):593–606.
- Yao, F., M¨uller, H.-G., and Wang, J.-L. (2005). Functional linear regression analysis for longitudinal data. The Annals of Statistics, 33(6):2873–2903.
- Zhou, L., Huang, J. Z., and Carroll, R. J. (2008). Joint modelling of paired sparse functional data using principal components. Biometrika, 95(3):601–619.
- Zhu, L., Miao, B., and Peng, H. (2006). On sliced inverse regression with high-dimensional covariates. Journal of the American Statistical Association, 101(474):630–643.
- Zhu, X. N., Huang, D. Y., Pan, R., and Wang, H. S. (2020). Multivariate spatial autoregressive model for large scale social networks. Journal of Econometrics, 215(2):591–606.
- Zhu, X. N., Pan, R., Li, G. D., Liu, Y. W., and Wang, H. S. (2017). Network vector autoregression. The Annals of Statistics, 45(3):1096–1123.
Acknowledgments
We are grateful to the Editor, an anonymous Associate Editor and reviewers for their careful
reading and valuable suggestions, which have helped to improve the article. This research
was supported by the National Key R&D Program of China under Grant 2024YFA1012200,
the National Natural Science Foundation of China under Grant 12571285, and the National
Institute of Health.
Supplementary Materials
The Supplementary Material contains auxiliary lemmas, additional simulation results, theoretical analysis, and all proofs of Theorems 1-4.