Abstract

We propose a novel functional linear model incorporating latent factors, where scalar

response, scalar covariates, and functional covariates have repeated measurements for each

subject. Our model accounts for latent factors that may impact the response but remain unobservable. To unveil and estimate these latent factors, we propose an iterated profile estimation

method. We then establish the consistency and asymptotic properties of the estimators. To

demonstrate the efficacy of our proposed estimation procedure, we conduct simulation studies

across various scenarios. We compare our results with estimations derived from conventional

functional linear models, revealing the superior performance of our method in addressing latent

factors. We further illustrate our proposed model and methodology by analyzing real data from

both financial markets and air pollution datasets. In these analyses, we successfully uncover

hidden factors that exert influence in these specific fields.

Information

Preprint No.SS-2024-0028
Manuscript IDSS-2024-0028
Complete AuthorsZixuan Han, Tao Li, Jinhong You, Jiguo Cao
Corresponding AuthorsJiguo Cao
EmailsJiguo_cao@sfu.ca

References

  1. Bai, J. (2009). Panel data models with interactive fixed effects. Econometrica, 77:1229–1279.
  2. Bai, J. and Li, K. (2014). Theory and methods of panel data models with interactive fixed effects. The Annals of Statistics, 42(1):142–170.
  3. Bai, J. and Ng, S. (2002). Determining the number of factors in approximate factor models. Econometrica, 70:191–221.
  4. Chen, C., Guo, S., and Qiao, X. (2022). Functional linear regression: dependence and error contamination. Journal of Business and Economic Statistics, 40(1):444–457.
  5. Chen, J., Li, D., Wei, L., and Zhang, W. (2021). Nonparametric homogeneity pursuit in functional-coefficient models. Journal of Nonparametric Statistics, 33(3-4):387–416.
  6. Crambes, C., Kneip, A., and Sarda, P. (2009). Smoothing splines estimators for functional linear regression. The Annals of Statistics, 37(1):35–72.
  7. Fan, J., Li, K., and Liao, Y. (2021). Recent developments on factor models and its applications in econometric learning. Annual Review of Financial Economics, 13:401–430.
  8. Fan, J., Liao, Y., and Mincheva, M. (2011). High dimensional covariance matrix estimation in approximate factor models. The Annals of Statistics, 39:3320–3356.
  9. Fang, K., Zhang, X., Ma, S., and Zhang, Q. (2020). Smooth and locally sparse estimation for multiple-output functional linear regression. Journal of Statistical Computation and Simulation, 90(2):341–354.
  10. Feng, S., Li, G., Peng, H., and Tong, T. (2018). Varying coefficient panel data model with interactive fixed effects. arXiv preprint, page arXiv:1803.02714.
  11. Ferraty, F. and Vieu, P. (2006). Nonparametric functional data analysis.
  12. Springer, New York.
  13. Gao, Y., Shang, H., and Yang, Y. (2024). Factor-augmented model for
  14. functional data. Statistica Sinica, 34 (2024):1–26.
  15. Guan, T., Lin, Z., and Cao, J. (2020). Estimating truncated functional linear models with a nested group bridge approach. Journal of Computational and Graphical Statistics, 29(3):620–628.
  16. Hilgert, N., Mas, A., and Verzelen, N. (2013). Minimax adaptive tests for the functional linear model. The Annals of Statistics, 41(2):838–869.
  17. Huang, J. Z., Wu, C. O., and Zhou, L. (2004). Polynomial spline estimation and inference for varying coefficient models with longitudinal data. Statistica Sinica, 14:763–788.
  18. Jiang, C. and Wang, J. (2011). Functional single index models for longitudinal data. The Annals of Statistics, 39(1):362–388.
  19. Li, J., Huang, C., and Zhu, H. (2017). A functional varying-coefficient single-index model for functional response data. Journal of the American Statistical Association, 112(519):1169–1181.
  20. Lin, Z., Cao, J., Wang, L., and Wang, H. (2017). Locally sparse estimator for functional regression models. Journal of Computational and Graphical Statistics, 26(2):306–318.
  21. Liu, B., Wang, L., and Cao, J. (2017). Estimating functional linear mixedeffects regression models. Computational Statistics and Data Analysis, 106:153–164.
  22. Liu, R., Wang, H., and Wang, S. (2018). Functional variable selection via Gram-Schmidt orthogonalization for multiple linear regression. Journal of Statistical Computation and Simulation, 88(18):3664–3680.
  23. Liu, X., Ma, S., and Chen, K. (2022). Multivariate functional regression via nested reduced-rank regularization. Journal of Computational and Graphic Statistics, 31:231–240.
  24. Morris, J. (2015). Funcitional regression. Annual Review of Statistics and Its Application, 2:321–359.
  25. Peng, X. and Wang, H. (2022). A generalized quantile tree method for subgroup identification. Journal of Computational and Graphical Statistics, 31(3):824–834.
  26. Pesaran, M. (2006). Estimation and inference in large heterogeneous panels with a multifactor error structure. Econometrica, 74:967–1012.
  27. Ramsay, J. (1982). When the data are functions. Psychometrika, 47(4):379– 396.
  28. Ramsay, J. and Silverman, B. (2005). Functional data analysis. Springer Series in Statistics, New York: Springer, 2nd ed.
  29. Wahba, G. (1990). Spline models for observational data. SIAM, CBMS-NSF Regional Conference Series in Applied Mathematics, Philadelphia.
  30. Wu, Y., Fan, J., and M¨uller, H. (2010). Varying-coefficient functional linear regression. Bernoulli, 16:730–758.
  31. Yuan, M. and Cai, T. (2010). A reproducing kernel Hilbert space approach to functional linear regression. The Annals of Statistics, 38(6):3412–3444.
  32. Zhang, Y., Wang, H., and Zhu, Z. (2019). Quantile-regression-based clustering for panel data. Journal of Econometrics, 213(1):54–67.
  33. Zheng, Y., Zhu, Q., Li, G., and Xiao, Z. (2018). Hybrid quantile regression estimation for time series models with conditional heteroscedasticity. Journal of the Royal Statistical Society Series B: Statistical Methodology, 80(5):975–993.
  34. Zhu, H., Chen, K., Luo, X., Yuan, Y., and Wang, J. (2019). Fmem: Functional mixed effects models for longitudinal functional responses. Statistica Sinica, 29(4):2007–2033.

Acknowledgments

The authors would like to thank the editor, the associate editor, and two referees for many insightful comments. These comments are very helpful for us

to improve our work. T. Li’s research was supported by the Humanities and

Social Science Fund of Ministry of Education of China (21YJA910001). J.

You’s research was supported by the National Natural Science Foundation

of China (Grant No.11971291) and Innovative Research Team of Shanghai

University of Finance and Economics. J. Cao’s research is supported by a

Discovery grant ( RGPIN-2023-04057) from the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Canada Research

Chairs Program.

Supplementary Materials

The supplementary document includes the additional numerical results and

detailed proofs for theoretical results. We also provide the R codes for the

simulation studies and real data analysis on the website


Supplementary materials are available for download.