Abstract
Matrix-variate time series data are increasingly popular in economics, statistics, and environmental
studies, among other fields. The bilinear autoregressive structure is a popular modeling approach for such data,
as it reduces model complexity while capturing dynamic interactions between rows and columns. However, in
high-dimensional settings, the conventional iterated least-squares method requires estimating a large number of
parameters, which hampers interpretability and scalability. To address this challenge, we propose regularized
estimation procedures designed for settings in which the autoregressive coefficient matrices exhibit banded
or sparse structures. Specifically, we introduce a Bayesian Information Criterion (BIC)-based approach to
estimate the bandwidth in the banded case, and employ the LASSO technique for enforcing sparsity in the
coefficient matrices. We derive asymptotic properties for both methods as the dimensions diverge and the
sample size T →∞. Simulations and real data examples demonstrate the effectiveness of our methods,
comparing their forecasting performance against common autoregressive models in the literature.
Information
| Preprint No. | SS-2024-0341 |
|---|---|
| Manuscript ID | SS-2024-0341 |
| Complete Authors | Hangjin Jiang, Baining Shen, Yuzhou Li, Zhaoxing Gao |
| Corresponding Authors | Zhaoxing Gao |
| Emails | mazxgao@gmail.com |
References
- Ahn, S. C., and, Horenstein, A. R. (2013). Eigenvalue ratio test for the number of factors. Econometrica, 81(3), 1203–1227.
- Andrews, D. W. (1991). Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica, 59, 817–858.
- Bai J. (2003) Inferential theory for factor models of large dimensions. Econometrica, 71(1), 135–171.
- Bai, J. and Ng, S. (2002). Determining the number of factors in approximate factor models. Econometrica, 70, 191–221.
- Black, F. (1986). Noise. The Journal of Finance, 41(3), 528–543.
- Box, G. E. P. and Tiao, G. C. (1977). A canonical analysis of multiple time series. Biometrika, 64, 355–365.
- B¨uhlmann, P. and Van De Geer, S. (2011). Statistics for high-dimensional data: methods, theory and applications. Springer Science and Business Media.
- Chang, J., Yao, Q. and Zhou, W. (2017). Testing for high-dimensional white noise using maximum cross-correlations. Biometrika, 104(1), 111–127.
- Chen, E.Y., Tsay, R.S., and Chen, R. (2020). Constrained factor models for high-dimensional matrix-variate time series. Journal of the American Statistical Association, 115(530), 775– 793. Regularized Estimation of MAR Models
- Chen, R., Xiao, H., and Yang, D. (2021). Autoregressive models for matrix-valued time series. Journal of Econometrics, 222(1), 539–560.
- Cohen, J. (1960) A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20, 37-46.
- Ding, S. and Cook, R. D. (2018). Matrix variate regressions and envelope models. Journal of the Royal Statistical Society: Series B, 80(2), 387–408.
- Fama, E. F. and French, K. R. (2015). A five-factor asset pricing model. Journal of Financial Economics, 116(1), 1–22.
- Fan, J., Liao, Y., and Mincheva, M. (2013). Large covariance estimation by thresholding principal orthogonal complements (with discussion). Journal of the Royal Statistical Society, Series B, 75(4), 603–680.
- Forni, M., Hallin, M., Lippi, M. and Reichlin, L. (2000). Reference cycles: the NBER methodology revisited (No. 2400). Centre for Economic Policy Research. url: https://ideas.repec.org/p/cpr/ceprdp/2400.html.
- Forni, M., Hallin, M., Lippi, M. and Reichlin, L. (2005). The generalized dynamic factor model: one-sided estimation and forecasting. Journal of the American Statistical Association, 100(471), 830–840.
- Gao, Z. (2020). Segmenting high-dimensional matrix-valued time series via sequential transformations. arXiv:2002.03382.
- Gao, Z., Ma, Y., Wang, H. and Yao, Q. (2019). Banded spatio-temporal autoregressions. Journal of Econometrics, 208(1), 211–230. Regularized Estimation of MAR Models
- Gao, Z. and Tsay, R. S. (2019). A structural-factor approach for modeling high-dimensional time series and space-time data. Journal of Time Series Analysis, 40, 343–362.
- Gao, Z. and Tsay, R. S. (2021). Modeling high-dimensional unit-root time series. International Journal of Forecasting, 37(4), 1535–1555.
- Gao, Z. and Tsay, R. S. (2022). Modeling high-dimensional time series: a factor model with dynamically dependent factors and diverging eigenvalues. Journal of the American Statistical Association, 117(539), 1398-1414.
- Gao, Z. and Tsay, R. S. (2023a). A two-way transformed factor model for matrix-variate time series. Econometrics and Statistics, 27, 83–101.
- Gao, Z. and Tsay, R. S. (2023b). Divide-and-conquer: a distributed hierarchical factor approach to modeling large-scale time series data. Journal of the American Statistical Association, 118(544), 2698–2711.
- Guo, S., Wang, Y. and Yao, Q. (2016). High dimensional and banded vector autoregression. Biometrika, 103(4), 889–903.
- Han, Y., Chen, R., Yang, D., and Zhang, C. H. (2024). Tensor factor model estimation by iterative projection. The Annals of Statistics, 52(6), 2641–2667.
- Han, Y., Yang, D., Zhang, C. H., and Chen, R. (2024). CP factor model for dynamic tensors. Journal of the Royal Statistical Society Series B: Statistical Methodology, 86(5), 1383–1413.
- Hauglustaine, D. A., and Ehhalt, D. H. (2002). A three-dimensional model of molecular hydrogen in the troposphere. Journal of Geophysical Research: Atmospheres, 107(D17), ACH-4.
- Hsu, N. J., Huang, H. C., and Tsay, R. S. (2021). Matrix autoregressive spatio-temporal modRegularized Estimation of MAR Models els. Journal of Computational and Graphical Statistics, 30(4), 1143–1155.
- Hsu, N. J., Chang, Y. M., and Huang, H. C. (2012). A group lasso approach for non-stationary spatial–temporal covariance estimation. Environmetrics, 23, 12-23.
- Hosking, J. R. (1980). The multivariate portmanteau statistic. Journal of the American Statistical Association, 75(371), 602–608.
- Hung, H., Wu, P., Tu, I., and Huang, S. (2012). On multilinear principal component analysis of order-two tensors. Biometrika, 99(3), 569–583.
- Lam, C. and Yao, Q. (2012). Factor modeling for high-dimensional time series: inference for the number of factors. The Annals of Statistics, 40(2), 694–726.
- Lam, C., Yao, Q. and Bathia, N. (2011). Estimation of latent factors for high-dimensional time series. Biometrika, 98, 901–918.
- Li Z. and Xiao H. (2021). Multi-linear tensor autoregressive models. arXiv:2110.00928.
- Lozano, A. C., Li, H., Niculescu-Mizil, A., Liu, Y., Perlich, C., Hosking, J., and Abe, N.
- (2009). Spatial-temporal causal modeling for climate change attribution. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, 587–596.
- Meinshausen N. and Buhlmann P. (2010). Stability selection. Journal of the Royal Statistical
- Society, Series B, 72, 414-473.
- Pan, J. and Yao, Q. (2008). Modelling multiple time series via common factors. Biometrika, 95(2), 365–379.
- Sharpe, W. F. (1964). Capital asset prices: A theory of market equilibrium under conditions of Regularized Estimation of MAR Models risk. The Journal of Finance, 19(3), 425–442.
- Shen, D., Shen, H. and Marron, J. S. (2016). A general framework for consistency of principal component analysis. Journal of Machine Learning Research, 17(150), 1–34.
- Stewart, G. W., and Sun, J. (1990). Matrix Perturbation Theory. Academic Press.
- Stock, J. H. and Watson, M. W. (2002). Forecasting using principal components from a large number of predictors. Journal of the American Statistical Association, 97, 1167–1179.
- Stock, J. H. and Watson, M. W. (2005). Implications of dynamic factor models for VAR analysis. NBER Working Paper 11467. doi: 10.3386/w11467
- Sun, W., Wang, J. H., and Fang, Y. X. (2013). Consistent Selection of Tuning Parameters via Variable Selection Stability. Journal of Machine Learning Research, 71(14), 3419-3440.
- Tiao, G. C. and Tsay, R. S. (1989). Model specification in multivariate time series (with discussion). Journal of the Royal Statistical Society, B51, 157–213.
- Tsay, R. S. (2014). Multivariate Time Series Analysis. Wiley, Hoboken, NJ.
- Tsay, R. S. (2020). Testing for serial correlations in high-dimensional time series via extreme value theory. Journal of Econometrics, 216, 106–117.
- Wainwright, M. J. (2019). High-Dimensional Statistics: A Non-Asymptotic Viewpoint. Cambridge University Press.
- Walden, A. and Serroukh, A. (2002). Wavelet analysis of matrix-valued time series. Proceedings: Mathematical, Physical and Engineering Sciences, 458(2017), 157–179.
- Wang, D., Liu, X. and Chen, R. (2019). Factor models for matrix-valued high-dimensional time series. Journal of Econometrics, 208(1), 231–248. Regularized Estimation of MAR Models
- Wang, D., Zheng, Y., and Li, G. (2024). High-dimensional low-rank tensor autoregressive time series modeling. Journal of Econometrics, 238(1), 105544.
- Wang, D., Zheng, Y., Lian, H., and Li, G. (2022). High-dimensional vector autoregressive time series modeling via tensor decomposition. Journal of the American Statistical Association, 117(539), 1338–1356.
- Werner, K., Jansson, M., and Stoica, P. (2008). On estimation of covariance matrices with Kronecker product structure. IEEE Transactions on Signal Processing, 56(2), 478–491.
- Xiao, H., Han, Y., Chen, R., and Liu, C. (2022). Reduced rank autoregressive models for matrix time series. Working paper, available at https://yuefenghan.github.io/ papers/Reduced_Rank_MAR.pdf.
- Yang, D, Ma, Z., and Buja, A. (2016). Rate optimal denoising of simultaneously sparse and low rank matrices. Journal of Machine Learning Research, 17(92), 1–27.
- Ye, J. (2005). Generalized low rank approximations of matrices. Machine Learning, 61(1-3), 167–191.
- Yu, L., He, Y., Kong, X., and Zhang, X. (2022). Projected estimation for large-dimensional matrix factor models. Journal of Econometrics, 229(1), 201–217.
- Yu, R., Chen, R., Xiao, H., and Han, Y. (2024). Dynamic matrix factor models for high dimensional time series. arXiv: 2407.05624.
Acknowledgments
We thank the Editor, Associate Editor, and anonymous referees for their constructive comments and valuable suggestions, which have greatly improved the presentation and quality of
this article. Z.G. acknowledges partial support from the National Natural Science Foundation
of China (NSFC) under Grant Nos. 12201558, 72573029, and U23A2064, and from the Tianfu
Emei Youth Talent Project of Sichuan Province. H.J. acknowledges partial support from the
High-level Talent Special Support Program of Zhejiang Province and the National Natural
Regularized Estimation of MAR Models
Science Foundation of China (No.12531013).
Supplementary Materials
The online Supplementary Material provides additional simulation results and proofs of the
theoretical results.