Fisher's Combined Probability Test for Cross-Sectional Independence in Panel Data Models with Serial Correlation

Hongfei Wang, Binghui Liu, Long Feng and Yanyuan Ma

doi:10.5705/ss.202023.0348

Abstract

Testing cross-sectional independence in panel data models is of fundamental importance in econometric analysis with high-dimensional panels. Recently, econo-

metricians began to turn their attention to the problem in the presence of serial

dependence. The existing procedure for testing cross-sectional independence with

serial correlation is based on the sum of the sample cross-sectional correlations,

which generally performs well when the alternative has dense cross-sectional correlations, but suffers from low power against sparse alternatives. To deal with

sparse alternatives, we propose a test based on the maximum of the squared

sample cross-sectional correlations.

Furthermore, we propose a combined test

to combine the p-values of the max based and sum based tests, which performs

well under both dense and sparse alternatives. The combined test relies on the

asymptotic independence of the max based and sum based test statistics, which

we show rigorously. We show that the proposed max based and combined tests

have attractive theoretical properties and demonstrate the superior performance

via extensive simulation results.

We demonstrate the practicality of the proposed tests through two empirical applications.

Key words and phrases: Asymptotic independence, Cross-sectional dependence, Heterogeneous panel data models, High dimensionality 1

Information

Preprint No.	SS-2023-0348
Manuscript ID	SS-2023-0348
Complete Authors	Hongfei Wang, Binghui Liu, Long Feng, Yanyuan Ma
Corresponding Authors	Long Feng
Emails	flnankai@nankai.edu.cn

References

Anselin, L. and A. K. Bera (1998). Spatial dependence in linear regression models with an introduction to spatial econometrics. Statistics textbooks and monographs 155, 237–290.
Bai, J. (2009). Panel data models with interactive fixed effects. Econometrica 77(4), 1229–1279.
Baltagi, B. H., Q. Feng, and C. Kao (2012). A lagrange multiplier test for cross-sectional dependence in a fixed effects panel data model. Journal of Econometrics 170(1), 164–177.
Baltagi, B. H., Q. Feng, and C. Kao (2016). Estimation of heterogeneous panels with structural breaks. Journal of Econometrics 191(1), 176–195.
Baltagi, B. H., C. Kao, and B. Peng (2016). Testing cross-sectional correlation in large panel data models with serial correlation. Econometrics 4(4), 1–24.
Box, G. E., G. M. Jenkins, G. C. Reinsel, and G. M. Ljung (2015). Time series analysis: forecasting and control. John Wiley & Sons.
Breusch, T. S. and A. R. Pagan (1980). The lagrange multiplier test and its applications to model specification in econometrics. The review of economic studies 47(1), 239–253.
Cai, T. T., Z. Ren, and H. H. Zhou (2016). Estimating structured high-dimensional covariance and precision matrices: optimal rates and adaptive estimation. Electronic Journal of Statistics 10(1), 1–59.
Chen, X. and W. Liu (2018). Testing independence with high-dimensional correlated samples. The Annals of Statistics 46(2), 866–894.
Feng, L., Y. Ding, and B. Liu (2020). Rank-based tests for cross-sectional dependence in large (n, t) fixed effects panel data models. Oxford Bulletin of Economics and Statistics 82(5), 1198–1216.
Feng, L., T. Jiang, B. Liu, and W. Xiong (2022). Max-sum tests for cross-sectional dependence of high-demensional panel data. The Annals of Statistics 50(2), 1124–1143.
Gao, J., X. Han, G. Pan, and Y. Yang (2017). High dimensional correlation matrices: The central limit theorem and its applications. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 79(3), 677–693.
Gao, J., F. Liu, B. Peng, and Y. Yan (2023). Binary response models for heterogeneous panel data with interactive fixed effects. Journal of Econometrics 235(2), 1654–1679.
Hong, Y. (2010). Serial correlation and serial dependence. In Macroeconometrics and Time Series Analysis, pp. 227–244. Springer.
Kapoor, M., H. H. Kelejian, and I. R. Prucha (2007). Panel data models with spatially correlated error components. Journal of econometrics 140(1), 97–130.
Kelejian, H. H. and I. R. Prucha (1999). A generalized moments estimator for the autoregressive parameter in a spatial model. International economic review 40(2), 509–533.
Lan, W., R. Pan, R. Luo, and Y. Cheng (2017). High dimensional cross-sectional dependence test under arbitrary serial correlation. Science China Mathematics 60(2), 345–360.
Lee, L.-f. (2007). Gmm and 2sls estimation of mixed regressive, spatial autoregressive models. Journal of Econometrics 137(2), 489–514.
Lee, L.-f. and J. Yu (2010). Estimation of spatial autoregressive panel data models with fixed effects. Journal of econometrics 154(2), 165–185.
Li, D. and L. Xue (2015). Joint limiting laws for high-dimensional independence tests. arXiv:1512.08819.
Liu, W.-D., Z. Lin, and Q.-M. Shao (2008). The asymptotic distribution and berry–esseen bound of a new test for independence in high dimension with an application to stochastic optimization. The Annals of Applied Probability 18(6), 2337–2366.
Pesaran, M. H. (2004). General diagnostic tests for cross section dependence in panels (iza discussion paper no. 1240). Institute for the Study of Labor (IZA).
Pesaran, M. H. (2015). Testing weak cross-sectional dependence in large panels. Econometric reviews 34(6-10), 1089–1117.
Pesaran, M. H., A. Ullah, and T. Yamagata (2008). A bias-adjusted lm test of error cross-section independence. The Econometrics Journal 11(1), 105–127.
Rudelson, M. and R. Vershynin (2013). Hanson-wright inequality and sub-gaussian concentration. Electronic Communications in Probability 18, 1–9.
Serlenga, L. and Y. Shin (2007). Gravity models of intra-eu trade: application of the ccep-ht estimation in heterogeneous panels with unobserved common time-specific factors. Journal of applied econometrics 22(2), 361–381.
Wei, W. (2006). Time Series Analysis: Univariate and Multivariate Methods. Pearson Addison Wesley. Hongfei Wang

Acknowledgments

We express our sincere gratitude to the editor, associate editor, and anonymous reviewers for their invaluable guidance, support, and constructive

feedback. Wang and Liu’s research was partially supported by National

Natural Science Foundation of China grant 12171079 and the China National Key R&D Program grant 2020YFA0714100. Long Feng is partially

supported by Shenzhen Wukong Investment Company, the Open Research

Fund of Key Laboratory of Advanced Theory and Application in Statistics

and Data Science (East China Normal University), Ministry of Education,

Tianjin Science Fund for Outstanding Young Scholar (23JCJQJC00150),

the Fundamental Research Funds for the Central Universities under Grant

No. ZB22000105 and 63233075, the China National Key R&D Program

(Grant Nos. 2019YFC1908502, 2022YFA1003703, 2022YFA1003802, 2022YFA1003803)

and the National Natural Science Foundation of China Grants (Nos. 12271271,

11925106, 12231011, 11931001 and 11971247). Ma’s research was partially

supported by grants from national institute of health. Binghui liu and Long

Feng are co-corresponding authors and contributed equally to this paper.

Supplementary Materials

The Supplementary Material contains some additional numerical results,

an additional empirical application and the technical proofs.

Supplementary materials are available for download.

[1] Anselin, L. and A. K. Bera (1998). Spatial dependence in linear regression models with an introduction to spatial econometrics. Statistics textbooks and monographs 155, 237–290.

[2] Bai, J. (2009). Panel data models with interactive fixed effects. Econometrica 77(4), 1229–1279.

[3] Baltagi, B. H., Q. Feng, and C. Kao (2012). A lagrange multiplier test for cross-sectional dependence in a fixed effects panel data model. Journal of Econometrics 170(1), 164–177.

[4] Baltagi, B. H., Q. Feng, and C. Kao (2016). Estimation of heterogeneous panels with structural breaks. Journal of Econometrics 191(1), 176–195.

[5] Baltagi, B. H., C. Kao, and B. Peng (2016). Testing cross-sectional correlation in large panel data models with serial correlation. Econometrics 4(4), 1–24.

[6] Box, G. E., G. M. Jenkins, G. C. Reinsel, and G. M. Ljung (2015). Time series analysis: forecasting and control. John Wiley & Sons.

[7] Breusch, T. S. and A. R. Pagan (1980). The lagrange multiplier test and its applications to model specification in econometrics. The review of economic studies 47(1), 239–253.

[8] Cai, T. T., Z. Ren, and H. H. Zhou (2016). Estimating structured high-dimensional covariance and precision matrices: optimal rates and adaptive estimation. Electronic Journal of Statistics 10(1), 1–59.

[9] Chen, X. and W. Liu (2018). Testing independence with high-dimensional correlated samples. The Annals of Statistics 46(2), 866–894.

[10] Feng, L., Y. Ding, and B. Liu (2020). Rank-based tests for cross-sectional dependence in large (n, t) fixed effects panel data models. Oxford Bulletin of Economics and Statistics 82(5), 1198–1216.

[11] Feng, L., T. Jiang, B. Liu, and W. Xiong (2022). Max-sum tests for cross-sectional dependence of high-demensional panel data. The Annals of Statistics 50(2), 1124–1143.

[12] Gao, J., X. Han, G. Pan, and Y. Yang (2017). High dimensional correlation matrices: The central limit theorem and its applications. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 79(3), 677–693.

[13] Gao, J., F. Liu, B. Peng, and Y. Yan (2023). Binary response models for heterogeneous panel data with interactive fixed effects. Journal of Econometrics 235(2), 1654–1679.

[14] Hong, Y. (2010). Serial correlation and serial dependence. In Macroeconometrics and Time Series Analysis, pp. 227–244. Springer.

[15] Kapoor, M., H. H. Kelejian, and I. R. Prucha (2007). Panel data models with spatially correlated error components. Journal of econometrics 140(1), 97–130.

[16] Kelejian, H. H. and I. R. Prucha (1999). A generalized moments estimator for the autoregressive parameter in a spatial model. International economic review 40(2), 509–533.

[17] Lan, W., R. Pan, R. Luo, and Y. Cheng (2017). High dimensional cross-sectional dependence test under arbitrary serial correlation. Science China Mathematics 60(2), 345–360.

[18] Lee, L.-f. (2007). Gmm and 2sls estimation of mixed regressive, spatial autoregressive models. Journal of Econometrics 137(2), 489–514.

[19] Lee, L.-f. and J. Yu (2010). Estimation of spatial autoregressive panel data models with fixed effects. Journal of econometrics 154(2), 165–185.

[20] Li, D. and L. Xue (2015). Joint limiting laws for high-dimensional independence tests. arXiv:1512.08819.

[21] Liu, W.-D., Z. Lin, and Q.-M. Shao (2008). The asymptotic distribution and berry–esseen bound of a new test for independence in high dimension with an application to stochastic optimization. The Annals of Applied Probability 18(6), 2337–2366.

[22] Pesaran, M. H. (2004). General diagnostic tests for cross section dependence in panels (iza discussion paper no. 1240). Institute for the Study of Labor (IZA).

[23] Pesaran, M. H. (2015). Testing weak cross-sectional dependence in large panels. Econometric reviews 34(6-10), 1089–1117.

[24] Pesaran, M. H., A. Ullah, and T. Yamagata (2008). A bias-adjusted lm test of error cross-section independence. The Econometrics Journal 11(1), 105–127.

[25] Rudelson, M. and R. Vershynin (2013). Hanson-wright inequality and sub-gaussian concentration. Electronic Communications in Probability 18, 1–9.

[26] Serlenga, L. and Y. Shin (2007). Gravity models of intra-eu trade: application of the ccep-ht estimation in heterogeneous panels with unobserved common time-specific factors. Journal of applied econometrics 22(2), 361–381.

[27] Wei, W. (2006). Time Series Analysis: Univariate and Multivariate Methods. Pearson Addison Wesley. Hongfei Wang