Abstract

We propose a novel approach for detecting change points in high

dimensional linear regression models. Unlike previous research that relies on strict

Gaussian/sub-Gaussian error assumptions and has prior knowledge of change

points, we propose a tail-adaptive method for change point detection and estimation. We use a weighted combination of composite quantile and least squared

losses to build a new loss function, allowing us to leverage information from both

conditional means and quantiles. For change point testing, we develop a family

of individual testing statistics with different weights to account for unknown tail

structures. These individual tests are further aggregated to construct a powerful

tail-adaptive test for sparse regression coefficient changes. For change point estimation, we propose a family of argmax-based individual estimators. We provide

theoretical justifications for the validity of these tests and change point estimators. Additionally, we introduce a new algorithm for detecting multiple change

points in a tail-adaptive manner using the wild binary segmentation. Extensive

numerical results show the effectiveness of our proposed method.

Information

Preprint No.SS-2023-0337
Manuscript IDSS-2023-0337
Complete AuthorsBin Liu, Zhengling Qi, Xinsheng Zhang, Yufeng Liu
Corresponding AuthorsYufeng Liu
Emailsyfliu@email.unc.edu

References

  1. Bai, J. and P. Perron (1998). Estimating and testing linear models with multiple structural changes. Econometrica 66(1), 47–78.
  2. Dette, H., J. G¨osmann, et al. (2018). Relevant change points in high dimensional time series. Electronic Journal of Statistics 12(2), 2578–2636.
  3. Fryzlewicz, P. (2014). Wild binary segmentation for multiple change-point detection. The Annals of Statistics 42(6), 2243–2281.
  4. Horv´ath, L. (1995). Detecting changes in linear regressions. Statistics 26(3), 189–208.
  5. Jiang, F., Z. Zhao, and X. Shao (2022). Modelling the covid-19 infection trajectory: A piecewise linear quantile trend model. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 84(5), 1589–1607.
  6. Jirak, M. (2015). Uniform change point tests in high dimension. The Annals of Statistics 43(6), 2451–2483.
  7. Kaul, A., V. K. Jandhyala, and S. B. Fotopoulos (2019). An efficient two step algorithm for high dimensional change point regression models without grid search. Journal of Machine Learning Research 20(111), 1–40.
  8. Koenker, R. and G. Bassett (1978). Regression quantiles. Econometrica 46, 33–50.
  9. Lee, S., Y. Liao, M. H. Seo, and Y. Shin (2018). Oracle estimation of a change point in high dimensional quantile regression. Journal of the American Statistical Association 113(523), 1184–1194.
  10. Lee, S., M. H. Seo, and Y. Shin (2016). The lasso for high dimensional regression with a possible change point. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 78(1), 193–210.
  11. Leonardi, F. and P. B¨uhlmann (2016). Computationally efficient change point detection for high-dimensional regression. arXiv preprint: 1601.03704.
  12. Liu, B., C. Zhou, , X.-S. Zhang, and Y. Liu (2020). A unified data-adaptive framework for high dimensional change point detection. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 82(4), 933–963.
  13. Oka, T. and Z. Qu (2011). Estimating structural changes in regression quantiles. Journal of Econometrics 162(2), 248–267.
  14. Page, E. (1955). Control charts with warning lines. Biometrika 42(1-2), 243–257. Qu, Z.
  15. (2008). Testing for structural change in regression quantiles. Journal of Econometrics 146(1), 170–184.
  16. Quandt, R. E. (1958). The estimation of the parameters of a linear regression system obeying two separate regimes. Journal of the American Statistical Association 53(284), 873–880.
  17. Rinaldo, A., D. Wang, Q. Wen, R. Willett, and Y. Yu (2021). Localizing changes in highdimensional regression models. In International Conference on Artificial Intelligence and Statistics, pp. 2089–2097. PMLR.
  18. Roy, S., Y. Atchad´e, and G. Michailidis (2017). Change point estimation in high dimensional Markov random-field models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 79(4), 1187–1206.
  19. Shao, X. and X. Zhang (2010). Testing for change points in time series. Journal of the American Statistical Association 105(491), 1228–1240.
  20. Wang, D., Z. Zhao, K. Z. Lin, and R. Willett (2021). Statistically and computationally efficient change point localization in regression settings. Journal of Machine Learning Research 22(248), 1–46.
  21. Wang, X., B. Liu, Y. Liu, and X. Zhang (2023). Efficient multiple change point detection for high-dimensional generalized linear models. The Canadian Journal of Statistics 51(2), 596–629.
  22. Xia, Y., T. Cai, and T. T. Cai (2018). Two-sample tests for high-dimensional linear regression with an application to detecting interactions. Statistica Sinica 28(1), 63–92.
  23. Xu, H., D. Wang, Z. Zhao, and Y. Yu (2022). Change point inference in high-dimensional regression models under temporal dependence. arXiv preprint arXiv:2207.12453.
  24. Zhang, L., H. J. Wang, and Z. Zhu (2014). Testing for change points due to a covariate threshold in quantile regression. Statistica Sinica 24, 1859–1877.
  25. Zhao, T., M. Kolar, and H. Liu (2014). A general framework for robust testing and confidence regions in high-dimensional quantile regression. arXiv preprint: 1601.03704.
  26. Zhou, C., W.-X. Zhou, X.-S. Zhang, and H. Liu (2018). A unified framework for testing high dimensional parameters: a data-adaptive approach. arXiv preprint: 1601.03704.
  27. Zou, H. and M. Yuan (2008). Composite quantile regression and the oracle model selection theory. The Annals of Statistics 36(3), 1108–1126.

Acknowledgments

The authors are indebted to the editor, the associate editor, and two

referees, whose helpful comments and suggestions led to a much improved

presentation. Bin Liu was supported in part by the National Natural Science Foundation of China 12101132, 12331009.

Supplementary Materials

The online Supplementary Material provides detailed basic assumptions

and proofs of the main theory, and additional numerical results including

size, power, multiple change point detection. In addition, an interesting

application to the S&P100 data analysis is also provided.


Supplementary materials are available for download.