Abstract
Estimating the structures at high or low quantiles has become an important subject and attracted
increasing attention across numerous fields. However, due to data sparsity at tails, it usually is a challenging task
to obtain reliable estimation, especially for high-dimensional data. This paper suggests a flexible parametric
structure to tails, and this enables us to conduct the estimation at quantile levels with rich observations and then
to extrapolate the fitted structures to far tails. The proposed model depends on some quantile indices and hence
is called the quantile index regression. Moreover, the composite quantile regression method is employed to
obtain non-crossing quantile estimators, and this paper further establishes their theoretical properties, including
asymptotic normality for the case with low-dimensional covariates and non-asymptotic error bounds for that
with high-dimensional covariates. Simulation studies and an empirical example are presented to illustrate the
usefulness of the new model.
Information
| Preprint No. | SS-2025-0069 |
|---|---|
| Manuscript ID | SS-2025-0069 |
| Complete Authors | Yingying Zhang, Qianqian Zhu, Yuefeng Si, Guodong Li |
| Corresponding Authors | Qianqian Zhu |
| Emails | zhu.qianqian@mail.shufe.edu.cn |
References
- Abrevaya, J. (2001). The effect of demographics and maternal behavior on the distribution of birth outcomes. Empirical Economics 26, 247–259.
- Beirlant, J. and Y. Goegebeur (2004). Local polynomial maximum likelihood estimation for Pareto-type distributions. Journal of Multivariate Analysis 89, 97–118.
- Belloni, A. and V. Chernozhukov (2011). l1-penalized quantile regression in high-dimensional sparse models. The Annals of Statistics 39, 82–130.
- Belloni, A., V. Chernozhukov, D. Chetverikov, and I. Fernandez-Val (2019). Conditional quantile processes based on series or many regressors. Journal of Econometrics 213, 4–29.
- Belloni, A., V. Chernozhukov, and K. Kato (2019). Valid post-selection inference in high-dimensional approximately sparse quantile regression models. Journal of the American Statistical Association 114, 749–758.
- Bertsekas, D. P. (2016). Nonlinear Programming. Athena Scientific.
- Cade, B. S. and B. R. Noon (2003). A gentle introduction to quantile regression for ecologists. Frontiers in Ecology and the Environment 1, 412–420.
- Chernozhukov, V. (2005). Extremal quantile regression. The Annals of Statistics 33, 806–839.
- Chernozhukov, V., I. Fern´andez-Val, and A. E. Kowalski (2015). Quantile regression with censoring and endogeneity. Journal of Econometrics 186, 201–221.
- Chernozhukov, V., I. Fern´andez-Val, and B. Melly (2022). Fast algorithms for the quantile regression process. Empirical economics 62, 1–27.
- Davino, C., M. Furno, and D. Vistocco (2014). Quantile Regression: Theory and Applications. New York: Wiley.
- de Haan, L. and A. Ferreira (2006). Extreme Value Theory: An Introduction. Springer.
- Dong, C., G. Li, and X. Feng (2019). Lack-of-fit tests for quantile regression models. Journal of the Royal Statistical Society, Series B 81, 629–648.
- Elsner, J. B., J. P. Kossin, and T. H. Jagger (2008). The increasing intensity of the strongest tropical cyclones. Nature 455, 92–95.
- Fan, J. and R. Li (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96, 1348–1360.
- Fenske, N., T. Kneib, and T. Hothorn (2011). Identifying risk factors for severe childhood malnutrition by boosting additive quantile regression. Journal of the American Statistical Association 106, 494–510.
- Fisher, R. A. and L. H. C. Tippett (1928). Limiting forms of the frequency distribution of the largest or smallest member of a sample. Mathematical Proceedings of the Cambridge Philosophical Society 24, 180–190.
- Fournier, B., N. Rupin, M. Bigerelle, D. Najjar, A. Iost, and R. Wilcox (2007). Estimating the parameters of a generalized lambda distribution. Computational Statistics & Data Analysis 51, 2813–2835.
- Freimer, M., G. Kollia, G. S. Mudholkar, and C. T. Lin (1988). A study of the generalized tukey lambda family. Communications in Statistics-Theory and Methods 17, 3547–3567.
- Galvao, A. F. and K. Kato (2016). Smoothed quantile regression for panel data. Journal of Econometrics 193, 92–112.
- Gilchrist, W. G. (2000). Statistical modelling with quantile functions. London: Chapman & Hall/CRC.
- Hendricks, W. and R. Koenker (1991). Hierarchical spline models for conditional quantiles and the demand for electricity. Journal of the American Statistical Association 87, 58–68.
- Jagger, T. H. and J. B. Elsner (2008). Modeling tropical cyclone intensity with quantile regression. International Journal of Climatology 29, 1351–1361.
- Jiang, X., J. Jiang, and X. Song (2012). Oracle model selection for nonlinear models based on weighted composite quantile regression. Statistica Sinica 22, 1479–1506.
- Kai, B., R. Li, and H. Zou (2010). Local composite quantile regression smoothing: An efficient and safe alternative to local polynomial regression. Journal of the Royal Statistical Society, Series B 72, 49–69.
- Kato, K., A. F. Galvao, and G. V. Montes-Rojas (2012). Asymptotics for panel quantile regression models with individual effects. Journal of Econometrics 170, 76–91.
- Koenker, R. (2005). Quantile regression. Cambridge: Cambridge University Press.
- Koenker, R. (2011). Additive models for quantile regression: Model selection and confidence bandaids. Brazilian Journal of Probability and Statistics 25, 239–262.
- Koenker, R. (2017). Quantile regression: 40 years on. Annual Review of Economics 9, 155–176.
- Koenker, R. and G. Bassett (1978). Regression quantiles. Econometrica 46, 33–50.
- Kuester, K., S. Mittnik, and M. S. Paolella (2006). Value-at-Risk prediction: A comparison of alternative strategies. Journal of Financial Econometrics 4, 53–89.
- Linton, O. and Z. Xiao (2017). Quantile regression applications in finance. In Handbook of Quantile Regression, pp. 381–407. Chapman and Hall/CRC.
- Lobeto, H., M. Menendez, and I. J. Losada (2021). Future behavior of wind wave extremes due to climate change. Scientific reports 11, 1–12.
- Loh, P. (2017). Statistical consistency and asymptotic normality for high-dimensional robust M-estimators. The Annals of Statistics 45, 866–896.
- Loh, P. and M. J. Wainwright (2015). Regularized M-estimators with nonconvexity: Statistical and algorithmic theory for local optima. Journal of Machine Learning Research 16, 559–616.
- Meinshausen, N. and G. Ridgeway (2006). Quantile regression forests. Journal of Machine Learning Research 7, 983–999.
- Moon, S. J., J.-J. Jeon, J. S. H. Lee, and Y. Kim (2021). Learning multiple quantiles with neural networks. Journal of Computational and Graphical Statistics 30, 1238–1248.
- Newey, K. and D. McFadden (1994). Large sample estimation and hypothesis testing. Handbook of Econometrics 4, 2112–2245.
- Noufaily, A. and M. C. Jones (2013). Parametric quantile regression based on the generalized gamma distribution. Journal of the Royal Statistical Society, Series C 62, 723–740.
- Pan, X. and W.-X. Zhou (2021). Multiplier bootstrap for quantile regression: Non-asymptotic theory under random design. Information and Inference 3, 813–861.
- Pickands, J. (1975). Statistical Inference Using Extreme Order Statistics. The Annals of Statistics 3, 119–131.
- Pollard, D. (1985). New ways to prove central limit theorems. Econometric Theory 1, 295–313.
- Racine, J. S. and K. Li (2017). Nonparametric conditional quantile estimation: a locally weighted quantile kernel approach. Journal of Econometrics 201, 72–94.
- Sherwood, B., L. Wang, et al. (2016). Partially linear additive quantile regression in ultra-high dimension. The Annals of Statistics 44, 288–317.
- Sivakumar, V. and A. Banerjee (2017). High-dimensional structured quantile regression. In International Conference on Machine Learning, pp. 3220–3229.
- van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge: Cambridge University Press.
- Vasicek, O. (1976). A test for normality based on sample entropy. Journal of the Royal Statistical Society, Series B 38, 54–59.
- Wang, H. and C.-L. Tsai (2009). Tail index regression. Journal of the American Statistical Association 104, 1233–1240.
- Wang, H. J. and D. Li (2013). Estimation of extreme conditional quantiles through power transformation. Journal of the American Statistical Association 108, 1062–1074.
- Wang, H. J., D. Li, and X. He (2012). Estimation of high conditional quantiles for heavy-tailed distributions. Journal of the American Statistical Association 107, 1453–1464.
- Wang, L. and X. He (2022). Analysis of global and local optima of regularized quantile regression in high dimensions: A subgradient approach. Econometric Theory 40, 233–277.
- Wang, L., Y. Wu, and R. Li (2012). Quantile regression for analyzing heterogeneity in ultra-high dimension. Journal of the American Statistical Association 107, 214–222.
- Yu, K., Z. Lu, and J. Stander (2003). Quantile regression: applications and current research areas. Journal of the Royal Statistical Society: Series D 52, 331–350.
- Zhang, C.-H. (2010). Nearly unbiased variable selection under minimax concave penalty. The Annals of Statistics 38, 894–942.
- Zhang, Y., H. Lian, and Y. Yu (2020). Ultra-high dimensional single-index quantile regression. Journal of Machine Learning Research 21, 1–25.
- Zheng, Q., L. Peng, and X. He (2015). Globally adaptive quantile regression with ultra-high dimensional data. The Annals of Statistics 43, 2225–2258.
- Zheng, Y., Q. Zhu, G. Li, and Z. Xiao (2018). Hybrid quantile regression estimation for time series models with conditional heteroscedasticity. Journal of the Royal Statistical Society, Series B 80, 975–993.
- Zhu, K. and S. Ling (2011). Global self-weighted and local quasi-maximum exponential likelihood estimators for arma– garch/igarch models. The Annals of Statistics 39, 2131–2163.
- Zhu, Q. and G. Li (2022). Quantile double autoregression. Econometric Theory 38, 793–839.
- Zhu, Q., Y. Zheng, and G. Li (2018). Linear double autoregression. Journal of Econometrics 207, 162–174.
- Zou, H. and M. Yuan (2008). Composite quantile regression and the oracle model selection theory. The Annals of Statistics 36, 1108–1126. Yingying Zhang, East China Normal University, KLATASDS - MOE, School of Statistics, Academy of Statistics and Interdisciplinary Sciences
Acknowledgments
We are deeply grateful to the Co-Editor, the Associate Editor and two anonymous referees for
their valuable comments that led to the substantial improvement in the quality of this paper.
Zhang’s research was supported by the National Natural Science Foundation of China Grants
12471280 and 12101241. Zhu’s research was supported by the National Natural Science Foundation of China Grants 72373087 and 12271330. Li’s research was supported by the Hong
Kong Research Grant Council Grants 17313722 and 17309625, and the National Natural Science Foundation of China Grant 72033002.
Supplementary Materials
The online supplementary materials include the proofs of Proposition 1 and all theorems, and
additional numerical results for simulation and real data analysis.