Estimation of Conditional Extremiles in Reproducing Kernel Hilbert Spaces with Application to Large Commercial Banks Data

Fang Chen and Caixing Wang

doi:10.5705/ss.202025.0294

Abstract

As analogs of quantiles, extremiles are coherent spectral risk measures

with explicit formulations and intuitive interpretations. Their inherent sensitivity to the magnitude of extreme outcomes makes them particularly suitable for

heavy-tailed data.

However, existing extremile estimation methods rarely exploit rich auxiliary covariate information, which limits their ability to capture

conditional extreme patterns and to extrapolate reliably at very high risk levels.

This paper proposes a new nonparametric framework for estimating conditional

extremiles in the presence of multiple covariates. By combining reproducing kernel Hilbert spaces (RKHS) with a quantile regression process approximation, our

method flexibly models the conditional extremile structure while enabling reliable

extrapolation for heavy-tailed distributions. We establish the non-asymptotic error bound for the estimation error, rigorously justifying its theoretical validity.

Simulation studies show that our approach outperforms existing competitors in

both efficiency and extrapolation accuracy in heavy-tailed settings. An empirical

application to large commercial banks further illustrates its practical value for

extreme risk measurement.

Key words and phrases: Asymmetric least squares, extreme value theory, heavy tails, quantile regression

Information

Preprint No.	SS-2025-0294
Manuscript ID	SS-2025-0294
Complete Authors	Fang Chen, Caixing Wang
Corresponding Authors	Caixing Wang
Emails	wangcaixing96@gmail.com

References

Acerbi, C. and Szekely, B. (2014). Back-testing expected shortfall. Risk, 27(11):76–81.
Adrian, T. and Fleming, M. (2022). The bond market selloff in historical perspective. Liberty street economics working paper, Federal Reserve Bank of New York.
Aon (2020). Economic losses from natural disasters top $232 billion in 2019 as the costliest decade on record comes to a close. Catastrophe report, Aon. https://caribbeannewsglobal. com/economic-losses-from-natural-disasters-top-232-billion-in-2019/.
Artzner, P., Delbaen, F., Eber, J., and Heath, D. (1999). Coherent measures of risk. Mathematical Finance, 9(3):203–228.
Belloni, A., Chernozhukov, V., Chetverikov, D., and Fern´andez-Val, I. (2019). Conditional quantile processes based on series or many regressors. Journal of Econometrics, 213(1):4–29.
Berlinet, A. and Thomas-Agnan, C. (2004). Reproducing Kernel Hilbert Spaces in Probability and Statistics. New York: Springer.
Blanchard, G., Bousquet, O., and Massart, P. (2008). Statistical performance of support vector machines. Annals of Statistics, 36(2):489–531.
Chen, F., He, X., and Wang, J. (2021). Learning sparse conditional distribution: An efficient kernel-based approach. Electronic Journal of Statistics, 15:1610–1635.
Daouia, A., Gijbels, I., and Stupfler, G. (2019). Extremiles: A new perspective on asymmetric least squares. Journal of the American Statistical Association, 114(527):1366–1381.
Daouia, A., Gijbels, I., and Stupfler, G. (2022). Extremile regression. Journal of the American Statistical Association, 117(539):1579–1586.
Daouia, A., Girard, S., and Stupfler, G. (2018). Estimation of tail risk based on extreme expectiles. Journal of the Royal Statistical Society: Series B, 80(2):263–292.
Daouia, A., Girard, S., and Stupfler, G. (2020). Tail expectile process and risk assessment. Bernoulli, 26(1):531–556.
De Haan, L. and Ferreira, A. (2006). Extreme Value Theory: An Introduction. Springer.
Dupuis, D., Sun, Y., and Wang, H. (2015). Detecting change-points in temperature extremes. Statistics and Its Interface, 8:19–31.
Embrechts, P., Kl¨uppelberg, C., and Mikosch, T. (1997). Modelling Extremal Events for Insurance and Finance. New York: Springer.
Feng, X., Liu, Q., and Wang, C. (2023). A lack-of-fit test for quantile regression process models. Statistics & Probability Letters, 192:109680.
Friederichs, P. and Hense, A. (2007). Statistical downscaling of extreme precipitation events using censored quantile regression. Monthly Weather Review, 135(7):2388–2401.
Gardes, L. and Girard, S. (2010). Conditional extremes from heavy-tailed distributions: An application to the estimation of extreme rainfall return level. Extremes, 13:177–204.
Girard, S., Stupfler, G., and Usseglio-Carleve, A. (2022). Nonparametric extreme conditional expectile estimation. Scandinavian Journal of Statistics, 49(1):78–115.
He, X., Wang, J., and Lv, S. (2021). Efficient kernel-based variable selection with sparsistency. Statistica Sinica, 31(4):2123–2151.
Hill, B. M. (1975). A simple general approach to inference about the tail of a distribution. Annals of Statistics, 3(5):1163–1174.
Horv´ath, L. and Yandell, B. S. (1988). Asymptotics of conditional empirical processes. Journal of Multivariate Analysis, 26(2):184–206.
Li, D. and Wang, H. J. (2019). Extreme quantile estimation for autoregressive models. Journal of Business & Economic Statistics, 37(4):661–670.
Li, Y., Liu, Y., and Zhu, J. (2007). Quantile regression in reproducing kernel hilbert spaces. Journal of the American Statistical Association, 102(477):255–268.
Lin, S.-B., Guo, X., and Zhou, D.-X. (2017). Distributed learning with regularized least squares. Journal of Machine Learning Research, 18(92):1–31.
Linsmeier, T. J. and Pearson, N. D. (2000). Value at risk. Financial Analysts Journal, 56:47–67.
Marimoutou, V., Raggad, B., and Trabelsi, A. (2009). Extreme value theory and value at risk: Application to oil market. Energy Economics, 31(4):519–530.
Newey, W. K. and Powell, J. L. (1987). Asymmetric least squares estimation and testing. Econometrica, 55(4):819–847.
Odening, M. and Hinrichs, J. (2003). Using extreme value theory to estimate value at risk. Agricultural Finance Review, 63:55–73.
Rahimi, A. and Recht, B. (2007). Random features for large-scale kernel machines. Advances in Neural Information Processing Systems, 20:1177–1184.
Resnick, S. (2007). Heavy-Tail Phenomena: Probabilistic and Statistical Modeling. New York: Springer.
Rudi, A., Camoriano, R., and Rosasco, L. (2015). Less is more: Nystr¨om computational regularization. Advances in Neural Information Processing Systems, 28:1657–1665.
Schaumburg, J. (2012). Predicting extreme value at risk: Nonparametric quantile regression with refinements from extreme value theory. Computational Statistics and Data Analysis, 56(12):4081–4096.
Shawe-Taylor, J. and Cristianini, N. (2004). Kernel Methods for Pattern Analysis. Cambridge University Press.
Smale, S. and Zhou, D. (2007). Learning theory estimates via integral operators and their approximations. Constructive Approximation, 26(2):153–172.
Steinwart, I. (2005). Consistency of support vector machines and other regularized kernel classifiers. IEEE Transactions on Information Theory, 51(1):128–142.
Steinwart, I. and Scovel, C. (2007). Fast rates for support vector machines using Gaussian kernels. Annals of Statistics, 35(2):575–607.
Takeuchi, I., Le, Q. V., Sears, T. D., and Smola, A. J. (2006). Nonparametric quantile estimation. Journal of Machine Learning Research, 7:1231–1264.
Wahba, G. (1990). Spline Models for Observational Data. Society for Industrial and Applied
Mathematics, Philadelphia, PA.
Wang, C. and Feng, X. (2024). Optimal kernel quantile learning with random features. In International Conference on Machine Learning, pages 50419–50452. PMLR.
Wang, C., Li, T., Zhang, X., Feng, X., and He, X. (2024). Communication-efficient nonparametric quantile regression via random features. Journal of Computational and Graphical Statistics, 33(4):1175–1184.
Wang, H. and Tsai, C. L. (2009). Tail index regression. Journal of the American Statistical Association, 104(485):1233–1240.
Wang, H. J. and Li, D. (2013). Estimation of extreme conditional quantiles through power transformation. Journal of the American Statistical Association, 108(503):1062–1074.
Wang, H. J., Li, D., and He, X. (2012). Estimation of high conditional quantiles for heavy-tailed distributions. Journal of the American Statistical Association, 107(498):1453–1464.
Wang, S., Shao, J., and Kim, J. K. (2014). An instrumental variable approach for identification and estimation with nonignorable nonresponse. Statistica Sinica, 24(3):1097–1116.
Weissman, I. (1978). Estimation of parameters and large quantiles based on the k largest observations. Journal of the American Statistical Association, 73(364):812–815.
Xu, W., Hou, Y., and Li, D. (2022a). Prediction of extremal expectile based on regression models with heteroscedastic extremes. Journal of Business & Economic Statistics, 40:522–536.
Xu, W., Wang, H., and Li, D. (2022b). Extreme quantile estimation for single index model. Statistica Sinica, 32:893–914.
Yang, L., Lv, S., and Wang, J. (2016). Model-free variable selection in reproducing kernel Hilbert space. Journal of Machine Learning Research, 17(1):2885–2908.
Yang, Y., Zhang, T., and Zou, H. (2018). Flexible expectile regression in reproducing kernel Hilbert spaces. Technometrics, 60(1):26–35.
Yao, Q. and Tong, H. (1996). Asymmetric least squares regression estimation: A nonparametric approach. Journal of Nonparametric Statistics, 6(2-3):273–292.
Yi, Y., Feng, X., and Huang, Z. (2014). Estimation of extreme value-at-risk: an EVT approach for quantile GARCH model. Economics Letters, 124(3):378–381.
Zhang, C., Liu, Y., and Wu, Y. (2016). On quantile regression in reproducing kernel Hilbert spaces with the data sparsity constraint. Journal of Machine Learning Research, 17(1):1374– 1418.

Acknowledgments

The authors thank the editor, the associate editor, and two anonymous referees for their constructive suggestions, which significantly improved this paper.

Supplementary Materials

The supplementary materials contain some useful lemmas and the detailed proofs of the main

results in this paper.

Supplementary materials are available for download.

[1] Acerbi, C. and Szekely, B. (2014). Back-testing expected shortfall. Risk, 27(11):76–81.

[2] Adrian, T. and Fleming, M. (2022). The bond market selloff in historical perspective. Liberty street economics working paper, Federal Reserve Bank of New York.

[3] Aon (2020). Economic losses from natural disasters top $232 billion in 2019 as the costliest decade on record comes to a close. Catastrophe report, Aon. https://caribbeannewsglobal. com/economic-losses-from-natural-disasters-top-232-billion-in-2019/.

[4] Artzner, P., Delbaen, F., Eber, J., and Heath, D. (1999). Coherent measures of risk. Mathematical Finance, 9(3):203–228.

[5] Belloni, A., Chernozhukov, V., Chetverikov, D., and Fern´andez-Val, I. (2019). Conditional quantile processes based on series or many regressors. Journal of Econometrics, 213(1):4–29.

[6] Berlinet, A. and Thomas-Agnan, C. (2004). Reproducing Kernel Hilbert Spaces in Probability and Statistics. New York: Springer.

[7] Blanchard, G., Bousquet, O., and Massart, P. (2008). Statistical performance of support vector machines. Annals of Statistics, 36(2):489–531.

[8] Chen, F., He, X., and Wang, J. (2021). Learning sparse conditional distribution: An efficient kernel-based approach. Electronic Journal of Statistics, 15:1610–1635.

[9] Daouia, A., Gijbels, I., and Stupfler, G. (2019). Extremiles: A new perspective on asymmetric least squares. Journal of the American Statistical Association, 114(527):1366–1381.

[10] Daouia, A., Gijbels, I., and Stupfler, G. (2022). Extremile regression. Journal of the American Statistical Association, 117(539):1579–1586.

[11] Daouia, A., Girard, S., and Stupfler, G. (2018). Estimation of tail risk based on extreme expectiles. Journal of the Royal Statistical Society: Series B, 80(2):263–292.

[12] Daouia, A., Girard, S., and Stupfler, G. (2020). Tail expectile process and risk assessment. Bernoulli, 26(1):531–556.

[13] De Haan, L. and Ferreira, A. (2006). Extreme Value Theory: An Introduction. Springer.

[14] Dupuis, D., Sun, Y., and Wang, H. (2015). Detecting change-points in temperature extremes. Statistics and Its Interface, 8:19–31.

[15] Embrechts, P., Kl¨uppelberg, C., and Mikosch, T. (1997). Modelling Extremal Events for Insurance and Finance. New York: Springer.

[16] Feng, X., Liu, Q., and Wang, C. (2023). A lack-of-fit test for quantile regression process models. Statistics & Probability Letters, 192:109680.

[17] Friederichs, P. and Hense, A. (2007). Statistical downscaling of extreme precipitation events using censored quantile regression. Monthly Weather Review, 135(7):2388–2401.

[18] Gardes, L. and Girard, S. (2010). Conditional extremes from heavy-tailed distributions: An application to the estimation of extreme rainfall return level. Extremes, 13:177–204.

[19] Girard, S., Stupfler, G., and Usseglio-Carleve, A. (2022). Nonparametric extreme conditional expectile estimation. Scandinavian Journal of Statistics, 49(1):78–115.

[20] He, X., Wang, J., and Lv, S. (2021). Efficient kernel-based variable selection with sparsistency. Statistica Sinica, 31(4):2123–2151.

[21] Hill, B. M. (1975). A simple general approach to inference about the tail of a distribution. Annals of Statistics, 3(5):1163–1174.

[22] Horv´ath, L. and Yandell, B. S. (1988). Asymptotics of conditional empirical processes. Journal of Multivariate Analysis, 26(2):184–206.

[23] Li, D. and Wang, H. J. (2019). Extreme quantile estimation for autoregressive models. Journal of Business & Economic Statistics, 37(4):661–670.

[24] Li, Y., Liu, Y., and Zhu, J. (2007). Quantile regression in reproducing kernel hilbert spaces. Journal of the American Statistical Association, 102(477):255–268.

[25] Lin, S.-B., Guo, X., and Zhou, D.-X. (2017). Distributed learning with regularized least squares. Journal of Machine Learning Research, 18(92):1–31.

[26] Linsmeier, T. J. and Pearson, N. D. (2000). Value at risk. Financial Analysts Journal, 56:47–67.

[27] Marimoutou, V., Raggad, B., and Trabelsi, A. (2009). Extreme value theory and value at risk: Application to oil market. Energy Economics, 31(4):519–530.

[28] Newey, W. K. and Powell, J. L. (1987). Asymmetric least squares estimation and testing. Econometrica, 55(4):819–847.

[29] Odening, M. and Hinrichs, J. (2003). Using extreme value theory to estimate value at risk. Agricultural Finance Review, 63:55–73.

[30] Rahimi, A. and Recht, B. (2007). Random features for large-scale kernel machines. Advances in Neural Information Processing Systems, 20:1177–1184.

[31] Resnick, S. (2007). Heavy-Tail Phenomena: Probabilistic and Statistical Modeling. New York: Springer.

[32] Rudi, A., Camoriano, R., and Rosasco, L. (2015). Less is more: Nystr¨om computational regularization. Advances in Neural Information Processing Systems, 28:1657–1665.

[33] Schaumburg, J. (2012). Predicting extreme value at risk: Nonparametric quantile regression with refinements from extreme value theory. Computational Statistics and Data Analysis, 56(12):4081–4096.

[34] Shawe-Taylor, J. and Cristianini, N. (2004). Kernel Methods for Pattern Analysis. Cambridge University Press.

[35] Smale, S. and Zhou, D. (2007). Learning theory estimates via integral operators and their approximations. Constructive Approximation, 26(2):153–172.

[36] Steinwart, I. (2005). Consistency of support vector machines and other regularized kernel classifiers. IEEE Transactions on Information Theory, 51(1):128–142.

[37] Steinwart, I. and Scovel, C. (2007). Fast rates for support vector machines using Gaussian kernels. Annals of Statistics, 35(2):575–607.

[38] Takeuchi, I., Le, Q. V., Sears, T. D., and Smola, A. J. (2006). Nonparametric quantile estimation. Journal of Machine Learning Research, 7:1231–1264.

[39] Wahba, G. (1990). Spline Models for Observational Data. Society for Industrial and Applied

[40] Mathematics, Philadelphia, PA.

[41] Wang, C. and Feng, X. (2024). Optimal kernel quantile learning with random features. In International Conference on Machine Learning, pages 50419–50452. PMLR.

[42] Wang, C., Li, T., Zhang, X., Feng, X., and He, X. (2024). Communication-efficient nonparametric quantile regression via random features. Journal of Computational and Graphical Statistics, 33(4):1175–1184.

[43] Wang, H. and Tsai, C. L. (2009). Tail index regression. Journal of the American Statistical Association, 104(485):1233–1240.

[44] Wang, H. J. and Li, D. (2013). Estimation of extreme conditional quantiles through power transformation. Journal of the American Statistical Association, 108(503):1062–1074.

[45] Wang, H. J., Li, D., and He, X. (2012). Estimation of high conditional quantiles for heavy-tailed distributions. Journal of the American Statistical Association, 107(498):1453–1464.

[46] Wang, S., Shao, J., and Kim, J. K. (2014). An instrumental variable approach for identification and estimation with nonignorable nonresponse. Statistica Sinica, 24(3):1097–1116.

[47] Weissman, I. (1978). Estimation of parameters and large quantiles based on the k largest observations. Journal of the American Statistical Association, 73(364):812–815.

[48] Xu, W., Hou, Y., and Li, D. (2022a). Prediction of extremal expectile based on regression models with heteroscedastic extremes. Journal of Business & Economic Statistics, 40:522–536.

[49] Xu, W., Wang, H., and Li, D. (2022b). Extreme quantile estimation for single index model. Statistica Sinica, 32:893–914.

[50] Yang, L., Lv, S., and Wang, J. (2016). Model-free variable selection in reproducing kernel Hilbert space. Journal of Machine Learning Research, 17(1):2885–2908.

[51] Yang, Y., Zhang, T., and Zou, H. (2018). Flexible expectile regression in reproducing kernel Hilbert spaces. Technometrics, 60(1):26–35.

[52] Yao, Q. and Tong, H. (1996). Asymmetric least squares regression estimation: A nonparametric approach. Journal of Nonparametric Statistics, 6(2-3):273–292.

[53] Yi, Y., Feng, X., and Huang, Z. (2014). Estimation of extreme value-at-risk: an EVT approach for quantile GARCH model. Economics Letters, 124(3):378–381.

[54] Zhang, C., Liu, Y., and Wu, Y. (2016). On quantile regression in reproducing kernel Hilbert spaces with the data sparsity constraint. Journal of Machine Learning Research, 17(1):1374– 1418.