Win Ratio for Partially Ordered Data

Lu Mao

doi:10.5705/ss.202023.0321

Abstract

The win ratio, initially developed for time-to-event data, can be extended to any data type equipped with a partial order. We study this extension in both nonparametric inference and semiparametric regression. We begin by formulating the win ratio as an estimand of contrast between two populations with partially ordered responses, showing that it reduces to the familiar odds ratio in the case of binary data. For hypothesis testing, we prove that the empirical two-sample win ratio is consistent against stochastically ordered distributions and efficient against proportional odds alternatives under a total order. In regression, we model the conditional win ratio multiplicatively against covariates, extending logistic regression from binary to partially ordered responses. This model is implied by a generalized continuation-ratio logit model but requires fewer assumptions on the relationship between response levels. For inference, we construct a class of weighted U-statistic estimating equations and derive pseudo-efficient weights to improve efficiency. Simulation studies demonstrate that the proposed procedures perform well in both testing and regression under finite samples. As illustrations, we analyze bivariate radiologic assessments in a recent liver disease study and subject smoking status in a youth tobacco use study, treating them both as partially ordered outcomes. The proposed methodology is implemented in the R package poset, publicly available on GitHub and the Comprehensive R Archive Network (CRAN).

Key words and phrases: Continuation ratio; Logistic regression; Odds ratio; Ordinal data; Stochastic order; U-statistic
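As a minimal illustration of the estimand described in the abstract, the sketch below computes an empirical two-sample win ratio as the number of pairwise wins divided by the number of pairwise losses, with tied or incomparable pairs counting toward neither. It is not the poset package's implementation. The componentwise (product) order, the function names, and the toy data are all assumptions made for this example. The binary case confirms the reduction to the odds ratio.

```python
from itertools import product


def product_order_greater(a, b):
    """True if a dominates b componentwise and a != b (assumed partial order)."""
    return all(x >= y for x, y in zip(a, b)) and tuple(a) != tuple(b)


def win_ratio(treated, control, greater=product_order_greater):
    """Empirical two-sample win ratio: (# treated wins) / (# treated losses)."""
    wins = losses = 0
    for y1, y0 in product(treated, control):
        if greater(y1, y0):
            wins += 1
        elif greater(y0, y1):
            losses += 1
        # Otherwise the pair is tied or incomparable and adds to neither count.
    if losses == 0:
        raise ZeroDivisionError("no losses observed; win ratio is undefined")
    return wins / losses


# Binary responses coded as 1-tuples so the same comparator applies.
treated_bin = [(1,)] * 6 + [(0,)] * 4   # 6 successes, 4 failures (made-up data)
control_bin = [(1,)] * 3 + [(0,)] * 7   # 3 successes, 7 failures (made-up data)
wr_binary = win_ratio(treated_bin, control_bin)
odds_ratio = (6 * 7) / (4 * 3)          # cross-product ratio of the 2x2 table

# Bivariate toy responses (made up); some pairs are incomparable.
treated_biv = [(2, 1), (1, 2), (0, 0)]
control_biv = [(1, 1), (2, 0), (0, 2)]
wr_bivariate = win_ratio(treated_biv, control_biv)

print(wr_binary, odds_ratio, wr_bivariate)
```

In the binary case a treated subject wins only when it scores 1 against a control scoring 0, so the win and loss counts are the two cross products of the 2x2 table and their ratio is the sample odds ratio. In the bivariate case, pairs such as (2, 1) versus (0, 2) are incomparable under the assumed order and are left out of both counts.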

Information

Preprint No.	SS-2023-0321
Manuscript ID	SS-2023-0321
Complete Authors	Lu Mao
Corresponding Authors	Lu Mao
Emails	lmao@biostat.wisc.edu

References

Agresti, A. (2010). Analysis of Ordinal Categorical Data. New York: John Wiley & Sons.
Andersen, P. K. and Gill, R. D. (1982). Cox’s regression model for counting processes: a large sample study. Annals of Statistics 10, 1100–1120.
Arcones, M. A. and Giné, E. (1993). Limit theorems for U-processes. Annals of Probability 21, 1494–1542.
Armstrong, B. G. and Sloan, M. (1989). Ordinal regression models for epidemiologic data. American Journal of Epidemiology 129, 191–204.
Bebu, I. and Lachin, J. M. (2016). Large sample inference for a win ratio analysis of a composite outcome based on prioritized components. Biostatistics 17, 178–187.
Bickel, P. J., Klaassen, C. A., Ritov, Y. A. and Wellner, J. A. (1993). Efficient and Adaptive Estimation for Semiparametric Models. Baltimore: Johns Hopkins University Press.
Dong, G., Li, D., Ballerstedt, S. and Vandemeulebroecke, M. (2016). A generalized analytic solution to the win ratio to analyze a composite endpoint considering the clinical importance order among components. Pharmaceutical Statistics 15, 430–437.
Edge, S., Byrd, D. R., Compton, C. C., Fritz, A. G. and Greene, F. L. (2010). AJCC Cancer Staging Handbook: From the AJCC Cancer Staging Manual. New York: Springer.
Fill, J. A. and Machida, M. (2001). Stochastic monotonicity and realizable monotonicity. Annals of Probability 29, 938–978.
Garg, V. K. (2015). Introduction to Lattice Theory with Computer Science Applications. Hoboken: Wiley.
Hoeffding, W. (1948). A class of statistics with asymptotically normal distribution. Annals of Mathematical Statistics 19, 293–325.
Kamae, T. and Krengel, U. (1978). Stochastic partial ordering. Annals of Probability 6, 1044–1049.
Lin, Y., Wang, S. and Chappell, R. J. (2018). Lasso tree for cancer staging with survival data. Biostatistics 14, 327–339.
Luo, X., Tian, H., Mohanty, S. and Tsai, W. Y. (2015). An alternative approach to confidence interval estimation for the win ratio statistic. Biometrics 71, 139–145.
Mao, L. (2018). On causal estimation using U-statistics. Biometrika 105, 215–220.
Mao, L. (2022). On the relative efficiency of intent-to-treat Wilcoxon–Mann–Whitney test in the presence of non-compliance. Biometrika 109, 873–880.
Mao, L. and Wang, T. (2021). A class of proportional win-fractions regression models for composite outcomes. Biometrics 77, 1265–1275.
McCullagh, P. (1980). Regression models for ordinal data. Journal of the Royal Statistical Society, Series B 42, 109–127.
Mondal, D. and Hinrichs, N. (2016). Rank tests from partially ordered data using importance and MCMC sampling methods. Statistical Science 31, 325–347.
Peyhardi, J., Trottier, C. and Guédon, Y. (2016). Partitioned conditional generalized linear models for categorical responses. Statistical Modelling 16, 297–321.
Pocock, S. J., Ariti, C. A., Collier, T. J. and Wang, D. (2012). The win ratio: a new approach to the analysis of composite endpoints in clinical trials based on clinical priorities. European Heart Journal 33, 176–182.
Rosenbaum, P. R. (1991). Some poset statistics. Annals of Statistics 19, 1091–1097.
Thas, O., De Neve, J., Clement, L. and Ottoy, J. P. (2012). Probabilistic index models. Journal of the Royal Statistical Society, Series B 74, 623–671.
Trotter, W. T. (1992). Combinatorics and Partially Ordered Sets: Dimension Theory. Baltimore: Johns Hopkins University Press.
van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge: Cambridge University Press.
van der Vaart, A. W. and Wellner, J. A. (1996). Weak Convergence and Empirical Processes. New York: Springer-Verlag.
Vermeulen, K., Amorim, G., De Neve, J., Thas, O. and Vansteelandt, S. (2023). Semiparametric estimation of probabilistic index models: efficiency and bias. Statistica Sinica 33, 1003–1024.
Wittkowski, K. M., Lee, E., Nussbaum, R., Chamian, F. N. and Krueger, J. G. (2004). Combining several ordinal measures in clinical studies. Statistics in Medicine 23, 1579–1592.
Zhang, Q. and Ip, E. H. (2012). Generalized linear model for partially ordered data. Statistics in Medicine 31, 56–68.

Acknowledgments

This research was supported by the U.S. National Institutes of Health grant

R01HL149875 and National Science Foundation grant DMS-2015526.

Supplementary Materials

The Supplementary Material includes technical results and additional numerical studies. An R package, poset, that implements the proposed methodology is available on GitHub at https://lmaowisc.github.io/poset as well as on the Comprehensive R Archive Network (CRAN), both with a tutorial based on the liver study in Section 5.1.

Supplementary materials are available for download.
