Abstract
The win ratio, initially developed for time-to-event data, can be ex
tended to any data type equipped with a partial order. We study this extension
in both nonparametric inference and semiparametric regression. We begin by
formulating the win ratio as an estimand of contrast between two populations
with partially ordered responses, showing that it reduces to the familiar odds
ratio in the case of binary data. For hypothesis testing, we prove that the empirical two-sample win ratio is consistent against stochastically ordered distributions
and efficient against proportional odds alternatives under a total order. In regression, we model the conditional win ratio multiplicatively against covariates,
extending logistic regression from binary to partially ordered responses.
This
model is implied by a generalized continuation-ratio logit model but requires
fewer assumptions on the relationship between response levels. To make inference, we construct a class of weighted U-statistic estimating equations and derive
pseudo-efficient weights to improve efficiency.
Simulation studies demonstrate
that the proposed procedures perform well in both testing and regression under
finite samples. As illustrations, we analyze bivariate radiologic assessments in
a recent liver disease study and subject smoking status in a youth tobacco use
study, treating them both as partially ordered outcomes. The proposed methodology is implemented in the R package poset, publicly available on GitHub at
work (CRAN).
Information
| Preprint No. | SS-2023-0321 |
|---|---|
| Manuscript ID | SS-2023-0321 |
| Complete Authors | Lu Mao |
| Corresponding Authors | Lu Mao |
| Emails | lmao@biostat.wisc.edu |
References
- Agresti, A. (2010). Analysis of ordinal categorical data. New York: John Wiley & Sons.
- Andersen, P. K. and Gill, R. D. (1982). Cox’s regression model for counting processes: a large sample study. Annals of Statistics 10, 1100-1120.
- Arcones, M. A. and Gin´e, E. (1993). Limit theorems for U-processes. Annals of Probability 21, 1494–1542.
- Armstrong, B. G. and Sloan, M. (1989). Ordinal regression models for epidemiologic data. American Journal of Epidemiology 129, 191–204.
- Bebu, I. and Lachin, J. M. (2016). Large sample inference for a win ratio analysis of a composite outcome based on prioritized components. Biostatistics 17, 178–187.
- Bickel, P. J., Klaassen, C. A., Ritov, Y. A. and Wellner, J. A. (1993). Efficient and Adaptive Estimation for Semiparametric Models. Baltimore: Johns Hopkins University Press.
- Dong, G., Li, D., Ballerstedt, S. and Vandemeulebroecke, M. (2016). A generalized analytic solution to the win ratio to analyze a composite endpoint considering the clinical importance order among components. Pharmaceutical Statistics 15, 430–437.
- Edge, S., Byrd, D. R., Compton, C. C., Fritz, A. G. and Greene, F. L. (2010). AJCC Cancer Staging Handbook: From the AJCC Cancer Staging Manual. New York: Springer.
- Fill, J. A. and Machida, M. (2001). Stochastic monotonicity and realizable monotonicity. Annals of Probability 29, 938–978.
- Garg, V. K. (2015). Introduction to Lattice Theory with Computer Science Applications. Hoboken: Wiley.
- Hoeffding, W. (1948). A class of statistics with asymptotically normal distribution. Annals of Mathematical Statistics 19, 293–325.
- Kamae, T. and Krengel, U. (1978). Stochastic partial ordering. Annals of Probability 6, 1044–1049.
- Lin, Y., Wang, S. & Chappell, R. J.
- (2018). Lasso tree for cancer staging with survival data. Biostatistics 14, 327–339.
- Luo, X., Tian, H., Mohanty, S. and Tsai, W. Y. (2015). An alternative approach to confidence interval estimation for the win ratio statistic. Biometrics 71, 139–145.
- Mao, L. (2018). On causal estimation using U-statistics. Biometrika 105, 215-220.
- Mao, L. (2022). On the relative efficiency of intent-to-treat Wilcoxon–Mann–Whitney test in the presence of non-compliance. Biometrika, 109, 873–880.
- Mao, L. and Wang, T.
- (2021). A class of proportional win-fractions regression models for composite outcomes. Biometrics, 77, 1265–1275.
- McCullagh, P. (1980). Regression models for ordinal data. Journal of the Royal Statistical
- Society, Series B, 42, 109–127.
- Mondal, D. and Hinrichs, N. (2016). Rank tests from partially ordered data using importance and MCMC sampling methods. Statistical Science, 31, 325–347.
- Peyhardi, J., Trottier, C. and Gu´edon, Y. (2016). Partitioned conditional generalized linear models for categorical responses. Statistical Mododelling 16, 297–321.
- Pocock, S. J., Ariti, C. A., Collier, T. J. and Wang, D. (2012). The win ratio: a new approach to the analysis of composite endpoints in clinical trials based on clinical priorities. European Heart Journal 33, 176–182.
- Rosenbaum, P. R. (1991). Some poset statistics. Annals of Statistics 19, 1091–1097.
- Thas, O., Neve, J. D.,Clement, L. and Ottoy, J. P. (2012). Probabilistic index models. Journal of the Royal Statistical Society, Series B, 74, 623–671.
- Trotter, W. T. (1992). Combinatorics and Partially Ordered Sets: Dimension Theory. Baltimore: Johns Hopkins University Press.
- van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge: Cambridge University Press.
- van der Vaart, A. W. and Wellner, J. A. (1996). Weak Convergence and Empirical Processes. New York: Springer-Verlag.
- Vermeulen, K., Amorim, G., De Neve, J., Thas, O. and Vansteelandt, S.
- (2023). Semiparametric estimation of probabilistic index models: efficiency and bias. Statistica Sinica, 33, 1003-1024.
- Wittkowski, K. M., Lee, E., Nussbaum, R., Chamian, F. N. and Krueger, J. G. (2004). Combining several ordinal measures in clinical studies. Statistics in Medicine 23, 1579– 1592.
- Zhang, Q. and Ip, E. H. (2012). Generalized linear model for partially ordered data. Statistics in Medicine 31, 56–68.
Acknowledgments
This research was supported by the U.S. National Institutes of Health grant
R01HL149875 and National Science Foundation grant DMS-2015526.
Supplementary Materials
includes technical results and additional numerical
studies. An R-package poset that implements the proposed methodology
is available on GitHub at https://lmaowisc.github.io/poset as well as
the Comprehensive R Archive Network (CRAN), both with a tutorial based
on the liver study in Section 5.1.