Abstract
This paper presents a unified rank-based inferential procedure for fitting
the accelerated failure time model to partially interval-censored data. A Gehantype monotone estimating function is constructed based on the idea of the familiar
weighted log-rank test, and an extension to a general class of rank-based estimating
functions is suggested. The proposed estimators can be obtained via linear programming and are shown to be consistent and asymptotically normal via standard
empirical process theory.
Unlike common maximum likelihood-based estimators
for partially interval-censored regression models, our approach can directly provide a regression coefficient estimator without involving a complex nonparametric
estimation of the underlying residual distribution function. An efficient variance estimation procedure for the regression coefficient estimator is considered. Moreover,
we extend the proposed rank-based procedure to the linear regression analysis of
multivariate clustered partially interval-censored data. The finite-sample operating
characteristics of our approach are examined via simulation studies. Data example from a colorectal cancer study illustrates the practical usefulness of the method.
Information
| Preprint No. | SS-2024-0003 |
|---|---|
| Manuscript ID | SS-2024-0003 |
| Complete Authors | Taehwa Choi, Sangbum Choi, Dipankar Bandyopadhyay |
| Corresponding Authors | Dipankar Bandyopadhyay |
| Emails | bandyopd@gmail.com |
References
- Anderson-Bergman, C. (2017). icenReg: Regression models for interval censored data in R. Journal of Statistical Software 81(12), 1–23.
- Bogaerts, K., A. Kom´arek, and E. Lesaffre (2017). Survival Analysis with Interval Censored Data: A Practical Approach with Examples in R, SAS, and BUGS. London: Chapman & Hall/CRC.
- Buckley, J. and I. James (1979). Linear regression with censored data. Biometrika 66(3), 429–436.
- Cai, T. and S. Cheng (2004). Semiparametric regression analysis for doubly censored data. Biometrika 91(2), 277–290.
- Chen, K., Z. Jin, and Z. Ying (2002). Semiparametric analysis of transformation models with censored data. Biometrika 89(3), 659–668.
- Choi, S. and X. Huang (2021). Efficient inferences for linear transformation models with doubly censored data. Communications in Statistics-Theory and Methods 50(9), 2188–2200.
- Choi, T. and S. Choi (2021). A fast algorithm for the accelerated failure time model with highdimensional time-to-event data. Journal of Statistical Computation and Simulation 91(16), 3385–3403.
- Choi, T., A. K. Kim, and S. Choi (2021). Semiparametric least-squares regression with doublycensored data. Computational Statistics & Data Analysis. 164, 107306.
- Fygenson, M. and Y. Ritov (1994). Monotone estimating equations for censored data. The Annals of Statistics 22(2), 732–746.
- Gao, F., D. Zeng, and D. Y. Lin (2017). Semiparametric estimation of the accelerated failure time model with partly interval-censored data. Biometrics 73(4), 1161–1168.
- Gehan, E. A. (1965). A generalized two-sample Wilcoxon test for doubly censored data. Biometrika 52(3/4), 650–653.
- Groeneboom, P. and K. Hendrickx (2018). Current status linear regression. The Annals of Statistics 46(4), 1415–1444.
- Jin, Z., D. Y. Lin, L. J. Wei, and Z. Ying (2003). Rank-based inference for the accelerated failure time model. Biometrika 90(2), 341–353.
- Jin, Z., D. Y. Lin, and Z. Ying (2006a).
- On least-squares regression with censored data. Biometrika 93(1), 147–161.
- Jin, Z., D. Y. Lin, and Z. Ying (2006b). Rank regression analysis of multivariate failure time data based on marginal linear models. Scandinavian Journal of Statistics 33(1), 1–23.
- Kim, J. S. (2003). Maximum likelihood estimation for the proportional hazards model with partly interval-censored data. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 65(2), 489–502.
- Kim, Y.-J. (2010). Regression analysis of clustered interval-censored data with informative cluster size. Statistics in Medicine 29(28), 2956–2962.
- Koenker, R. (2008). Censored quantile regression redux. Journal of Statistical Software 27(6), 1–25.
- Lai, T. L. and Z. Ying (1991). Large sample theory of a modified buckley-james estimator for regression analysis with censored data. The Annals of Statistics 19(3), 1370–1402.
- Lam, K. F., C. Y. Lee, K. Y. Wong, and D. Bandyopadhyay (2021). Marginal analysis of current status data with informative cluster size using a class of semiparametric transformation cure models. Statistics in Medicine 40(10), 2400–2412.
- Li, L. and Z. Pu (2003). Rank estimation of log-linear regression with interval-censored data. Lifetime Data Analysis 9, 57–70.
- Lin, Y. and K. Chen (2013). Efficient estimation of the censored linear regression model. Biometrika 100(2), 525–530.
- Pan, C., B. Cai, and L. Wang (2020). A Bayesian approach for analyzing partly intervalcensored data under the proportional hazards model. Statistical Methods in Medical Research 29(11), 3192–3204.
- Peeters, M., T. Price, A. Cervantes, et al. (2010). Randomized phase III study of panitumumab with fluorouracil, leucovorin, and irinotecan (FOLFIRI) compared with FOLFIRI alone as second-line treatment in patients with metastatic colorectal cancer. Journal of Clinical Oncology 28(31), 4706–4713.
- Sun, J. and X. Zhao (2013). Statistical Analysis of Panel Count Data. New York: Springer.
- Tian, L. and T. Cai (2006). On the accelerated failure time model for current status and interval censored data. Biometrika 93, 329–342.
- Turnbull, B. W. (1974). Nonparametric estimation of a survivorship function with doubly censored data. Journal of the American Statistical Association 69(345), 169–173.
- van der Vaart, A. and J. Wellner (1996). Weak Convergence and Empirical Processes: With Applications to Statistics. New York: Springer.
- Wang, S., B. Nan, J. Zhu, and D. G. Beer (2008). Doubly penalized Buckley–James method for survival data with high-dimensional covariates. Biometrics 64(1), 132–140.
- Wellner, J. A. and Y. Zhan (1997). A hybrid algorithm for computation of the nonparametric maximum likelihood estimator from censored data. Journal of the American Statistical Association 92(439), 945–959.
- Xu, Y., D. Zeng, and D. Lin (2023). Marginal proportional hazards models for multivariate interval-censored data. Biometrika 110(3), 815–830.
- Zeng, D. and D. Y. Lin (2008). Efficient resampling methods for nonsmooth estimating functions. Biostatistics 9(2), 355–363.
- Zhang, J. and D. F. Heitjan (2006). A simple local sensitivity analysis tool for nonignorable coarsening: application to dependent censoring. Biometrics 62(4), 1260–1268.
- Zhao, X., Q. Zhao, J. Sun, and J. S. Kim (2008). Generalized log-rank tests for partly intervalcensored failure time data. Biometrical Journal 50(3), 375–385.
Acknowledgments
The colorectal cancer data was derived based on raw data sets obtained from
Data Sphere, LLC. The authors acknowledge support from grant (RS-2024-
00340298) from the National Research Foundation of S. Korea (T. Choi),
grants (2022R1A2C1008514, 2022M3J6A1063595) from the National Research Foundation of S. Korea and grant (K2305261) from Korea University
(S. Choi), and grants (P20CA252717, P20CA264067, and R21DE031879)
from the United States National Institutes of Health (D. Bandyopadhyay).
Supplementary Materials
Proofs of Theorems 1 and 2, details on estimation under the DC setup, and
additional simulation studies are relegated to the Supplementary Material.