Testing for Treatment Effect in Multitreatment Case

Pier Luigi Conti, Livia De Giovanni and Ayoub Mounim

doi:10.5705/ss.202024.0255

Abstract

In this paper the problem of testing for the presence/absence of a (multi-level)treatment

effect is considered. A new test-statistic, essentially based on the same principles as the classical

Kruskal-Wallis test, is introduced, and its theoretical properties are studied. Test-statistics for

stochastic dominance problems are also studied. The good behaviour of the proposed test in

terms of both significance level and power, with respect to other commonly used test procedures, is showed through a simulation study. Finally, an application to real data is provided.

Key words and phrases: Kruskal-Wallis, Multitreatment, Nonparametrics, Stochastic domi- nance

Information

Preprint No.	SS-2024-0255
Manuscript ID	SS-2024-0255
Complete Authors	Pier Luigi Conti, Livia De Giovanni, Ayoub Mounim
Corresponding Authors	Livia De Giovanni
Emails	ldegiovanni@luiss.it

References

Abadie, A. (2002). Bootstrap Tests for Distributional Treatment Effects in Instrumental Variable Models. Journal of the American Statistical Association 97, 284–292.
Anderson, G. (1996). Nonparametric Tests of Stochastic Dominance in Income Distribution. Econometrica 64, 1183–1193.
Cheng, L., R. Guo, K. Shu, and H. Liu (2021). Causal understanding of fake news dissemination on social media. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. ACM.
Conti, P. L. and L. De Giovanni (2022). Testing for the presence of treatment effect under selection on observables. AStA Advances in Statistical Analysis, DOI https://doi.org/10.1007/s10182–022–00454–8.
Crump, R., V. J. Hotz, G. W. Imbens, and O. Mitnik (2009). Dealing with limited overlap in estimation of average treatment effects. Biometrika 96, 187–199.
Ding, P. (2017). A Paradox from Randomization-Based Causal Inference. Statistical Science 32, 331–345.
Donald, S. G. and Y. C. Hsu (2014). Estimation and inference for distribution functions and quantile functions in treatment effect models. Journal of Econometrics 178, 383–397.
Donald, S. G. and Y. C. Hsu (2016). Improving the power of tests of stochastic dominance. Econometric Reviews 35, 553–585.
Firpo, S. (2007). Efficient Semiparametric Estimation of Quantile Treatment Effect. Econometrica 75, 259– 276.
Grover, A. and J. Leskovec (2016). Node2vec: Scalable feature learning for networks. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, New
York, NY, USA, pp. 855–864. Association for Computing Machinery.
Hettmansperger, T. P. and J. W. McKean (2011). Robust Nonparametric Statistical Methods II Ed. Boca Raton: CRC Press.
Khosla, M., J. Leonhardt, W. Nejdl, and A. Anand (2019). Node representation learning for directed graphs. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 395–411. Springer.
Kim, K. (2013). An Alternative Efficient Estimation of Average Treatment Effects. Journal of Market Economy 42, 1–41.
Li, F. and F. Li (2019). Propensity Score Weighting for Causal Inference with Multiple Treatments. The Annals of Applied Statistics 13(4), pp. 2389–2415.
Li, F., K. L. Morgan, and A. M. Zaslavsky (2018). Balancing covariates via propensity score weighting. Journal of the American Statistical Association 113, 390–400.
Li, F., L. E. Thomas, and F. Li (2019). Addressing Extreme Propensity Scores via the Overlap Weights. American Journal of Epidemiology 188, 250–257.
Lopez, M. J. and R. Gutman (2017). Estimation of Causal Effects with Multiple Treatments: A Review and New Ideas. Statistical Science 32(3), 432 – 454.
McFadden, D. (1989). Testing for stochastic dominance. In Studies in the economics of uncertainty: In honor of Josef Hadar, pp. 113–134. Springer.
Mikolov, T., K. Chen, G. S. Corrado, and J. Dean (2013). Efficient estimation of word representations in vector space. In International Conference on Learning Representations.
Perozzi, B., R. Al-Rfou, and S. Skiena (2014). Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data
Mining, KDD ’14, New York, NY, USA, pp. 701–710. Association for Computing Machinery.
Politis, D. N. and J. P. Romano (1994). Sample Confidence Regions Based on Subsamples under Minimal Assumptions. The Annals of Statistics 22, 2031–2050.
Shu, K., D. Mahudeswaran, S. Wang, D. Lee, and H. Liu (2020). Fakenewsnet: A data repository with news content, social context and dynamic information for studying fake news on social media. Journal of Big Data 8(3), 171–188.
St¨urmer, T., K. J. Rothman, and J. A. R. J. Glynn (2010). Treatment effects in the presence of unmeasured confounding: dealing with observations in the tails of the propensity score distribution—a simulation study. American Journal of Epidemiology 172, 843–854.
van der Vaart, A. (1998). Asymptotic Statistics. Cambridge: Cambridge University Press.
Wu, J. and P. Ding (2021). Randomization tests for weak null hypotheses in randomized experiments. Journal of the American Statistical Association 116(536), 1898–1913.
Yang, S., G. W. Imbens, Z. Cui, D. E. Faries, and Z. Kadziola (2016). Propensity score matching and subclassification in observational studies with multi-level treatments. Biometrics 72(4), 1055–1065.

Acknowledgments

The research of Livia De Giovanni and Ayoub Mounim is supported by the IDMO

program of the Italian Digital Media Observatory, under project No. 101158697 funded

by the European Digital Executive Agency (HADEA) Grant Agreement.

Supplementary Materials

The online Supplementary Material contains details on the estimation of the generalized propensity score, the proofs of the main results and the in-depth description of

Simulation Study 2.

Supplementary materials are available for download.

[1] Abadie, A. (2002). Bootstrap Tests for Distributional Treatment Effects in Instrumental Variable Models. Journal of the American Statistical Association 97, 284–292.

[2] Anderson, G. (1996). Nonparametric Tests of Stochastic Dominance in Income Distribution. Econometrica 64, 1183–1193.

[3] Cheng, L., R. Guo, K. Shu, and H. Liu (2021). Causal understanding of fake news dissemination on social media. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. ACM.

[4] Conti, P. L. and L. De Giovanni (2022). Testing for the presence of treatment effect under selection on observables. AStA Advances in Statistical Analysis, DOI https://doi.org/10.1007/s10182–022–00454–8.

[5] Crump, R., V. J. Hotz, G. W. Imbens, and O. Mitnik (2009). Dealing with limited overlap in estimation of average treatment effects. Biometrika 96, 187–199.

[6] Ding, P. (2017). A Paradox from Randomization-Based Causal Inference. Statistical Science 32, 331–345.

[7] Donald, S. G. and Y. C. Hsu (2014). Estimation and inference for distribution functions and quantile functions in treatment effect models. Journal of Econometrics 178, 383–397.

[8] Donald, S. G. and Y. C. Hsu (2016). Improving the power of tests of stochastic dominance. Econometric Reviews 35, 553–585.

[9] Firpo, S. (2007). Efficient Semiparametric Estimation of Quantile Treatment Effect. Econometrica 75, 259– 276.

[10] Grover, A. and J. Leskovec (2016). Node2vec: Scalable feature learning for networks. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, New

[11] York, NY, USA, pp. 855–864. Association for Computing Machinery.

[12] Hettmansperger, T. P. and J. W. McKean (2011). Robust Nonparametric Statistical Methods II Ed. Boca Raton: CRC Press.

[13] Khosla, M., J. Leonhardt, W. Nejdl, and A. Anand (2019). Node representation learning for directed graphs. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 395–411. Springer.

[14] Kim, K. (2013). An Alternative Efficient Estimation of Average Treatment Effects. Journal of Market Economy 42, 1–41.

[15] Li, F. and F. Li (2019). Propensity Score Weighting for Causal Inference with Multiple Treatments. The Annals of Applied Statistics 13(4), pp. 2389–2415.

[16] Li, F., K. L. Morgan, and A. M. Zaslavsky (2018). Balancing covariates via propensity score weighting. Journal of the American Statistical Association 113, 390–400.

[17] Li, F., L. E. Thomas, and F. Li (2019). Addressing Extreme Propensity Scores via the Overlap Weights. American Journal of Epidemiology 188, 250–257.

[18] Lopez, M. J. and R. Gutman (2017). Estimation of Causal Effects with Multiple Treatments: A Review and New Ideas. Statistical Science 32(3), 432 – 454.

[19] McFadden, D. (1989). Testing for stochastic dominance. In Studies in the economics of uncertainty: In honor of Josef Hadar, pp. 113–134. Springer.

[20] Mikolov, T., K. Chen, G. S. Corrado, and J. Dean (2013). Efficient estimation of word representations in vector space. In International Conference on Learning Representations.

[21] Perozzi, B., R. Al-Rfou, and S. Skiena (2014). Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data

[22] Mining, KDD ’14, New York, NY, USA, pp. 701–710. Association for Computing Machinery.

[23] Politis, D. N. and J. P. Romano (1994). Sample Confidence Regions Based on Subsamples under Minimal Assumptions. The Annals of Statistics 22, 2031–2050.

[24] Shu, K., D. Mahudeswaran, S. Wang, D. Lee, and H. Liu (2020). Fakenewsnet: A data repository with news content, social context and dynamic information for studying fake news on social media. Journal of Big Data 8(3), 171–188.

[25] St¨urmer, T., K. J. Rothman, and J. A. R. J. Glynn (2010). Treatment effects in the presence of unmeasured confounding: dealing with observations in the tails of the propensity score distribution—a simulation study. American Journal of Epidemiology 172, 843–854.

[26] van der Vaart, A. (1998). Asymptotic Statistics. Cambridge: Cambridge University Press.

[27] Wu, J. and P. Ding (2021). Randomization tests for weak null hypotheses in randomized experiments. Journal of the American Statistical Association 116(536), 1898–1913.

[28] Yang, S., G. W. Imbens, Z. Cui, D. E. Faries, and Z. Kadziola (2016). Propensity score matching and subclassification in observational studies with multi-level treatments. Biometrics 72(4), 1055–1065.