Abstract
Clustered effects are often encountered in multiple hypothesis testing of spatial
signals. In this paper, we propose a new method, termed two-dimensional spatial multiple
testing (2d-SMT) procedure, to control the false discovery rate (FDR) and improve the detection power by exploiting the spatial information encoded in neighboring observations. The
proposed method provides a novel perspective of utilizing spatial information by gathering
signal patterns and spatial dependence into an auxiliary statistic. 2d-SMT rejects the null
when a primary statistic at the location of interest and the auxiliary statistic constructed
based on nearby observations are greater than their corresponding cutoffs. 2d-SMT can also
be combined with different variants of the weighted BH procedures to improve the detection
power further. A fast algorithm is developed to accelerate the search for optimal cutoffs in
2d-SMT. In theory, we establish the asymptotic FDR control of 2d-SMT under weak spatial
dependence. Extensive numerical experiments demonstrate that the 2d-SMT method combined with various weighted BH procedures achieves the most competitive performance in
FDR and power trade-off.
Information
| Preprint No. | SS-2024-0152 |
|---|---|
| Manuscript ID | SS-2024-0152 |
| Complete Authors | Linsui Deng, Kejun He, Xianyang Zhang |
| Corresponding Authors | Kejun He |
| Emails | kejunhe@ruc.edu.cn |
References
- Andrews, D. W. K. (1984). Non-strong mixing autoregressive processes. J. Appl. Probab. 21(4), 930–934.
- Basu, P., T. T. Cai, K. Das, and W. Sun (2018). Weighted false discovery rate control in large-scale multiple testing. J. Amer. Statist. Assoc. 113(523), 1172–1183.
- Benjamini, Y. and R. Heller (2007). False discovery rates for spatial signals. J. Amer. Statist. Assoc. 102(480), 1272–1281.
- Benjamini, Y. and Y. Hochberg (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc., Ser. B, Methodol. 57(1), 289–300.
- Cai, T. T., W. Sun, and Y. Xia (2022). Laws: A locally adaptive weighting and screening approach to spatial multiple testing. J. Amer. Statist. Assoc. 117(539), 1370–1383.
- Cao, H., J. Chen, and X. Zhang (2022). Optimal false discovery rate control for large scale multiple testing with auxiliary information. Ann. Stat. 50(2), 807–857.
- Chin, M. et al. (1994). Relationship of ozone and carbon monoxide over north america. J. Geophys. Res. Atmos. 99(D7), 14565–14573.
- Cliff, A. D. and J. K. Ord (1981). Spatial Processes: Models and Applications. London, UK: Pion Ltd.
- Cressie, N. (1993). Statistics for Spatial Data. Wiley-Interscience. Part II: Lattice Data.
- Davidson, J. (1994, 10). Near-epoch dependence. In Stochastic Limit Theory: An Introduction for Econometricians, pp. 241–278. Oxford University Press.
- Ferreira, J. A. and A. H. Zwinderman (2006). On the benjamini–hochberg method. Ann. Stat. 34(4), 1827–1849, 23.
- Fithian, W. and L. Lei (2022). Conditional calibration for false discovery rate control under dependence. Ann. Stat. 50(6), 3091–3118.
- French, J. P. and S. R. Sain (2013). Spatio-temporal exceedance locations and confidence regions. Ann. Appl. Stat. 7(3), 1421–1449, 29.
- Genovese, C. R., K. Roeder, and L. Wasserman (2006). False discovery control with p-value weighting. Biometrika 93(3), 509–524.
- Gorodetskii, V. (1978). On the strong mixing property for linear sequences. Theory Probab. Its Appl. 22(2), 411–413.
- Han, S. et al. (2011). Analysis of the relationship between o3, no and no2 in tianjin, china. Aerosol. Air Qual. Res. 11(2), 128–139.
- Heller, R., D. Stanley, D. Yekutieli, N. Rubin, and Y. J. N. Benjamini (2006). Cluster-based analysis of fmri data. Neuroimage 33, 599–608.
- Hu, J., H. Zhao, and H. Zhou (2010). False discovery rate control with groups. J. Amer. Statist. Assoc. 105, 1215–1227.
- Huang, H.-C., N. Cressie, A. Zammit-Mangion, and G. Huang (2021). False discovery rates to detect signals from incomplete spatially aggregated data. J. Comput. Graph. Stat. 30(4), 1081–1094.
- Ignatiadis, N. and W. Huber (2021). Covariate powered cross-weighted multiple testing. J. R. Stat. Soc., Ser. B, Stat. Methodol. 83(4), 720–751.
- Ignatiadis, N., B. Klaus, J. B. Zaugg, and W. Huber (2016). Data-driven hypothesis weighting increases detection power in genome-scale multiple testing. Nat. Methods 13(7), 577–580.
- Jenish, N. (2012). Nonparametric spatial regression under near-epoch dependence. J. Econom. 167(1), 224–239.
- Jenish, N. and I. R. Prucha (2012). On spatial processes and asymptotic inference under near-epoch dependence. J. Econom. 170(1), 178–190.
- Katzfuss, M. and J. Guinness (2021). A general framework for vecchia approximations of gaussian processes. Stat Sci 36(1), 124–141.
- Kiefer, J. and J. Wolfowitz (1956). Consistency of the maximum likelihood estimator in the presence of infinitely many incidental parameters. Ann. Math. Stat. 27(4), 887–906.
- Koenker, R. and I. Mizera (2014). Convex optimization, shape constraints, compound decisions, and empirical bayes rules. J. Amer. Statist. Assoc. 109(506), 674–685.
- Lei, J., M. G’Sell, A. Rinaldo, R. J. Tibshirani, and L. Wasserman (2018). Distribution-free predictive inference for regression. J. Amer. Statist. Assoc. 113(523), 1094–1111.
- Li, A. and R. F. Barber (2019). Multiple testing with the structure-adaptive benjamini-hochberg algorithm. J. R. Stat. Soc., Ser. B, Stat. Methodol. 81(1), 45–74.
- Liu, W. et al. (2022). Stratospheric ozone depletion and tropospheric ozone increases drive southern ocean interior warming. Nat. Clim. Chang. 12(4), 365–372.
- Liu, W., Y. Ke, J. Liu, and R. Li (2022). Model-free feature screening and fdr control with knockofffeatures. J. Amer. Statist. Assoc. 117(537), 428–443.
- Mardia, K. V. and R. J. Marshall (1984). Maximum likelihood estimation of models for residual covariance in spatial regression. Biometrika 71(1), 135–146.
- Sang, H. and J. Z. Huang (2012). A full scale approximation of covariance functions for large spatial data sets. J. R. Stat. Soc., Ser. B, Stat. Methodol. 74(1), 111–132.
- Scott, J. G., R. C. Kelly, M. A. Smith, P. Zhou, and R. E. Kass (2015). False discovery rate regression: An application to neural synchrony detection in primary visual cortex. J. Amer. Statist. Assoc. 110(510), 459–471.
- Shen, X., H.-C. Huang, and N. Cressie (2002). Nonparametric hypothesis testing for a spatial signal. J. Amer. Statist. Assoc. 97(460), 1122–1140.
- Storey, J. D. (2002). A direct approach to false discovery rates. J. R. Stat. Soc., Ser. B, Methodol. 64(3), 479–498.
- Storey, J. D., J. E. Taylor, and D. Siegmund (2004). Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: A unified approach. J. R. Stat. Soc., Ser. B, Stat. Methodol. 66(1), 187–205.
- Sun, W., B. J. Reich, T. T. Cai, M. Guindani, and A. Schwartzman (2015). False discovery control in large-scale spatial multiple testing. J. R. Stat. Soc., Ser. B, Stat. Methodol. 77(1), 59–83.
- Tansey, W., O. Koyejo, R. A. Poldrack, and J. G. Scott (2018). False discovery rate smoothing. J. Amer. Statist. Assoc. 113(523), 1156–1171.
- Vingarzan, R. (2004). A review of surface ozone background levels and trends. Atmospheric Environ. 38(21), 3431–3442.
- Warneck, P. (2000). Chapter 5 ozone in the troposphere. In P. Warneck (Ed.), Chemistry of the Natural
- Atmosphere, Volume 71 of International Geophysics, pp. 211–263. Academic Press.
- Wasserman, L. and K. Roeder (2009). High dimensional variable selection. Ann. Stat. 37(5A), 2178–2201.
- Weinhold, B. (2008). Ozone nation: Epa standard panned by the people. Environ. Health Perspect. 116(7), A302–A305.
- Yi, S., X. Zhang, L. Yang, J. Huang, Y. Liu, C. Wang, D. J. Schaid, and J. Chen (2021). 2dfdr: A new approach to confounder adjustment substantially increases detection power in omics association studies. Genome Biol. 22(1), 208.
- Yun, S., X. Zhang, and B. Li (2022). Detection of local differences in spatial characteristics between two spatiotemporal random fields. J. Amer. Statist. Assoc. 117(537), 291–306.
- Zhang, C.-H. (2009). Generalized maximum likelihood estimation of normal mixture densities. Stat. Sin. 19(3), 1297–1318.
- Zhang, X. and J. Chen (2022). Covariate adaptive false discovery rate control with applications to omicswide multiple testing. J. Amer. Statist. Assoc. 117(537), 411–427.
Acknowledgments
The authors thank the editor, the associate editor, and anonymous reviewers for
their helpful comments and suggestions. This research was supported by the Public
Computing Cloud, Renmin University of China. The research of Zhang was partially
supported by NSF DMS-2113359, NIH 1R01GM144351-01 and NIH 1R21HG011662.
Supplementary Materials
The online Supplementary Material contains our proofs of Theorems 1 and 2, additional numerical results, some discussions about the estimation for the covariance
of noises, the details of Algorithm 1, and the address to download the reproducible
code of this work.