Statistica Sinica 35 (2025), 1133-1153
Abstract: Testing the independence between two high-dimensional random vectors is a fundamental and challenging problem in statistics. Most existing tests based on distance and kernel may fail to detect the non-linear dependence in the high-dimensional regime. To tackle this obstacle, this paper proposes a kernel independence test for assessing the independence between two random vectors based on a class of Gaussian projections relying on tuning parameters. The proposed test can be generally implemented for a wide class of distance-based kernels and completely characterizes dependence in the low-dimensional regime. Besides, the test captures pure non-linear dependence in the high-dimensional regime. Theoretically, we develop central limit theorem and associated rate of convergence for the proposed statistic under some mild regularity conditions and the null hypothesis. Moreover, we derive the asymptotic power of the proposed test enabling us to select suitable parameters for a special alternative, to achieve superior power in the high-dimensional regime. The choices of tuning parameters ensure that the proposed test has comparable power with the original kernel-based test in the moderately high-dimensional regime. Numerical experiments also demonstrate the satisfactory empirical performance of the proposed test in various scenarios.
Key words and phrases: High-dimension, independence test, kernel independence measure, random projections, U-statistics.