Back To Index Previous Article Next Article Full Text

Statistica Sinica 32 (2022), 1027-1048

OPTIMAL ESTIMATION OF SIMULTANEOUS
SIGNALS USING ABSOLUTE INNER PRODUCT
WITH APPLICATIONS TO INTEGRATIVE GENOMICS

Rong Ma, T. Tony Cai and Hongzhe Li

University of Pennsylvania

Abstract: Integrating the summary statistics from a genome-wide association study and expression quantitative trait loci data provides a powerful way of identifying genes with expression levels that are potentially associated with complex diseases. We introduce a parameter called T-score that quantifies the genetic overlap between a gene and the disease phenotype based on the summary statistics, based on the mean values of two Gaussian sequences. Specifically, given two independent samples xn ~ N (θ, Σ1) and yn ~ N (µ, Σ2), the T-score is defined as ΣJ32N217-11 |θiµi|, a nonsmooth functional, that characterizes the number of shared signals between two absolute normal mean vectors |θ| and |µ|. Using approximation theory, estimators are constructed and shown to be minimax rate-optimal and adaptive over various parameter spaces. Simulation studies demonstrate the superiority of the proposed estimators over existing methods. Lastly, the method is applied to an integrative analysis of heart failure genomics data sets and we identify several genes and biological pathways that are potentially causal to human heart failure.

Key words and phrases: Approximation theory, eQTL, GWAS, minimax lower bound, non-smooth functional.

Back To Index Previous Article Next Article Full Text