Abstract
High-dimensional compositional data are increasingly prevalent across
diverse fields of modern scientific research. Regression analysis involving compositional data presents unique challenges, particularly when covariate measure-
ment errors are present. These errors can propagate across composition components due to their inherent dependency structure, complicating the application
of conventional error-in-variables regression techniques. To simultaneously address the compositional nature and measurement errors in the high-dimensional
design matrix of compositional covariates, we propose the Error-in-Composition
(Eric) Lasso, a novel method for regression analysis with high-dimensional compositional covariates subject to measurement error.
We establish theoretical
guarantees for Eric Lasso, including estimation error bounds and asymptotic
sign-consistent variable selection properties. The finite-sample performance of
the method is demonstrated through simulation studies and a real-world appli-
The authors are listed in alphabetical order. Correspondence should be addressed
cation.
Key words and phrases: Compositional data, Error-in-variable, High-dimensional regression, Log contrast models, Lasso
Information
| Preprint No. | SS-2025-0223 |
|---|---|
| Manuscript ID | SS-2025-0223 |
| Complete Authors | Wenxi Tan, Lingzhou Xue, Songshan Yang, Xiang Zhan |
| Corresponding Authors | Lingzhou Xue |
| Emails | lzxue@psu.edu |
References
No references available.
Acknowledgments
The authors would like to thank the Co-Editor, the Associate Editor, and
the anonymous referees for their helpful suggestions and constructive comments. The research of Tan and Xue was supported by the U.S. National
Science Foundation (NSF) grant DMS-2210775 and the U.S. National Institutes of Health (NIH) grant 1R01GM152812. The research of Yang was
supported by the National Key R&D Program of China 2023YFA1008702
and the National Natural Foundation of China (NSFC) 12301389. The research of Zhan was supported by the National Natural Science Foundation
of China (grant no. 12371287).
Penn State University
Renmin University of China
Southeast University
Supplementary Materials
The online supplementary materials consist of technical proofs of theorems
and additional numerical results.