Abstract
Hierarchical factor models, which include the bifactor model as a special case, are useful in
social and behavioural sciences for measuring hierarchically structured constructs. Specifying a hierarchical factor model involves imposing hierarchically structured zero constraints on a factor loading
matrix, which is often challenging.
Therefore, an exploratory analysis is needed to learn the hierarchical factor structure from data. Unfortunately, there does not exist an identifiability theory for
the learnability of this hierarchical structure, nor a computationally efficient method with provable
performance. The method of Schmid-Leiman transformation, which is often regarded as the default
method for exploratory hierarchical factor analysis, is flawed and likely to fail. The contribution of
this paper is three-fold.
First, an identifiability result is established for general hierarchical factor
models, which shows that the hierarchical factor structure is learnable under mild regularity conditions. Second, a computationally efficient divide-and-conquer approach is proposed for learning the
hierarchical factor structure. Finally, asymptotic theory is established for the proposed method, showing that it can consistently recover the true hierarchical factor structure as the sample size grows to
infinity. The power of the proposed method is shown via simulation studies and a real data application to a personality test. The computation code for the proposed method is publicly available at
Information
| Preprint No. | SS-2025-0196 |
|---|---|
| Manuscript ID | SS-2025-0196 |
| Complete Authors | Jiawei Qiao, Yunxiao Chen, Zhiliang Ying |
| Corresponding Authors | Yunxiao Chen |
| Emails | y.chen186@lse.ac.uk |
References
- Anderson, T. (2003). An Introduction to Multivariate Statistical Analysis. Wiley New York.
- Anderson, T. and H. Rubin (1956). Statistical inference in factor analysis. In Proceedings of the Berkeley Symposium on Mathematical Statistics and Probability, pp. 111 – 150. University of California Press.
- Bertsekas, D. P. (2014). Constrained Optimization and Lagrange Multiplier Methods. Academic Press.
- Browne, M. W. (2001). An overview of analytic rotation in exploratory factor analysis. Multivariate Behavioral Research 36(1), 111–150.
- Brunner, M., G. Nagy, and O. Wilhelm (2012). A tutorial on hierarchically structured constructs. Journal of Personality 80(4), 796–846.
- Carroll, J. B. (1993). Human Cognitive Abilities: A survey of Factor-Analytic Studies. Cambridge University Press. Exploratory Hierarchical Factor Analysis
- Chen, F. F., S. G. West, and K. H. Sousa (2006). A comparison of bifactor and second-order models of quality of life. Multivariate Behavioral Research 41(2), 189–225.
- Chen, Y., X. Li, and S. Zhang (2019). Joint maximum likelihood estimation for highdimensional exploratory item factor analysis. Psychometrika 84(1), 124–146.
- Chen, Y., X. Li, and S. Zhang (2020). Structured latent factor analysis for large-scale data: Identifiability, estimability, and their implications. Journal of the American Statistical Association 115(532), 1756–1770.
- Cui, C. and G. Xu (2025). Identifiability and inference for generalized latent factor models. arXiv preprint arXiv:2508.05866.
- DeYoung, C. G. (2006). Higher-order factors of the big five in a multi-informant sample. Journal of Personality and Social Psychology 91(6), 1138.
- Fabrigar, L. R. and D. T. Wegener (2012). Exploratory Factor Analysis. Oxford University Press.
- Fang, G., J. Guo, X. Xu, Z. Ying, and S. Zhang (2021). Identifiability of bifactor models. Statistica Sinica 31, 2309–2330.
- Holzinger, K. J. and F. Swineford (1937). The bi-factor method. Psychometrika 2(1), 41–54.
- Jennrich, R. I. and P. M. Bentler (2011). Exploratory bi-factor analysis. Psychometrika 76, 537–549.
- Johnson, J. A. (2014). Measuring thirty facets of the five factor model with a 120-item public domain inventory: Development of the IPIP-NEO-120. Journal of Research in Personality 51, 78–89.
- Kose, M. A., C. Otrok, and C. H. Whiteman (2008). Understanding the evolution of world business cycles. Journal of International Economics 75(1), 110–130.
- Moench, E., S. Ng, and S. Potter (2013). Dynamic hierarchical factor models. Review of Economics and Statistics 95(5), 1811–1817. Exploratory Hierarchical Factor Analysis
- Qiao, J., Y. Chen, and Z. Ying (2025). Exact exploratory bi-factor analysis: A constraintbased optimization approach. Psychometrika 90(3), 998–1013.
- Reise, S. P., J. Morizot, and R. D. Hays (2007). The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Quality of Life Research 16, 19–31.
- Revelle, W. and J. Wilt (2013). The general factor of personality: A general critique. Journal of Research in Personality 47(5), 493–504.
- Schmid, J. and J. M. Leiman (1957). The development of hierarchical factor solutions. Psychometrika 22(1), 53–61.
- Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics 6, 461–464.
- Sharma, P., B. Sivakumaran, and G. Mohan (2022). Using Schmid–Leiman solution with higher-order constructs in marketing research. Marketing Intelligence & Planning 40(4), 513–526.
- Thomson, G. H. (1939). The factorial analysis of human ability. Houghton Mifflin.
- Thurstone, L. L. (1947). Multiple Factor Analysis. University of Chicago Press.
- Yung, Y.-F., D. Thissen, and L. D. McLeod (1999). On the relationship between the higherorder factor model and the hierarchical factor model. Psychometrika 64, 113–128.
- Zheng, X., B. Aragam, P. K. Ravikumar, and E. P. Xing (2018). Dags with NO TEARS: Continuous optimization for structure learning. Advances in Neural Information Processing Systems 31. Author Information
Acknowledgments
The authors would like to thank the editor, the associate editor, and the reviewers for their
constructive and valuable comments, which have substantially improved the manuscript.