Abstract

Hierarchical factor models, which include the bifactor model as a special case, are useful in

social and behavioural sciences for measuring hierarchically structured constructs. Specifying a hierarchical factor model involves imposing hierarchically structured zero constraints on a factor loading

matrix, which is often challenging.

Therefore, an exploratory analysis is needed to learn the hierarchical factor structure from data. Unfortunately, there does not exist an identifiability theory for

the learnability of this hierarchical structure, nor a computationally efficient method with provable

performance. The method of Schmid-Leiman transformation, which is often regarded as the default

method for exploratory hierarchical factor analysis, is flawed and likely to fail. The contribution of

this paper is three-fold.

First, an identifiability result is established for general hierarchical factor

models, which shows that the hierarchical factor structure is learnable under mild regularity conditions. Second, a computationally efficient divide-and-conquer approach is proposed for learning the

hierarchical factor structure. Finally, asymptotic theory is established for the proposed method, showing that it can consistently recover the true hierarchical factor structure as the sample size grows to

infinity. The power of the proposed method is shown via simulation studies and a real data application to a personality test. The computation code for the proposed method is publicly available at

Key words and phrases: Hierarchical factor model, augmented Lagrangian method, exploratory hier- archical factor analysis 1 Introduction Many constructs in social and behavioural sciences are conceptualized to be hierarchically structured, such as psychological traits (e.g., Carroll, 1993; DeYoung, 2006), economic factors (e.g., Kose et al., 2008; Moench et al., 2013), health outcomes measures (e.g., Chen et al., 2006; Reise et al 2007) and constructs in marketing research (e g Sharma et al 2022) Exploratory Hierarchical Factor Analysis Hierarchical factor models (Brunner et al., 2012; Schmid and Leiman, 1957; Thomson, 1939; Yung et al., 1999), which include the bi-factor model (Holzinger and Swineford, 1937) as a special case with two factor layers, are commonly used to measure hierarchically structured constructs. In these models, hierarchically structured zero constraints are imposed on factor loadings to define the hierarchical factors. When the hierarchical factor structure is known or hypothesized a priori, the statistical inference of a hierarchical factor model only requires standard confirmatory factor analysis techniques (Brunner et al., 2012). However, for many real-world scenarios, little prior information about the hierarchical factor structure is avail- able, so we need to learn this structure from data. This analysis is referred to as exploratory hierarchical factor analysis

Information

Preprint No.SS-2025-0196
Manuscript IDSS-2025-0196
Complete AuthorsJiawei Qiao, Yunxiao Chen, Zhiliang Ying
Corresponding AuthorsYunxiao Chen
Emailsy.chen186@lse.ac.uk

References

  1. Anderson, T. (2003). An Introduction to Multivariate Statistical Analysis. Wiley New York.
  2. Anderson, T. and H. Rubin (1956). Statistical inference in factor analysis. In Proceedings of the Berkeley Symposium on Mathematical Statistics and Probability, pp. 111 – 150. University of California Press.
  3. Bertsekas, D. P. (2014). Constrained Optimization and Lagrange Multiplier Methods. Academic Press.
  4. Browne, M. W. (2001). An overview of analytic rotation in exploratory factor analysis. Multivariate Behavioral Research 36(1), 111–150.
  5. Brunner, M., G. Nagy, and O. Wilhelm (2012). A tutorial on hierarchically structured constructs. Journal of Personality 80(4), 796–846.
  6. Carroll, J. B. (1993). Human Cognitive Abilities: A survey of Factor-Analytic Studies. Cambridge University Press. Exploratory Hierarchical Factor Analysis
  7. Chen, F. F., S. G. West, and K. H. Sousa (2006). A comparison of bifactor and second-order models of quality of life. Multivariate Behavioral Research 41(2), 189–225.
  8. Chen, Y., X. Li, and S. Zhang (2019). Joint maximum likelihood estimation for highdimensional exploratory item factor analysis. Psychometrika 84(1), 124–146.
  9. Chen, Y., X. Li, and S. Zhang (2020). Structured latent factor analysis for large-scale data: Identifiability, estimability, and their implications. Journal of the American Statistical Association 115(532), 1756–1770.
  10. Cui, C. and G. Xu (2025). Identifiability and inference for generalized latent factor models. arXiv preprint arXiv:2508.05866.
  11. DeYoung, C. G. (2006). Higher-order factors of the big five in a multi-informant sample. Journal of Personality and Social Psychology 91(6), 1138.
  12. Fabrigar, L. R. and D. T. Wegener (2012). Exploratory Factor Analysis. Oxford University Press.
  13. Fang, G., J. Guo, X. Xu, Z. Ying, and S. Zhang (2021). Identifiability of bifactor models. Statistica Sinica 31, 2309–2330.
  14. Holzinger, K. J. and F. Swineford (1937). The bi-factor method. Psychometrika 2(1), 41–54.
  15. Jennrich, R. I. and P. M. Bentler (2011). Exploratory bi-factor analysis. Psychometrika 76, 537–549.
  16. Johnson, J. A. (2014). Measuring thirty facets of the five factor model with a 120-item public domain inventory: Development of the IPIP-NEO-120. Journal of Research in Personality 51, 78–89.
  17. Kose, M. A., C. Otrok, and C. H. Whiteman (2008). Understanding the evolution of world business cycles. Journal of International Economics 75(1), 110–130.
  18. Moench, E., S. Ng, and S. Potter (2013). Dynamic hierarchical factor models. Review of Economics and Statistics 95(5), 1811–1817. Exploratory Hierarchical Factor Analysis
  19. Qiao, J., Y. Chen, and Z. Ying (2025). Exact exploratory bi-factor analysis: A constraintbased optimization approach. Psychometrika 90(3), 998–1013.
  20. Reise, S. P., J. Morizot, and R. D. Hays (2007). The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Quality of Life Research 16, 19–31.
  21. Revelle, W. and J. Wilt (2013). The general factor of personality: A general critique. Journal of Research in Personality 47(5), 493–504.
  22. Schmid, J. and J. M. Leiman (1957). The development of hierarchical factor solutions. Psychometrika 22(1), 53–61.
  23. Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics 6, 461–464.
  24. Sharma, P., B. Sivakumaran, and G. Mohan (2022). Using Schmid–Leiman solution with higher-order constructs in marketing research. Marketing Intelligence & Planning 40(4), 513–526.
  25. Thomson, G. H. (1939). The factorial analysis of human ability. Houghton Mifflin.
  26. Thurstone, L. L. (1947). Multiple Factor Analysis. University of Chicago Press.
  27. Yung, Y.-F., D. Thissen, and L. D. McLeod (1999). On the relationship between the higherorder factor model and the hierarchical factor model. Psychometrika 64, 113–128.
  28. Zheng, X., B. Aragam, P. K. Ravikumar, and E. P. Xing (2018). Dags with NO TEARS: Continuous optimization for structure learning. Advances in Neural Information Processing Systems 31. Author Information

Acknowledgments

The authors would like to thank the editor, the associate editor, and the reviewers for their

constructive and valuable comments, which have substantially improved the manuscript.