Back To Index Previous Article Next Article Full Text

Statistica Sinica 25 (2015), 1081-1106

SPECTRAL CLUSTERING IN HETEROGENEOUS
NETWORKS
Srijan Sengupta and Yuguo Chen
University of Illinois at Urbana-Champaign

Abstract: Many real-world systems consist of several types of entities, and heterogeneous networks are required to represent such systems. However, the current statistical toolbox for network data can only deal with homogeneous networks, where all nodes are supposed to be of the same type. This article introduces a statistical framework for community detection in heterogeneous networks. For modeling heterogeneous networks, we propose heterogeneous versions of both the classical stochastic blockmodel and the degree-corrected blockmodel. For community detection, we formulate heterogeneous versions of standard spectral clustering and regularized spectral clustering. We demonstrate the theoretical accuracy of the proposed heterogeneous methods for networks generated from the proposed heterogeneous models. Our simulations establish the superiority of proposed heterogeneous methods over existing homogeneous methods in finite networks generated from the models. An analysis of the DBLP four-area data demonstrates the improved accuracy of the heterogeneous method over the homogeneous method in identifying research areas for authors.

Key words and phrases: Clustering, community detection, degree-corrected blockmodel, heterogeneous network, network analysis, regularized spectral clustering, spectral clustering, stochastic blockmodel.

Back To Index Previous Article Next Article Full Text