Back To Index Previous Article Next Article Full Text

Statistica Sinica 30 (2020), 1419-1436

NETWORK IMPUTATION FOR A SPATIAL
AUTOREGRESSION MODEL WITH INCOMPLETE DATA
Zhimeng Sun and Hansheng Wang
Central University of Finance and Economics and Peking University

Abstract: Numerous imputation methods have been developed for missing data. However, these methods apply mainly to independent data, and the assumption of independence disregards connections of units through social relationships (e.g., friendship, follower-followee relationship). In fact, observed responses from connected friends should provide valuable information for missing responses. This motivates us to conduct an imputation by borrowing information from connected friends using a network structure. With the missing-at-random assumption and using observed information only, we propose a partial likelihood approach and develop the corresponding maximum partial likelihood estimator (MPLE). The estimator's consistency and asymptotic normality are established. Using the MPLE, we then develop a novel regression imputation method. The method utilizes both auxiliary information and connected complete units (i.e., network information); using the imputed data, we can compute the sample mean of the responses. We show this method to be consistent and asymptotically normal. Compared with the imputation method using auxiliary information only (i.e., ignoring network information), the proposed estimator is statistically more efficient. Extensive simulation studies are conducted to demonstrate the finite-sample performance of the proposed method. We then analyze a real example about QQ in mainland China.

Key words and phrases: Incomplete data, network imputation, QQ, spatial autoregression.

Back To Index Previous Article Next Article Full Text