Back To Index Previous Article Next Article Full Text

Statistica Sinica 29 (2019), 185-201

ON THE EFFICIENCY OF ONLINE APPROACH TO
NONPARAMETRIC SMOOTHING OF BIG DATA
Efang Kong1 and Yingcun Xia2,1
1 University of Electronic Science and Technology of China
and 2National University of Singapore

Abstract: The online updating approach (ONLINE) has been commonly used for the analysis of big data and online transient data. We consider in this paper how to improve its efficiency for various ONLINE kernel-based nonparametric estimators. Our findings include: (i) the optimal choice concerning the bandwidth and how it differs from that for the classical estimators; (ii) the optimal choice among a general class of sequential updating schemes; (iii) that the relative efficiencies of ONLINE Parzen-Rosenblatt density estimation or Nadaraya-Waston (N-W) regression estimation change with the dimension p of covariate in a nonlinear manner, and (iv) that while the classical local-linear fitting renders the estimators design-adaptive, their ONLINE counterparts still depend on the design of covariates in its leading terms of bias, they are still preferred over the ONLINE N-W estimators.

Key words and phrases: Big data, kernel density estimation, N-W estimation, online updating estimation, varying coefficient model.

Back To Index Previous Article Next Article Full Text