TY  - THES
AU  - Mao, Jiadong
Y2  - 2020/04/22
Y1  - 2020
UR  - http://hdl.handle.net/11343/237540
AB  - Streaming data are a type of high-frequency and nonstationary time series data. The collection of streaming data is sequential and potentially never-ending. Examples of streaming data, including data from sensor networks, mobile devices and the Internet, are prevalent in our daily lives. An estimator for streaming data needs to be computationally efficient so that it is relatively easy to update the estimator using newly arrived data. In addition, the estimator has to be adaptive to the nonstationarity of data. These constraints make streaming data analysis more challenging than analysing the conventional non-streaming data sets. Although streaming data analysis has been discussed in the machine learning community for more than two decades, it has received limited attention from statistical researchers. Estimation methods that are both computationally efficient and theoretically justified are still lacking. In this thesis, we propose nonparametric density and regression estimation methods for streaming data, where the smoothing parameters are chosen in a computationally efficient and fully data-driven way. These methods extend some classical kernel smoothing techniques, such as the kernel density estimator and the Nadaraya-Watson regression estimator, to address the theoretical and computational challenges arising from streaming data analysis. Asymptotic analyses provide these methods with theoretical justification. Numerical studies have shown the superiority of our methods over conventional ones. Through some real-data examples, we show that these methods are potentially useful in modelling real-world problems. Finally, we discuss some directions for future research, including extending these methods to model higher-dimensional streaming data and to streaming data classification.
KW  - streaming data
KW  - kernel density estimation
KW  - kernel regression estimation
KW  - online modelling
KW  - nonstationary data
KW  - concept drift
T1  - Nonparametric estimation for streaming data
L1  - /bitstream/handle/11343/237540/3ec9f13c-ab48-ea11-94b5-0050568d7800_Jiadong-Mao-PhD-Thesis.pdf?sequence=1&isAllowed=n
ER  -