Show simple item record

dc.contributor.authorMao, Jiadong
dc.description© 2020 Jiadong Mao
dc.description.abstractStreaming data are a type of high-frequency and nonstationary time series data. The collection of streaming data is sequential and potentially never-ending. Examples of streaming data, including data from sensor networks, mobile devices and the Internet, are prevalent in our daily lives. An estimator for streaming data needs to be computationally efficient so that it is relatively easy to update the estimator using newly arrived data. In addition, the estimator has to be adaptive to the nonstationarity of data. These constraints make streaming data analysis more challenging than analysing the conventional non-streaming data sets. Although streaming data analysis has been discussed in the machine learning community for more than two decades, it has received limited attention from statistical researchers. Estimation methods that are both computationally efficient and theoretically justified are still lacking. In this thesis, we propose nonparametric density and regression estimation methods for streaming data, where the smoothing parameters are chosen in a computationally efficient and fully data-driven way. These methods extend some classical kernel smoothing techniques, such as the kernel density estimator and the Nadaraya-Watson regression estimator, to address the theoretical and computational challenges arising from streaming data analysis. Asymptotic analyses provide these methods with theoretical justification. Numerical studies have shown the superiority of our methods over conventional ones. Through some real-data examples, we show that these methods are potentially useful in modelling real-world problems. Finally, we discuss some directions for future research, including extending these methods to model higher-dimensional streaming data and to streaming data classification.
dc.rightsTerms and Conditions: Copyright in works deposited in Minerva Access is retained by the copyright owner. The work may not be altered without permission from the copyright owner. Readers may only download, print and save electronic copies of whole works for their own personal non-commercial use. Any use that exceeds these limits requires permission from the copyright owner. Attribution is essential when quoting or paraphrasing from these works.
dc.subjectstreaming data
dc.subjectkernel density estimation
dc.subjectkernel regression estimation
dc.subjectonline modelling
dc.subjectnonstationary data
dc.subjectconcept drift
dc.titleNonparametric estimation for streaming data
dc.typePhD thesis
melbourne.affiliation.departmentSchool of Mathematics and Statistics
melbourne.thesis.supervisornameAurore Delaigle
melbourne.contributor.authorMao, Jiadong
melbourne.thesis.supervisorothernameFelix Camirand Lemyre
melbourne.tes.fieldofresearch1010405 Statistical Theory
melbourne.accessrightsThis item is embargoed and will be available on 2022-04-22.

Files in this item


There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record