Show simple item record

dc.contributor.authorXu, Zemei
dc.description© 2019 Zemei Xu
dc.description.abstractStatistical variable selection, also known as feature selection, has become an indispensable tool in many research areas involving machine learning and data mining. The object of statistical variable selection is to select the best subset of predictors for fitting or predicting the response variable from a potentially large collection of candidate predictors. It is particularly important in high-dimensional problems such as cancer genetics, where there are potentially thousands of predictors and only a few are associated with the outcome. Moreover, predictors may have underlying group structures in them, so it is desirable to take the underlying group effects into consideration when performing variable selection and handle the grouping structure present in the model when selecting important variables. This thesis presents a Bayesian grouped regression model with continuous global-local shrinkage priors to tackle the high-dimensional variable selection problem by introducing shrinkage parameters at the group level as well as within each group. This model is able to handle complex group hierarchies that include overlapping and multilevel group structures, and also enjoys the advantages of handling sparsity as well as strong signals. The proposed method is also applied to a real high-dimensional GWAS breast cancer dataset, Haiman-Hopper dataset, to analyse the breast cancer genome-wide association study.
dc.rightsTerms and Conditions: Copyright in works deposited in Minerva Access is retained by the copyright owner. The work may not be altered without permission from the copyright owner. Readers may only download, print and save electronic copies of whole works for their own personal non-commercial use. Any use that exceeds these limits requires permission from the copyright owner. Attribution is essential when quoting or paraphrasing from these works.
dc.subjectGenome-wide association studies
dc.subjectHigh-dimensional data
dc.subjectBayesian penalised regression
dc.subjectBayesian Lasso
dc.subjectBayesian horseshoe
dc.subjectAdditive models
dc.subjectSparse group selection
dc.subjectSparse global-local shrinkage regression
dc.subjectBreast cancer
dc.titleSparse Bayesian Grouped Regression with Application to Genome-Wide Association Studies
dc.typePhD thesis
melbourne.affiliation.departmentMelbourne School of Population and Global Health
melbourne.affiliation.facultyMedicine, Dentistry & Health Sciences
melbourne.thesis.supervisornameDaniel Schmidt
melbourne.contributor.authorXu, Zemei
melbourne.thesis.supervisorothernameEnes Makalic
melbourne.thesis.supervisorothernameJohn Hopper
melbourne.thesis.supervisorothernameGuoqi Qian
melbourne.tes.fieldofresearch1010401 Applied Statistics
melbourne.tes.fieldofresearch2010402 Biostatistics
melbourne.accessrightsThis item is embargoed and will be available on 2022-01-13.

Files in this item


There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record