SparSNP: Fast and memory-efficient analysis of all SNPs for phenotype prediction
AuthorAbraham, G; Kowalczyk, A; Zobel, J; Inouye, M
Source TitleBMC Bioinformatics
University of Melbourne Author/sKowalczyk, Adam; Zobel, Justin; Inouye, Michael; Abraham, Gad; ABRAHAM, GAD
Computing and Information Systems
Document TypeJournal Article
CitationsAbraham, G., Kowalczyk, A., Zobel, J. & Inouye, M. (2012). SparSNP: Fast and memory-efficient analysis of all SNPs for phenotype prediction. BMC BIOINFORMATICS, 13 (1), https://doi.org/10.1186/1471-2105-13-88.
Access StatusAccess this item via the Open Access location
Open Access URLPublished version
NHMRC Grant codeNHMRC/637400
BACKGROUND: A central goal of genomics is to predict phenotypic variation from genetic variation. Fitting predictive models to genome-wide and whole genome single nucleotide polymorphism (SNP) profiles allows us to estimate the predictive power of the SNPs and potentially develop diagnostic models for disease. However, many current datasets cannot be analysed with standard tools due to their large size. RESULTS: We introduce SparSNP, a tool for fitting lasso linear models for massive SNP datasets quickly and with very low memory requirements. In analysis on a large celiac disease case/control dataset, we show that SparSNP runs substantially faster than four other state-of-the-art tools for fitting large scale penalised models. SparSNP was one of only two tools that could successfully fit models to the entire celiac disease dataset, and it did so with superior performance. Compared with the other tools, the models generated by SparSNP had better than or equal to predictive performance in cross-validation. CONCLUSIONS: Genomic datasets are rapidly increasing in size, rendering existing approaches to model fitting impractical due to their prohibitive time or memory requirements. This study shows that SparSNP is an essential addition to the genomic analysis toolkit.SparSNP is available at http://www.genomics.csse.unimelb.edu.au/SparSNP.
- Click on "Export Reference in RIS Format" and choose "open with... Endnote".
- Click on "Export Reference in RIS Format". Login to Refworks, go to References => Import References