University Library
  • Login
A gateway to Melbourne's research publications
Minerva Access is the University's Institutional Repository. It aims to collect, preserve, and showcase the intellectual output of staff and students of the University of Melbourne for a global audience.
View Item 
  • Minerva Access
  • Science
  • School of Mathematics and Statistics
  • School of Mathematics and Statistics - Research Publications
  • View Item
  • Minerva Access
  • Science
  • School of Mathematics and Statistics
  • School of Mathematics and Statistics - Research Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

    Sparse canonical methods for biological data integration: application to a cross-platform study

    Thumbnail
    Download
    Published version (1.024Mb)

    Citations
    Scopus
    Altmetric
    166
    Author
    Le Cao, K-A; Martin, PGP; Robert-Granie, C; Besse, P
    Date
    2009-01-26
    Source Title
    BMC Bioinformatics
    Publisher
    BIOMED CENTRAL LTD
    University of Melbourne Author/s
    Le Cao, Kim-Anh
    Affiliation
    School of Mathematics and Statistics
    Metadata
    Show full item record
    Document Type
    Journal Article
    Citations
    Le Cao, K. -A., Martin, P. G. P., Robert-Granie, C. & Besse, P. (2009). Sparse canonical methods for biological data integration: application to a cross-platform study. BMC BIOINFORMATICS, 10 (1), https://doi.org/10.1186/1471-2105-10-34.
    Access Status
    Open Access
    URI
    http://hdl.handle.net/11343/259606
    DOI
    10.1186/1471-2105-10-34
    Abstract
    BACKGROUND: In the context of systems biology, few sparse approaches have been proposed so far to integrate several data sets. It is however an important and fundamental issue that will be widely encountered in post genomic studies, when simultaneously analyzing transcriptomics, proteomics and metabolomics data using different platforms, so as to understand the mutual interactions between the different data sets. In this high dimensional setting, variable selection is crucial to give interpretable results. We focus on a sparse Partial Least Squares approach (sPLS) to handle two-block data sets, where the relationship between the two types of variables is known to be symmetric. Sparse PLS has been developed either for a regression or a canonical correlation framework and includes a built-in procedure to select variables while integrating data. To illustrate the canonical mode approach, we analyzed the NCI60 data sets, where two different platforms (cDNA and Affymetrix chips) were used to study the transcriptome of sixty cancer cell lines. RESULTS: We compare the results obtained with two other sparse or related canonical correlation approaches: CCA with Elastic Net penalization (CCA-EN) and Co-Inertia Analysis (CIA). The latter does not include a built-in procedure for variable selection and requires a two-step analysis. We stress the lack of statistical criteria to evaluate canonical correlation methods, which makes biological interpretation absolutely necessary to compare the different gene selections. We also propose comprehensive graphical representations of both samples and variables to facilitate the interpretation of the results. CONCLUSION: sPLS and CCA-EN selected highly relevant genes and complementary findings from the two data sets, which enabled a detailed understanding of the molecular characteristics of several groups of cell lines. These two approaches were found to bring similar results, although they highlighted the same phenomenons with a different priority. They outperformed CIA that tended to select redundant information.

    Export Reference in RIS Format     

    Endnote

    • Click on "Export Reference in RIS Format" and choose "open with... Endnote".

    Refworks

    • Click on "Export Reference in RIS Format". Login to Refworks, go to References => Import References


    Collections
    • Minerva Elements Records [52609]
    • School of Mathematics and Statistics - Research Publications [840]
    Minerva AccessDepositing Your Work (for University of Melbourne Staff and Students)NewsFAQs

    BrowseCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects
    My AccountLoginRegister
    StatisticsMost Popular ItemsStatistics by CountryMost Popular Authors