School of Mathematics and Statistics - Research Publications

Permanent URI for this collection

Search Results

Now showing 1 - 5 of 5
  • Item
    Thumbnail Image
    Molecular networks involved in mouse cerebral corticogenesis and spatio-temporal regulation of Sox4 and Sox11 novel antisense transcripts revealed by transcriptome profiling
    Ling, K-H ; Hewitt, CA ; Beissbarth, T ; Hyde, L ; Banerjee, K ; Cheah, P-S ; Cannon, PZ ; Hahn, CN ; Thomas, PQ ; Smyth, GK ; Tan, S-S ; Thomas, T ; Scott, HS (BMC, 2009)
    BACKGROUND: Development of the cerebral cortex requires highly specific spatio-temporal regulation of gene expression. It is proposed that transcriptome profiling of the cerebral cortex at various developmental time points or regions will reveal candidate genes and associated molecular pathways involved in cerebral corticogenesis. RESULTS: Serial analysis of gene expression (SAGE) libraries were constructed from C57BL/6 mouse cerebral cortices of age embryonic day (E) 15.5, E17.5, postnatal day (P) 1.5 and 4 to 6 months. Hierarchical clustering analysis of 561 differentially expressed transcripts showed regionalized, stage-specific and co-regulated expression profiles. SAGE expression profiles of 70 differentially expressed transcripts were validated using quantitative RT-PCR assays. Ingenuity pathway analyses of validated differentially expressed transcripts demonstrated that these transcripts possess distinctive functional properties related to various stages of cerebral corticogenesis and human neurological disorders. Genomic clustering analysis of the differentially expressed transcripts identified two highly transcribed genomic loci, Sox4 and Sox11, during embryonic cerebral corticogenesis. These loci feature unusual overlapping sense and antisense transcripts with alternative polyadenylation sites and differential expression. The Sox4 and Sox11 antisense transcripts were highly expressed in the brain compared to other mouse organs and are differentially expressed in both the proliferating and differentiating neural stem/progenitor cells and P19 (embryonal carcinoma) cells. CONCLUSIONS: We report validated gene expression profiles that have implications for understanding the associations between differentially expressed transcripts, novel targets and related disorders pertaining to cerebral corticogenesis. The study reports, for the first time, spatio-temporally regulated Sox4 and Sox11 antisense transcripts in the brain, neural stem/progenitor cells and P19 cells, suggesting they have an important role in cerebral corticogenesis and neuronal/glial cell differentiation.
  • Item
    Thumbnail Image
    Illumina WG-6 BeadChip strips should be normalized separately
    Shi, W ; Banerjee, A ; Ritchie, ME ; Gerondakis, S ; Smyth, GK (BMC, 2009-11-11)
    BACKGROUND: Illumina Sentrix-6 Whole-Genome Expression BeadChips are relatively new microarray platforms which have been used in many microarray studies in the past few years. These Chips have a unique design in which each Chip contains six microarrays and each microarray consists of two separate physical strips, posing special challenges for precise between-array normalization of expression values. RESULTS: None of the normalization strategies proposed so far for this microarray platform allow for the possibility of systematic variation between the two strips comprising each array. That this variation can be substantial is illustrated by a data example. We demonstrate that normalizing at the strip-level rather than at the array-level can effectively remove this between-strip variation, improve the precision of gene expression measurements and discover more differentially expressed genes. The gain is substantial, yielding a 20% increase in statistical information and doubling the number of genes detected at a 5% false discovery rate. Functional analysis reveals that the extra genes found tend to have interesting biological meanings, dramatically strengthening the biological conclusions from the experiment. Strip-level normalization still outperforms array-level normalization when non-expressed probes are filtered out. CONCLUSION: Plots are proposed which demonstrate how the need for strip-level normalization relates to inconsistent intensity range variation between the strips. Strip-level normalization is recommended for the preprocessing of Illumina Sentrix-6 BeadChips whenever the intensity range is seen to be inconsistent between the strips. R code is provided to implement the recommended plots and normalization algorithms.
  • Item
    Thumbnail Image
    Microarray background correction: maximum likelihood estimation for the normal-exponential convolution
    Silver, JD ; Ritchie, ME ; Smyth, GK (OXFORD UNIV PRESS, 2009-04)
    Background correction is an important preprocessing step for microarray data that attempts to adjust the data for the ambient intensity surrounding each feature. The "normexp" method models the observed pixel intensities as the sum of 2 random variables, one normally distributed and the other exponentially distributed, representing background noise and signal, respectively. Using a saddle-point approximation, Ritchie and others (2007) found normexp to be the best background correction method for 2-color microarray data. This article develops the normexp method further by improving the estimation of the parameters. A complete mathematical development is given of the normexp model and the associated saddle-point approximation. Some subtle numerical programming issues are solved which caused the original normexp method to fail occasionally when applied to unusual data sets. A practical and reliable algorithm is developed for exact maximum likelihood estimation (MLE) using high-quality optimization software and using the saddle-point estimates as starting values. "MLE" is shown to outperform heuristic estimators proposed by other authors, both in terms of estimation accuracy and in terms of performance on real data. The saddle-point approximation is an adequate replacement in most practical situations. The performance of normexp for assessing differential expression is improved by adding a small offset to the corrected intensities.
  • Item
    Thumbnail Image
    Testing significance relative to a fold-change threshold is a TREAT
    McCarthy, DJ ; Smyth, GK (OXFORD UNIV PRESS, 2009-03-15)
    MOTIVATION: Statistical methods are used to test for the differential expression of genes in microarray experiments. The most widely used methods successfully test whether the true differential expression is different from zero, but give no assurance that the differences found are large enough to be biologically meaningful. RESULTS: We present a method, t-tests relative to a threshold (TREAT), that allows researchers to test formally the hypothesis (with associated p-values) that the differential expression in a microarray experiment is greater than a given (biologically meaningful) threshold. We have evaluated the method using simulated data, a dataset from a quality control experiment for microarrays and data from a biological experiment investigating histone deacetylase inhibitors. When the magnitude of differential expression is taken into account, TREAT improves upon the false discovery rate of existing methods and identifies more biologically relevant genes. AVAILABILITY: R code implementing our methods is contributed to the software package limma available at http://www.bioconductor.org.
  • Item
    Thumbnail Image
    Molecular dissection of the pea shoot apical meristem
    Liang, D ; Wong, CE ; Singh, MB ; Beveridge, CA ; Phipson, B ; Smyth, GK ; Bhalla, PL (OXFORD UNIV PRESS, 2009)
    The shoot apical meristem (SAM) is responsible for the development of all the above-ground parts of a plant. Our understanding of the SAM at the molecular level is incomplete. This study investigates the gene expression repertoire of SAMs in the garden pea (Pisum sativum). To this end, 10 346 EST sequences representing 7610 unique genes were generated from SAM cDNA libraries. These sequences, together with previously reported pea ESTs, were used to construct a 12K oligonucleotide array to identify genes with differential SAM expression, as compared to axillary meristems, root apical meristems, or non-meristematic tissues. A number of genes were identified, predominantly expressed in specific cell layers or domains of the SAM and thus are likely components of the gene networks involved in stem cell maintenance or the initiation of lateral organs. Further in situ hybridization analysis confirmed the spatial localization of some of these genes within the SAM. Our data also indicate the diversification of some gene expression patterns and hence functions in legume crop plants. A number of transcripts highly expressed in all three meristems have also been uncovered and these candidates may provide valuable insight into molecular networks that underpin the maintenance of meristematic functionality.