School of Mathematics and Statistics - Research Publications

Permanent URI for this collection

Search Results

Now showing 1 - 10 of 172
  • Item
    Thumbnail Image
    Effects of Salt Supplementation on the Albuminuric Response to Telmisartan With or Without Hydrochlorothiazide Therapy in Hypertensive Patients With Type 2 Diabetes Are Modulated by Habitual Dietary Salt Intake
    Ekinci, EI ; Thomas, G ; Thomas, D ; Johnson, C ; MacIsaac, RJ ; Houlihan, CA ; Finch, S ; Panagiotopoulos, S ; O'Callaghan, C ; Jerums, G (AMER DIABETES ASSOC, 2009-08)
    OBJECTIVE This prospective randomized double-blind placebo-controlled crossover study examined the effects of sodium chloride (NaCl) supplementation on the antialbuminuric action of telmisartan with or without hydrochlorothiazide (HCT) in hypertensive patients with type 2 diabetes, increased albumin excretion rate (AER), and habitual low dietary salt intake (LDS; <100 mmol sodium/24 h on two of three consecutive occasions) or high dietary salt intake (HDS; >200 mmol sodium/24 h on two of three consecutive occasions). RESEARCH DESIGN AND METHODS Following a washout period, subjects (n = 32) received 40 mg/day telmisartan for 4 weeks followed by 40 mg telmisartan plus 12.5 mg/day HCT for 4 weeks. For the last 2 weeks of each treatment period, patients received either 100 mmol/day NaCl or placebo capsules. After a second washout, the regimen was repeated with supplements in reverse order. AER and ambulatory blood pressure were measured at weeks 0, 4, 8, 14, 18, and 22. RESULTS In LDS, NaCl supplementation reduced the anti-albuminuric effect of telmisartan with or without HCT from 42.3% (placebo) to 9.5% (P = 0.004). By contrast, in HDS, NaCl supplementation did not reduce the AER response to telmisartan with or without HCT (placebo 30.9%, NaCl 28.1%, P = 0.7). Changes in AER were independent of changes in blood pressure. CONCLUSIONS The AER response to telmisartan with or without HCT under habitual low salt intake can be blunted by NaCl supplementation. By contrast, when there is already a suppressed renin angiotensin aldosterone system under habitual high dietary salt intake, the additional NaCl does not alter the AER response.
  • Item
    Thumbnail Image
    Predicting qualitative phenotypes from microarray data - the Eadgene pig data set.
    Robert-Granié, C ; Lê Cao, K-A ; Sancristobal, M (Springer Science and Business Media LLC, 2009-07-16)
    BACKGROUND: The aim of this work was to study the performances of 2 predictive statistical tools on a data set that was given to all participants of the Eadgene-SABRE Post Analyses Working Group, namely the Pig data set of Hazard et al. (2008). The data consisted of 3686 gene expressions measured on 24 animals partitioned in 2 genotypes and 2 treatments. The objective was to find biomarkers that characterized the genotypes and the treatments in the whole set of genes. METHODS: We first considered the Random Forest approach that enables the selection of predictive variables. We then compared the classical Partial Least Squares regression (PLS) with a novel approach called sparse PLS, a variant of PLS that adapts lasso penalization and allows for the selection of a subset of variables. RESULTS: All methods performed well on this data set. The sparse PLS outperformed the PLS in terms of prediction performance and improved the interpretability of the results. CONCLUSION: We recommend the use of machine learning methods such as Random Forest and multivariate methods such as sparse PLS for prediction purposes. Both approaches are well adapted to transcriptomic data where the number of features is much greater than the number of individuals.
  • Item
    Thumbnail Image
    Feasibility, design and conduct of a pragmatic randomized controlled trial to reduce overweight and obesity in children: The electronic games to aid motivation to exercise (eGAME) study.
    Maddison, R ; Foley, L ; Mhurchu, CN ; Jull, A ; Jiang, Y ; Prapavessis, H ; Rodgers, A ; Vander Hoorn, S ; Hohepa, M ; Schaaf, D (Springer Science and Business Media LLC, 2009-05-19)
    BACKGROUND: Childhood obesity has reached epidemic proportions in developed countries. Sedentary screen-based activities such as video gaming are thought to displace active behaviors and are independently associated with obesity. Active video games, where players physically interact with images onscreen, may have utility as a novel intervention to increase physical activity and improve body composition in children. The aim of the Electronic Games to Aid Motivation to Exercise (eGAME) study is to determine the effects of an active video game intervention over 6 months on: body mass index (BMI), percent body fat, waist circumference, cardio-respiratory fitness, and physical activity levels in overweight children. METHODS/DESIGN: Three hundred and thirty participants aged 10-14 years will be randomized to receive either an active video game upgrade package or to a control group (no intervention). DISCUSSION: An overview of the eGAME study is presented, providing an example of a large, pragmatic randomized controlled trial in a community setting. Reflection is offered on key issues encountered during the course of the study. In particular, investigation into the feasibility of the proposed intervention, as well as robust testing of proposed study procedures is a critical step prior to implementation of a large-scale trial.
  • Item
    Thumbnail Image
    Improved Methodology for Assessment of mRNA Levels in Blood of Patients with FMR1 Related Disorders
    Godler, DE ; Loesch, DZ ; Huggins, R ; Gordon, L ; Slater, HR ; Gehling, F ; Burgess, T ; Choo, KHA (BIOMED CENTRAL LTD, 2009)
    BACKGROUND: Elevated levels of FMR1 mRNA in blood have been implicated in RNA toxicity associated with a number of clinical conditions. Due to the extensive inter-sample variation in the time lapse between the blood collection and RNA extraction in clinical practice, the resulting variation in mRNA quality significantly confounds mRNA analysis by real-time PCR. METHODS: Here, we developed an improved method to normalize for mRNA degradation in a sample set with large variation in rRNA quality, without sample omission. Initially, RNA samples were artificially degraded, and analyzed using capillary electrophoresis and real-time PCR standard curve method, with the aim of defining the best predictors of total RNA and mRNA degradation. RESULTS: We found that: (i) the 28S:18S ratio and RNA quality indicator (RQI) were good predictors of severe total RNA degradation, however, the greatest changes in the quantity of different mRNAs (FMR1, DNMT1, GUS, B2M and GAPDH) occurred during the early to moderate stages of degradation; (ii) chromatographic features for the 18S, 28S and the inter-peak region were the most reliable predictors of total RNA degradation, however their use for target gene normalization was inferior to internal control genes, of which GUS was the most appropriate. Using GUS for normalization, we examined in the whole blood the relationship between the FMR1 mRNA and CGG expansion in a non-coding portion of this gene, in a sample set (n = 30) with the large variation in rRNA quality. By combining FMR1 3' and 5' mRNA analyses the confounding impact of mRNA degradation on the correlation between FMR1 expression and CGG size was minimized, and the biological significance increased from p = 0.046 for the 5' FMR1 assay, to p = 0.018 for the combined FMR1 3' and 5' mRNA analysis. CONCLUSION: Our observations demonstrate that, through the use of an appropriate internal control and the direct analysis of multiple sites of target mRNA, samples that do not conform to the conventional rRNA criteria can still be utilized to obtain biologically/clinically relevant data. Although, this strategy clearly has application for improved assessment of FMR1 mRNA toxicity in blood, it may also have more general implications for gene expression studies in fresh and archival tissues.
  • Item
    No Preview Available
    The EADGENE microarray data analysis workshop (open access publication)
    De Koning, D-J ; Jaffrezic, F ; Lund, MS ; Watson, M ; Channing, C ; Hulsegge, I ; Pool, MH ; Buitenhuis, B ; Hedegaard, J ; Hornshoj, H ; Jiang, L ; Sorensen, P ; Marot, G ; Delmas, C ; Le Cao, K-A ; Cristobal, MS ; Baron, MD ; Malinverni, R ; Stella, A ; Brunner, RM ; Seyfert, H-M ; Jensen, K ; Mouzaki, D ; Waddington, D ; Jimenez-Marin, A ; Perez-Alegre, M ; Perez-Reinado, E ; Closset, R ; Detilleux, JC ; Dovc, P ; Lavric, M ; Nie, H ; Janss, L (EDP SCIENCES S A, 2007)
    Microarray analyses have become an important tool in animal genomics. While their use is becoming widespread, there is still a lot of ongoing research regarding the analysis of microarray data. In the context of a European Network of Excellence, 31 researchers representing 14 research groups from 10 countries performed and discussed the statistical analyses of real and simulated 2-colour microarray data that were distributed among participants. The real data consisted of 48 microarrays from a disease challenge experiment in dairy cattle, while the simulated data consisted of 10 microarrays from a direct comparison of two treatments (dye-balanced). While there was broader agreement with regards to methods of microarray normalisation and significance testing, there were major differences with regards to quality control. The quality control approaches varied from none, through using statistical weights, to omitting a large number of spots or omitting entire slides. Surprisingly, these very different approaches gave quite similar results when applied to the simulated data, although not all participating groups analysed both real and simulated data. The workshop was very successful in facilitating interaction between scientists with a diverse background but a common interest in microarray analyses.
  • Item
    No Preview Available
    Analysis of a simulated microarray dataset:: Comparison of methods for data normalisation and detection of differential expression (Open Access publication)
    Watson, M ; Alegre, MP ; Baron, MD ; Delmas, C ; Dovc, P ; Duval, M ; Foulley, JL ; Pavon, JJG ; Hulsegge, I ; Jaffrezic, F ; Marin, AJ ; Lavric, M ; Le Cao, KA ; Marot, G ; Mouzaki, D ; Pool, MH ; Granie, CR ; Cristobal, MS ; Klopp, GT ; Waddington, D ; De Koning, DJ (EDP SCIENCES S A, 2007)
    Microarrays allow researchers to measure the expression of thousands of genes in a single experiment. Before statistical comparisons can be made, the data must be assessed for quality and normalisation procedures must be applied, of which many have been proposed. Methods of comparing the normalised data are also abundant, and no clear consensus has yet been reached. The purpose of this paper was to compare those methods used by the EADGENE network on a very noisy simulated data set. With the a priori knowledge of which genes are differentially expressed, it is possible to compare the success of each approach quantitatively. Use of an intensity-dependent normalisation procedure was common, as was correction for multiple testing. Most variety in performance resulted from differing approaches to data quality and the use of different statistical tests. Very few of the methods used any kind of background correction. A number of approaches achieved a success rate of 95% or above, with relatively small numbers of false positives and negatives. Applying stringent spot selection criteria and elimination of data did not improve the false positive rate and greatly increased the false negative rate. However, most approaches performed well, and it is encouraging that widely available techniques can achieve such good results on a very noisy data set.
  • Item
    No Preview Available
    Analysis of the real EADGENE data set:: Multivariate approaches and post analysis (Open Access publication)
    Sorensen, P ; Bonnet, A ; Buitenhuis, B ; Closset, R ; Dejean, S ; Delmas, C ; Duval, M ; Glass, L ; Hedegaard, J ; Hornshoj, H ; Hulsegge, I ; Jaffrezic, F ; Jensen, K ; Jiang, L ; De Koning, D-J ; Le Cao, K-A ; Nie, H ; Petzl, W ; Pool, MH ; Robert-Granie, C ; Cristobal, MS ; Lund, MS ; Van Schothorst, EM ; Schuberth, H-J ; Seyfert, H-M ; Tosser-Klopp, G ; Waddington, D ; Watson, M ; Yang, W ; Zerbe, H (BMC, 2007)
    The aim of this paper was to describe, and when possible compare, the multivariate methods used by the participants in the EADGENE WP1.4 workshop. The first approach was for class discovery and class prediction using evidence from the data at hand. Several teams used hierarchical clustering (HC) or principal component analysis (PCA) to identify groups of differentially expressed genes with a similar expression pattern over time points and infective agent (E. coli or S. aureus). The main result from these analyses was that HC and PCA were able to separate tissue samples taken at 24 h following E. coli infection from the other samples. The second approach identified groups of differentially co-expressed genes, by identifying clusters of genes highly correlated when animals were infected with E. coli but not correlated more than expected by chance when the infective pathogen was S. aureus. The third approach looked at differential expression of predefined gene sets. Gene sets were defined based on information retrieved from biological databases such as Gene Ontology. Based on these annotation sources the teams used either the GlobalTest or the Fisher exact test to identify differentially expressed gene sets. The main result from these analyses was that gene sets involved in immune defence responses were differentially expressed.
  • Item
    No Preview Available
    Analysis of the real EADGENE data set:: Comparison of methods and guidelines for data normalisation and selection of diffrentially expressed genes (Open Access publication)
    Jaffrezic, F ; De Koning, D-J ; Boettcher, PJ ; Bonnet, A ; Buitenhuis, B ; Closset, R ; Dejean, S ; Delmas, C ; Detilleux, JC ; Dovc, P ; Duval, M ; Foulley, J-L ; Hedegaard, J ; Hornshoj, H ; Hulsegge, I ; Janss, L ; Jensen, K ; Jiang, L ; Lavric, M ; Le Cao, K-A ; Lund, MS ; Malinverni, R ; Marot, G ; Nie, H ; Petzl, W ; Pool, MH ; Granie, CR ; Cristobal, MS ; Van Schothorst, EM ; Schuberth, H-J ; Sorensen, P ; Stella, A ; Tosser-Klopp, G ; Waddington, D ; Watson, M ; Yang, W ; Zerbe, H ; Seyfert, H-M (BMC, 2007)
    A large variety of methods has been proposed in the literature for microarray data analysis. The aim of this paper was to present techniques used by the EADGENE (European Animal Disease Genomics Network of Excellence) WP1.4 participants for data quality control, normalisation and statistical methods for the detection of differentially expressed genes in order to provide some more general data analysis guidelines. All the workshop participants were given a real data set obtained in an EADGENE funded microarray study looking at the gene expression changes following artificial infection with two different mastitis causing bacteria: Escherichia coli and Staphylococcus aureus. It was reassuring to see that most of the teams found the same main biological results. In fact, most of the differentially expressed genes were found for infection by E. coli between uninfected and 24 h challenged udder quarters. Very little transcriptional variation was observed for the bacteria S. aureus. Lists of differentially expressed genes found by the different research teams were, however, quite dependent on the method used, especially concerning the data quality control step. These analyses also emphasised a biological problem of cross-talk between infected and uninfected quarters which will have to be dealt with for further microarray studies.
  • Item
    No Preview Available
    Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data
    Dai, MH ; Wang, PL ; Boyd, AD ; Kostov, G ; Athey, B ; Jones, EG ; Bunney, WE ; Myers, RM ; Speed, TP ; Akil, H ; Watson, SJ ; Meng, F (OXFORD UNIV PRESS, 2005)
    Genome-wide expression profiling is a powerful tool for implicating novel gene ensembles in cellular mechanisms of health and disease. The most popular platform for genome-wide expression profiling is the Affymetrix GeneChip. However, its selection of probes relied on earlier genome and transcriptome annotation which is significantly different from current knowledge. The resultant informatics problems have a profound impact on analysis and interpretation the data. Here, we address these critical issues and offer a solution. We identified several classes of problems at the individual probe level in the existing annotation, under the assumption that current genome and transcriptome databases are more accurate than those used for GeneChip design. We then reorganized probes on more than a dozen popular GeneChips into gene-, transcript- and exon-specific probe sets in light of up-to-date genome, cDNA/EST clustering and single nucleotide polymorphism information. Comparing analysis results between the original and the redefined probe sets reveals approximately 30-50% discrepancy in the genes previously identified as differentially expressed, regardless of analysis method. Our results demonstrate that the original Affymetrix probe set definitions are inaccurate, and many conclusions derived from past GeneChip analyses may be significantly flawed. It will be beneficial to re-analyze existing GeneChip data with updated probe set definitions.
  • Item
    No Preview Available
    The impact of low-cost, genome-wide resequencing on association studies.
    Balding, D (Springer Science and Business Media LLC, 2005-06)