Computing and Information Systems - Research Publications

Permanent URI for this collection

Search Results

Now showing 1 - 10 of 149
  • Item
    Thumbnail Image
    Beyond a Disease Registry: An Integrated Virtual Environment for Adrenal Cancer Research
    Sinnott, RO ; Beuschlein, F ; Effendy, J ; Eisenhofer, G ; Gloeckner, S ; Stell, A (SPRINGER, 2016-12)
  • Item
    Thumbnail Image
    Estimating micro-populations through social media analytics
    Sinnott, RO ; Wang, W (SPRINGER WIEN, 2017-04-19)
  • Item
    Thumbnail Image
    Privacy Preserving Geo-Linkage in the Big Urban Data Era
    Sinnott, RO ; Bayliss, C ; Bromage, A ; Galang, G ; Gong, Y ; Greenwood, P ; Jayaputera, G ; Marques, D ; Morandini, L ; Nogoorani, G ; Pursultani, H ; Sarwar, M ; Voorsluys, W ; Widjaja, I (SPRINGER, 2016-12)
  • Item
    Thumbnail Image
    Supporting geospatial privacy-preserving data mining of social media
    Wang, S ; Sinnott, RO (SPRINGER WIEN, 2016-12)
  • Item
    Thumbnail Image
    EURO-WABB: an EU rare diseases registry for Wolfram syndrome, Alstrom syndrome and Bardet-Biedl syndrome
    Farmer, A ; Ayme, S ; de Heredia, ML ; Maffei, P ; McCafferty, S ; Mlynarski, W ; Nunes, V ; Parkinson, K ; Paquis-Flucklinger, V ; Rohayem, J ; Sinnott, R ; Tillmann, V ; Tranebaerg, L ; Barrett, TG (BMC, 2013-08-27)
    BACKGROUND: Wolfram, Alström and Bardet-Biedl (WABB) syndromes are rare diseases with overlapping features of multiple sensory and metabolic impairments, including diabetes mellitus, which have caused diagnostic confusion. There are as yet no specific treatments available, little or no access to well characterized cohorts of patients, and limited information on the natural history of the diseases. We aim to establish a Europe-wide registry for these diseases to inform patient care and research. METHODS: EURO-WABB is an international multicenter large-scale observational study capturing longitudinal clinical and outcome data for patients with WABB diagnoses. Three hundred participants will be recruited over 3 years from different sites throughout Europe. Comprehensive clinical, genetic and patient experience data will be collated into an anonymized disease registry. Data collection will be web-based, and forms part of the project's Virtual Research and Information Environment (VRIE). Participants who haven't undergone genetic diagnostic testing for their condition will be able to do so via the project. CONCLUSIONS: The registry data will be used to increase the understanding of the natural history of WABB diseases, to serve as an evidence base for clinical management, and to aid the identification of opportunities for intervention to stop or delay the progress of the disease. The detailed clinical characterisation will allow inclusion of patients into studies of novel treatment interventions, including targeted interventions in small scale open label studies; and enrolment into multi-national clinical trials. The registry will also support wider access to genetic testing, and encourage international collaborations for patient benefit.
  • Item
    Thumbnail Image
    Monogenic diabetes syndromes: Locus-specific databases for Alstrom, Wolfram, and Thiamine-responsive megaloblastic anemia
    Astuti, D ; Sabir, A ; Fulton, P ; Zatyka, M ; Williams, D ; Hardy, C ; Milan, G ; Favaretto, F ; Yu-Wai-Man, P ; Rohayem, J ; Lopez de Heredia, M ; Hershey, T ; Tranebjaerg, L ; Chen, J-H ; Chaussenot, A ; Nunes, V ; Marshall, B ; McAfferty, S ; Tillmann, V ; Maffei, P ; Paquis-Flucklinger, V ; Geberhiwot, T ; Mlynarski, W ; Parkinson, K ; Picard, V ; Esteban Bueno, G ; Dias, R ; Arnold, A ; Richens, C ; Paisey, R ; Urano, F ; Semple, R ; Sinnott, R ; Barrett, TG (WILEY, 2017-07)
    We developed a variant database for diabetes syndrome genes, using the Leiden Open Variation Database platform, containing observed phenotypes matched to the genetic variations. We populated it with 628 published disease-associated variants (December 2016) for: WFS1 (n = 309), CISD2 (n = 3), ALMS1 (n = 268), and SLC19A2 (n = 48) for Wolfram type 1, Wolfram type 2, Alström, and Thiamine-responsive megaloblastic anemia syndromes, respectively; and included 23 previously unpublished novel germline variants in WFS1 and 17 variants in ALMS1. We then investigated genotype-phenotype relations for the WFS1 gene. The presence of biallelic loss-of-function variants predicted Wolfram syndrome defined by insulin-dependent diabetes and optic atrophy, with a sensitivity of 79% (95% CI 75%-83%) and specificity of 92% (83%-97%). The presence of minor loss-of-function variants in WFS1 predicted isolated diabetes, isolated deafness, or isolated congenital cataracts without development of the full syndrome (sensitivity 100% [93%-100%]; specificity 78% [73%-82%]). The ability to provide a prognostic prediction based on genotype will lead to improvements in patient care and counseling. The development of the database as a repository for monogenic diabetes gene variants will allow prognostic predictions for other diabetes syndromes as next-generation sequencing expands the repertoire of genotypes and phenotypes. The database is publicly available online at https://lovd.euro-wabb.org.
  • Item
    Thumbnail Image
    Investigating reproducibility and tracking provenance - A genomic workflow case study
    Kanwal, S ; Khan, FZ ; Lonie, A ; Sinnott, RO (BMC, 2017-07-12)
    BACKGROUND: Computational bioinformatics workflows are extensively used to analyse genomics data, with different approaches available to support implementation and execution of these workflows. Reproducibility is one of the core principles for any scientific workflow and remains a challenge, which is not fully addressed. This is due to incomplete understanding of reproducibility requirements and assumptions of workflow definition approaches. Provenance information should be tracked and used to capture all these requirements supporting reusability of existing workflows. RESULTS: We have implemented a complex but widely deployed bioinformatics workflow using three representative approaches to workflow definition and execution. Through implementation, we identified assumptions implicit in these approaches that ultimately produce insufficient documentation of workflow requirements resulting in failed execution of the workflow. This study proposes a set of recommendations that aims to mitigate these assumptions and guides the scientific community to accomplish reproducible science, hence addressing reproducibility crisis. CONCLUSIONS: Reproducing, adapting or even repeating a bioinformatics workflow in any environment requires substantial technical knowledge of the workflow execution environment, resolving analysis assumptions and rigorous compliance with reproducibility requirements. Towards these goals, we propose conclusive recommendations that along with an explicit declaration of workflow specification would result in enhanced reproducibility of computational genomic analyses.
  • Item
    Thumbnail Image
    Influence of fecal collection conditions and 16S rRNA gene sequencing at two centers on human gut microbiota analysis
    Penington, JS ; Penno, MAS ; Ngui, KM ; Ajami, NJ ; Roth-Schulze, AJ ; Wilcox, SA ; Bandala-Sanchez, E ; Wentworth, JM ; Barry, SC ; Brown, CY ; Couper, JJ ; Petrosino, JF ; Papenfuss, AT ; Harrison, LC (NATURE PORTFOLIO, 2018-03-12)
    To optimise fecal sampling for reproducible analysis of the gut microbiome, we compared different methods of sample collection and sequencing of 16S rRNA genes at two centers. Samples collected from six individuals on three consecutive days were placed in commercial collection tubes (OMNIgeneGut OMR-200) or in sterile screw-top tubes in a home fridge or home freezer for 6-24 h, before transfer and storage at -80 °C. Replicate samples were shipped to centers in Australia and the USA for DNA extraction and sequencing by their respective PCR protocols, and analysed with the same bioinformatic pipeline. Variation in gut microbiome was dominated by differences between individuals. Minor differences in the abundance of taxa were found between collection-processing methods and day of collection, and between the two centers. We conclude that collection with storage and transport at 4 °C within 24 h is adequate for 16S rRNA analysis of the gut microbiome. Other factors including differences in PCR and sequencing methods account for relatively minor variation compared to differences between individuals.
  • Item
    Thumbnail Image
    Novel approach identifies SNPs in SLC2A10 and KCNK9 with evidence for parent-of-origin effect on body mass index.
    Hoggart, CJ ; Venturini, G ; Mangino, M ; Gomez, F ; Ascari, G ; Zhao, JH ; Teumer, A ; Winkler, TW ; Tšernikova, N ; Luan, J ; Mihailov, E ; Ehret, GB ; Zhang, W ; Lamparter, D ; Esko, T ; Macé, A ; Rüeger, S ; Bochud, P-Y ; Barcella, M ; Dauvilliers, Y ; Benyamin, B ; Evans, DM ; Hayward, C ; Lopez, MF ; Franke, L ; Russo, A ; Heid, IM ; Salvi, E ; Vendantam, S ; Arking, DE ; Boerwinkle, E ; Chambers, JC ; Fiorito, G ; Grallert, H ; Guarrera, S ; Homuth, G ; Huffman, JE ; Porteous, D ; Generation Scotland Consortium, ; LifeLines Cohort study, ; GIANT Consortium, ; Moradpour, D ; Iranzo, A ; Hebebrand, J ; Kemp, JP ; Lammers, GJ ; Aubert, V ; Heim, MH ; Martin, NG ; Montgomery, GW ; Peraita-Adrados, R ; Santamaria, J ; Negro, F ; Schmidt, CO ; Scott, RA ; Spector, TD ; Strauch, K ; Völzke, H ; Wareham, NJ ; Yuan, W ; Bell, JT ; Chakravarti, A ; Kooner, JS ; Peters, A ; Matullo, G ; Wallaschofski, H ; Whitfield, JB ; Paccaud, F ; Vollenweider, P ; Bergmann, S ; Beckmann, JS ; Tafti, M ; Hastie, ND ; Cusi, D ; Bochud, M ; Frayling, TM ; Metspalu, A ; Jarvelin, M-R ; Scherag, A ; Smith, GD ; Borecki, IB ; Rousson, V ; Hirschhorn, JN ; Rivolta, C ; Loos, RJF ; Kutalik, Z ; Visscher, PM (Public Library of Science (PLoS), 2014-07)
    The phenotypic effect of some single nucleotide polymorphisms (SNPs) depends on their parental origin. We present a novel approach to detect parent-of-origin effects (POEs) in genome-wide genotype data of unrelated individuals. The method exploits increased phenotypic variance in the heterozygous genotype group relative to the homozygous groups. We applied the method to >56,000 unrelated individuals to search for POEs influencing body mass index (BMI). Six lead SNPs were carried forward for replication in five family-based studies (of ∼4,000 trios). Two SNPs replicated: the paternal rs2471083-C allele (located near the imprinted KCNK9 gene) and the paternal rs3091869-T allele (located near the SLC2A10 gene) increased BMI equally (beta = 0.11 (SD), P<0.0027) compared to the respective maternal alleles. Real-time PCR experiments of lymphoblastoid cell lines from the CEPH families showed that expression of both genes was dependent on parental origin of the SNPs alleles (P<0.01). Our scheme opens new opportunities to exploit GWAS data of unrelated individuals to identify POEs and demonstrates that they play an important role in adult obesity.
  • Item
    Thumbnail Image
    TruXy: Trusted Storage Cloud for Scientific Workflows
    Nepal, S ; Sinnott, RO ; Friedrich, C ; Wise, C ; Chen, S ; Kanwal, S ; Yao, J ; Lonie, A (IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 2017-07-01)