- Melbourne Medical School Collected Works - Research Publications
Melbourne Medical School Collected Works - Research Publications
Permanent URI for this collection
1101 results
Filters
Settings
Statistics
Citations
Search Results
Now showing
1 - 10 of 1101
-
ItemCSM-Toxin: A Web-Server for Predicting Protein ToxicityMorozov, V ; Rodrigues, CHM ; Ascher, DB (MDPI, 2023-02-01)Biologics are one of the most rapidly expanding classes of therapeutics, but can be associated with a range of toxic properties. In small-molecule drug development, early identification of potential toxicity led to a significant reduction in clinical trial failures, however we currently lack robust qualitative rules or predictive tools for peptide- and protein-based biologics. To address this, we have manually curated the largest set of high-quality experimental data on peptide and protein toxicities, and developed CSM-Toxin, a novel in-silico protein toxicity classifier, which relies solely on the protein primary sequence. Our approach encodes the protein sequence information using a deep learning natural languages model to understand "biological" language, where residues are treated as words and protein sequences as sentences. The CSM-Toxin was able to accurately identify peptides and proteins with potential toxicity, achieving an MCC of up to 0.66 across both cross-validation and multiple non-redundant blind tests, outperforming other methods and highlighting the robust and generalisable performance of our model. We strongly believe the CSM-Toxin will serve as a valuable platform to minimise potential toxicity in the biologic development pipeline. Our method is freely available as an easy-to-use webserver.
-
ItemDockNet: high-throughput protein-protein interface contact prediction.Williams, NP ; Rodrigues, CHM ; Truong, J ; Ascher, DB ; Holien, JK ; Cowen, L (Oxford University Press (OUP), 2023-01-01)MOTIVATION: Over 300 000 protein-protein interaction (PPI) pairs have been identified in the human proteome and targeting these is fast becoming the next frontier in drug design. Predicting PPI sites, however, is a challenging task that traditionally requires computationally expensive and time-consuming docking simulations. A major weakness of modern protein docking algorithms is the inability to account for protein flexibility, which ultimately leads to relatively poor results. RESULTS: Here, we propose DockNet, an efficient Siamese graph-based neural network method which predicts contact residues between two interacting proteins. Unlike other methods that only utilize a protein's surface or treat the protein structure as a rigid body, DockNet incorporates the entire protein structure and places no limits on protein flexibility during an interaction. Predictions are modeled at the residue level, based on a diverse set of input node features including residue type, surface accessibility, residue depth, secondary structure, pharmacophore and torsional angles. DockNet is comparable to current state-of-the-art methods, achieving an area under the curve (AUC) value of up to 0.84 on an independent test set (DB5), can be applied to a variety of different protein structures and can be utilized in situations where accurate unbound protein structures cannot be obtained. AVAILABILITY AND IMPLEMENTATION: DockNet is available at https://github.com/npwilliams09/docknet and an easy-to-use webserver at https://biosig.lab.uq.edu.au/docknet. All other data underlying this article are available in the article and in its online supplementary material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
-
ItemA bias of Asparagine to Lysine mutations in SARS-CoV-2 outside the receptor binding domain affects protein flexibility.Boer, JC ; Pan, Q ; Holien, JK ; Nguyen, T-B ; Ascher, DB ; Plebanski, M (Frontiers Media SA, 2022)INTRODUCTION: COVID-19 pandemic has been threatening public health and economic development worldwide for over two years. Compared with the original SARS-CoV-2 strain reported in 2019, the Omicron variant (B.1.1.529.1) is more transmissible. This variant has 34 mutations in its Spike protein, 15 of which are present in the Receptor Binding Domain (RBD), facilitating viral internalization via binding to the angiotensin-converting enzyme 2 (ACE2) receptor on endothelial cells as well as promoting increased immune evasion capacity. METHODS: Herein we compared SARS-CoV-2 proteins (including ORF3a, ORF7, ORF8, Nucleoprotein (N), membrane protein (M) and Spike (S) proteins) from multiple ancestral strains. We included the currently designated original Variant of Concern (VOC) Omicron, its subsequent emerged variants BA.1, BA2, BA3, BA.4, BA.5, the two currently emerging variants BQ.1 and BBX.1, and compared these with the previously circulating VOCs Alpha, Beta, Gamma, and Delta, to better understand the nature and potential impact of Omicron specific mutations. RESULTS: Only in Omicron and its subvariants, a bias toward an Asparagine to Lysine (N to K) mutation was evident within the Spike protein, including regions outside the RBD domain, while none of the regions outside the Spike protein domain were characterized by this mutational bias. Computational structural analysis revealed that three of these specific mutations located in the central core region, contribute to a preference for the alteration of conformations of the Spike protein. Several mutations in the RBD which have circulated across most Omicron subvariants were also analysed, and these showed more potential for immune escape. CONCLUSION: This study emphasizes the importance of understanding how specific N to K mutations outside of the RBD region affect SARS-CoV-2 conformational changes and the need for neutralizing antibodies for Omicron to target a subset of conformationally dependent B cell epitopes.
-
ItemGenetic variation in histone modifications and gene expression identifies regulatory variants in the mammary gland of cattleProwse-Wilkins, CP ; Lopdell, TJ ; Xiang, R ; Vander Jagt, CJ ; Littlejohn, MD ; Chamberlain, AJ ; Goddard, ME (BMC, 2022-12-08)BACKGROUND: Causal variants for complex traits, such as eQTL are often found in non-coding regions of the genome, where they are hypothesised to influence phenotypes by regulating gene expression. Many regulatory regions are marked by histone modifications, which can be assayed by chromatin immunoprecipitation followed by sequencing (ChIP-seq). Sequence reads from ChIP-seq form peaks at putative regulatory regions, which may reflect the amount of regulatory activity at this region. Therefore, eQTL which are also associated with differences in histone modifications are excellent candidate causal variants. RESULTS: We assayed the histone modifications H3K4Me3, H3K4Me1 and H3K27ac and mRNA in the mammary gland of up to 400 animals. We identified QTL for peak height (histone QTL), exon expression (eeQTL), allele specific expression (aseQTL) and allele specific binding (asbQTL). By intersecting these results, we identify variants which may influence gene expression by altering regulatory regions of the genome, and may be causal variants for other traits. Lastly, we find that these variants are found in putative transcription factor binding sites, identifying a mechanism for the effect of many eQTL. CONCLUSIONS: We find that allele specific and traditional QTL analysis often identify the same genetic variants and provide evidence that many eQTL are regulatory variants which alter activity at regulatory regions of the bovine genome. Our work provides methodological and biological updates on how regulatory mechanisms interplay at multi-omics levels.
-
ItemDementia Risk Reduction in Primary Care: A Scoping Review of Clinical Guidelines Using a Behavioral Specificity FrameworkGodbee, K ; Guccione, L ; Palmer, VJ ; Gunn, J ; Lautenschlager, N ; Francis, JJ ; Macpherson, H (IOS PRESS, 2022-01-01)BACKGROUND: Primary care practitioners are being called upon to work with their patients to reduce dementia risk. However, it is unclear who should do what with whom, when, and under what circumstances. OBJECTIVE: This scoping review aimed to identify clinical guidelines for dementia risk reduction (DRR) in primary care settings, synthesize the guidelines into actionable behaviors, and appraise the guidelines for specificity. METHODS: Terms related to "dementia", "guidelines", and "risk reduction" were entered into two academic databases and two web search engines. Guidelines were included if they referred specifically to clinical practices for healthcare professionals for primary prevention of dementia. Included guidelines were analyzed using a directed content analysis method, underpinned by the Action-Actor-Context-Target-Time framework for specifying behavior. RESULTS: Eighteen guidelines were included in the analysis. Together, the guidelines recommended six distinct clusters of actions for DRR. These were to 1) invite patients to discuss DRR, 2) identify patients with risk factors for dementia, 3) discuss DRR, 4) manage dementia risk factors, 5) signpost to additional support, and 6) follow up. Guidelines recommended various actors, contexts, targets, and times for performing these actions. Together, guidelines lacked specificity and were at times contradictory. CONCLUSION: Currently available guidelines allow various approaches to promoting DRR in primary care. Primary care teams are advised to draw on the results of the review to decide which actions to undertake and the locally appropriate actors, contexts, targets, and times for these actions. Documenting these decisions in more specific, local guidelines for promoting DRR should facilitate implementation.
-
ItemA structural biology community assessment of AlphaFold2 applications.Akdel, M ; Pires, DEV ; Pardo, EP ; Jänes, J ; Zalevsky, AO ; Mészáros, B ; Bryant, P ; Good, LL ; Laskowski, RA ; Pozzati, G ; Shenoy, A ; Zhu, W ; Kundrotas, P ; Serra, VR ; Rodrigues, CHM ; Dunham, AS ; Burke, D ; Borkakoti, N ; Velankar, S ; Frost, A ; Basquin, J ; Lindorff-Larsen, K ; Bateman, A ; Kajava, AV ; Valencia, A ; Ovchinnikov, S ; Durairaj, J ; Ascher, DB ; Thornton, JM ; Davey, NE ; Stein, A ; Elofsson, A ; Croll, TI ; Beltrao, P (Springer Science and Business Media LLC, 2022-11)Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods for protein structure predictions have reached the accuracy of experimentally determined models. Although this has been independently verified, the implementation of these methods across structural-biology applications remains to be tested. Here, we evaluate the use of AlphaFold2 (AF2) predictions in the study of characteristic structural elements; the impact of missense variants; function and ligand binding site predictions; modeling of interactions; and modeling of experimental structural data. For 11 proteomes, an average of 25% additional residues can be confidently modeled when compared with homology modeling, identifying structural features rarely seen in the Protein Data Bank. AF2-based predictions of protein disorder and complexes surpass dedicated tools, and AF2 models can be used across diverse applications equally well compared with experimentally determined structures, when the confidence metrics are critically considered. In summary, we find that these advances are likely to have a transformative impact in structural biology and broader life-science research.
-
ItemkinCSM: Using graph-based signatures to predict small molecule CDK2 inhibitorsZhou, Y ; Al-Jarf, R ; Alavi, A ; Thanh, BN ; Rodrigues, CHM ; Pires, DE ; Ascher, DB (WILEY, 2022-11-01)Protein phosphorylation acts as an essential on/off switch in many cellular signaling pathways. This has led to ongoing interest in targeting kinases for therapeutic intervention. Computer-aided drug discovery has been proven a useful and cost-effective approach for facilitating prioritization and enrichment of screening libraries, but limited effort has been devoted providing insights on what makes a potent kinase inhibitor. To fill this gap, here we developed kinCSM, an integrative computational tool capable of accurately identifying potent cyclin-dependent kinase 2 (CDK2) inhibitors, quantitatively predicting CDK2 ligand-kinase inhibition constants (pKi ) and classifying different types of inhibitors based on their favorable binding modes. kinCSM predictive models were built using supervised learning and leveraged the concept of graph-based signatures to capture both physicochemical properties and geometry properties of small molecules. CDK2 inhibitors were accurately identified with Matthew's Correlation Coefficients (MCC) of up to 0.74, and inhibition constants predicted with Pearson's correlation of up to 0.76, both with consistent performances of 0.66 and 0.68 on a nonredundant blind test, respectively. kinCSM was also able to identify the potential type of inhibition for a given molecule, achieving MCC of up to 0.80 on cross-validation and 0.73 on the blind test. Analyzing the molecular composition of revealed enriched chemical fragments in CDK2 inhibitors and different types of inhibitors, which provides insights into the molecular mechanisms behind ligand-kinase interactions. kinCSM will be an invaluable tool to guide future kinase drug discovery. To aid the fast and accurate screening of CDK2 inhibitors, kinCSM is freely available at https://biosig.lab.uq.edu.au/kin_csm/.
-
ItemVIVID A Web Application for Variant Interpretation and Visualization in Multi-dimensional AnalysesTichkule, S ; Myung, Y ; Naung, MT ; Ansell, BRE ; Guy, AJ ; Srivastava, N ; Mehra, S ; Caccio, SM ; Mueller, I ; Barry, AE ; van Oosterhout, C ; Pope, B ; Ascher, DB ; Jex, AR ; Teeling, E (OXFORD UNIV PRESS, 2022-09-01)Large-scale comparative genomics- and population genetic studies generate enormous amounts of polymorphism data in the form of DNA variants. Ultimately, the goal of many of these studies is to associate genetic variants to phenotypes or fitness. We introduce VIVID, an interactive, user-friendly web application that integrates a wide range of approaches for encoding genotypic to phenotypic information in any organism or disease, from an individual or population, in three-dimensional (3D) space. It allows mutation mapping and annotation, calculation of interactions and conservation scores, prediction of harmful effects, analysis of diversity and selection, and 3D visualization of genotypic information encoded in Variant Call Format on AlphaFold2 protein models. VIVID enables the rapid assessment of genes of interest in the study of adaptive evolution and the genetic load, and it helps prioritizing targets for experimental validation. We demonstrate the utility of VIVID by exploring the evolutionary genetics of the parasitic protist Plasmodium falciparum, revealing geographic variation in the signature of balancing selection in potential targets of functional antibodies.
-
ItemHGDiscovery: An online tool providing functional and phenotypic information on novel variants of homogentisate 1,2- dioxigenaseKarmakar, M ; Cicaloni, V ; Rodrigues, CHM ; Spiga, O ; Santucci, A ; Ascher, DB (ELSEVIER, 2022-09-09)Alkaptonuria (AKU), a rare genetic disorder, is characterized by the accumulation of homogentisic acid (HGA) in the body. Affected individuals lack functional levels of an enzyme required to breakdown HGA. Mutations in the homogentisate 1,2-dioxygenase (HGD) gene cause AKU and they are responsible for deficient levels of functional HGD, which, in turn, leads to excess levels of HGA. Although HGA is rapidly cleared from the body by the kidneys, in the long term it starts accumulating in various tissues, especially cartilage. Over time (rarely before adulthood), it eventually changes the color of affected tissue to slate blue or black. Here we report a comprehensive mutation analysis of 111 pathogenic and 190 non-pathogenic HGD missense mutations using protein structural information. Using our comprehensive suite of graph-based signature methods, mCSM complemented with sequence-based tools, we studied the functional and molecular consequences of each mutation on protein stability, interaction and evolutionary conservation. The scores generated from the structure and sequence-based tools were used to train a supervised machine learning algorithm with 89% accuracy. The empirical classifier was used to generate the variant phenotype for novel HGD missense mutations. All this information is deployed as a user friendly freely available web server called HGDiscovery (https://biosig.lab.uq.edu.au/hgdiscovery/).
-
ItemA multi-tissue atlas of regulatory variants in cattleLiu, S ; Gao, Y ; Canela-Xandri, O ; Wang, S ; Yu, Y ; Cai, W ; Li, B ; Xiang, R ; Chamberlain, AJ ; Pairo-Castineira, E ; D'Mellow, K ; Rawlik, K ; Xia, C ; Yao, Y ; Navarro, P ; Rocha, D ; Li, X ; Yan, Z ; Li, C ; Rosen, BD ; Van Tassell, CP ; Vanraden, PM ; Zhang, S ; Ma, L ; Cole, JB ; Liu, GE ; Tenesa, A ; Fang, L (NATURE PORTFOLIO, 2022-08-11)Characterization of genetic regulatory variants acting on livestock gene expression is essential for interpreting the molecular mechanisms underlying traits of economic value and for increasing the rate of genetic gain through artificial selection. Here we build a Cattle Genotype-Tissue Expression atlas (CattleGTEx) as part of the pilot phase of the Farm animal GTEx (FarmGTEx) project for the research community based on 7,180 publicly available RNA-sequencing (RNA-seq) samples. We describe the transcriptomic landscape of more than 100 tissues/cell types and report hundreds of thousands of genetic associations with gene expression and alternative splicing for 23 distinct tissues. We evaluate the tissue-sharing patterns of these genetic regulatory effects, and functionally annotate them using multiomics data. Finally, we link gene expression in different tissues to 43 economically important traits using both transcriptome-wide association and colocalization analyses to decipher the molecular regulatory mechanisms underpinning such agronomic traits in cattle.