School of BioSciences - Research Publications

Permanent URI for this collection

Search Results

Now showing 1 - 10 of 42
  • Item
    Thumbnail Image
    Protein degradation rate is the dominant mechanism accounting for the differences in protein abundance of basal p53 in a human breast and colorectal cancer cell line
    Lakatos, E ; Salehi-Reyhani, A ; Barclay, M ; Stumpf, MPH ; Klug, DR ; Deb, S (PUBLIC LIBRARY SCIENCE, 2017-05-10)
    We determine p53 protein abundances and cell to cell variation in two human cancer cell lines with single cell resolution, and show that the fractional width of the distributions is the same in both cases despite a large difference in average protein copy number. We developed a computational framework to identify dominant mechanisms controlling the variation of protein abundance in a simple model of gene expression from the summary statistics of single cell steady state protein expression distributions. Our results, based on single cell data analysed in a Bayesian framework, lends strong support to a model in which variation in the basal p53 protein abundance may be best explained by variations in the rate of p53 protein degradation. This is supported by measurements of the relative average levels of mRNA which are very similar despite large variation in the level of protein.
  • Item
    Thumbnail Image
    Inferring extrinsic noise from single-cell gene expression data using approximate Bayesian computation
    Lenive, O ; Kirk, PDW ; Stumpf, MPH (BIOMED CENTRAL LTD, 2016-08-22)
    BACKGROUND: Gene expression is known to be an intrinsically stochastic process which can involve single-digit numbers of mRNA molecules in a cell at any given time. The modelling of such processes calls for the use of exact stochastic simulation methods, most notably the Gillespie algorithm. However, this stochasticity, also termed "intrinsic noise", does not account for all the variability between genetically identical cells growing in a homogeneous environment. Despite substantial experimental efforts, determining appropriate model parameters continues to be a challenge. Methods based on approximate Bayesian computation can be used to obtain posterior parameter distributions given the observed data. However, such inference procedures require large numbers of simulations of the model and exact stochastic simulation is computationally costly. In this work we focus on the specific case of trying to infer model parameters describing reaction rates and extrinsic noise on the basis of measurements of molecule numbers in individual cells at a given time point. RESULTS: To make the problem computationally tractable we develop an exact, model-specific, stochastic simulation algorithm for the commonly used two-state model of gene expression. This algorithm relies on certain assumptions and favourable properties of the model to forgo the simulation of the whole temporal trajectory of protein numbers in the system, instead returning only the number of protein and mRNA molecules present in the system at a specified time point. The computational gain is proportional to the number of protein molecules created in the system and becomes significant for systems involving hundreds or thousands of protein molecules. CONCLUSIONS: We employ this simulation algorithm with approximate Bayesian computation to jointly infer the model's rate and noise parameters from published gene expression data. Our analysis indicates that for most genes the extrinsic contributions to noise will be small to moderate but certainly are non-negligible.
  • Item
    No Preview Available
    What the papers say: text mining for genomics and systems biology.
    Harmston, N ; Filsell, W ; Stumpf, MPH (Springer Science and Business Media LLC, 2010-10)
    Keeping up with the rapidly growing literature has become virtually impossible for most scientists. This can have dire consequences. First, we may waste research time and resources on reinventing the wheel simply because we can no longer maintain a reliable grasp on the published literature. Second, and perhaps more detrimental, judicious (or serendipitous) combination of knowledge from different scientific disciplines, which would require following disparate and distinct research literatures, is rapidly becoming impossible for even the most ardent readers of research publications. Text mining - the automated extraction of information from (electronically) published sources - could potentially fulfil an important role - but only if we know how to harness its strengths and overcome its weaknesses. As we do not expect that the rate at which scientific results are published will decrease, text mining tools are now becoming essential in order to cope with, and derive maximum benefit from, this information explosion. In genomics, this is particularly pressing as more and more rare disease-causing variants are found and need to be understood. Not being conversant with this technology may put scientists and biomedical regulators at a severe disadvantage. In this review, we introduce the basic concepts underlying modern text mining and its applications in genomics and systems biology. We hope that this review will serve three purposes: (i) to provide a timely and useful overview of the current status of this field, including a survey of present challenges; (ii) to enable researchers to decide how and when to apply text mining tools in their own research; and (iii) to highlight how the research communities in genomics and systems biology can help to make text mining from biomedical abstracts and texts more straightforward.
  • Item
    Thumbnail Image
    An information-theoretic framework for deciphering pleiotropic and noisy biochemical signaling
    Jetka, T ; Nienaltowski, K ; Filippi, S ; Stumpf, MPH ; Komorowski, M (NATURE PUBLISHING GROUP, 2018-11-02)
    Many components of signaling pathways are functionally pleiotropic, and signaling responses are marked with substantial cell-to-cell heterogeneity. Therefore, biochemical descriptions of signaling require quantitative support to explain how complex stimuli (inputs) are encoded in distinct activities of pathways effectors (outputs). A unique perspective of information theory cannot be fully utilized due to lack of modeling tools that account for the complexity of biochemical signaling, specifically for multiple inputs and outputs. Here, we develop a modeling framework of information theory that allows for efficient analysis of models with multiple inputs and outputs; accounts for temporal dynamics of signaling; enables analysis of how signals flow through shared network components; and is not restricted by limited variability of responses. The framework allows us to explain how identity and quantity of type I and type III interferon variants could be recognized by cells despite activating the same signaling effectors.
  • Item
    Thumbnail Image
    Quantitative time-resolved analysis reveals intricate, differential regulation of standard- and immuno-proteasomes
    Liepe, J ; Holzhuetter, H-G ; Bellavista, E ; Kloetzel, PM ; Stumpf, MPH ; Mishto, M (ELIFE SCIENCES PUBLICATIONS LTD, 2015-09-22)
    Proteasomal protein degradation is a key determinant of protein half-life and hence of cellular processes ranging from basic metabolism to a host of immunological processes. Despite its importance the mechanisms regulating proteasome activity are only incompletely understood. Here we use an iterative and tightly integrated experimental and modelling approach to develop, explore and validate mechanistic models of proteasomal peptide-hydrolysis dynamics. The 20S proteasome is a dynamic enzyme and its activity varies over time because of interactions between substrates and products and the proteolytic and regulatory sites; the locations of these sites and the interactions between them are predicted by the model, and experimentally supported. The analysis suggests that the rate-limiting step of hydrolysis is the transport of the substrates into the proteasome. The transport efficiency varies between human standard- and immuno-proteasomes thereby impinging upon total degradation rate and substrate cleavage-site usage.
  • Item
    Thumbnail Image
    Control mechanisms for stochastic biochemical systems via computation of reachable sets
    Lakatos, E ; Stumpf, MPH (ROYAL SOC, 2017-08)
    Controlling the behaviour of cells by rationally guiding molecular processes is an overarching aim of much of synthetic biology. Molecular processes, however, are notoriously noisy and frequently nonlinear. We present an approach to studying the impact of control measures on motifs of molecular interactions that addresses the problems faced in many biological systems: stochasticity, parameter uncertainty and nonlinearity. We show that our reachability analysis formalism can describe the potential behaviour of biological (naturally evolved as well as engineered) systems, and provides a set of bounds on their dynamics at the level of population statistics: for example, we can obtain the possible ranges of means and variances of mRNA and protein expression levels, even in the presence of uncertainty about model parameters.
  • Item
    Thumbnail Image
    Parametric and non-parametric gradient matching for network inference: a comparison
    Dony, L ; He, F ; Stumpf, MPH (BMC, 2019-01-25)
    BACKGROUND: Reverse engineering of gene regulatory networks from time series gene-expression data is a challenging problem, not only because of the vast sets of candidate interactions but also due to the stochastic nature of gene expression. We limit our analysis to nonlinear differential equation based inference methods. In order to avoid the computational cost of large-scale simulations, a two-step Gaussian process interpolation based gradient matching approach has been proposed to solve differential equations approximately. RESULTS: We apply a gradient matching inference approach to a large number of candidate models, including parametric differential equations or their corresponding non-parametric representations, we evaluate the network inference performance under various settings for different inference objectives. We use model averaging, based on the Bayesian Information Criterion (BIC), to combine the different inferences. The performance of different inference approaches is evaluated using area under the precision-recall curves. CONCLUSIONS: We found that parametric methods can provide comparable, and often improved inference compared to non-parametric methods; the latter, however, require no kinetic information and are computationally more efficient.
  • Item
    Thumbnail Image
    Transition state characteristics during cell differentiation
    Brackston, RD ; Lakatos, E ; Stumpf, MPH ; Maini, PK (PUBLIC LIBRARY SCIENCE, 2018-09)
    Models describing the process of stem-cell differentiation are plentiful, and may offer insights into the underlying mechanisms and experimentally observed behaviour. Waddington's epigenetic landscape has been providing a conceptual framework for differentiation processes since its inception. It also allows, however, for detailed mathematical and quantitative analyses, as the landscape can, at least in principle, be related to mathematical models of dynamical systems. Here we focus on a set of dynamical systems features that are intimately linked to cell differentiation, by considering exemplar dynamical models that capture important aspects of stem cell differentiation dynamics. These models allow us to map the paths that cells take through gene expression space as they move from one fate to another, e.g. from a stem-cell to a more specialized cell type. Our analysis highlights the role of the transition state (TS) that separates distinct cell fates, and how the nature of the TS changes as the underlying landscape changes-change that can be induced by e.g. cellular signaling. We demonstrate that models for stem cell differentiation may be interpreted in terms of either a static or transitory landscape. For the static case the TS represents a particular transcriptional profile that all cells approach during differentiation. Alternatively, the TS may refer to the commonly observed period of heterogeneity as cells undergo stochastic transitions.
  • Item
    Thumbnail Image
    Extracellular proteasome-osteopontin circuit regulates cell migration with implications in multiple sclerosis
    Dianzani, C ; Bellavista, E ; Liepe, J ; Verderio, C ; Martucci, M ; Santoro, A ; Chiocchetti, A ; Gigliotti, CL ; Boggio, E ; Ferrara, B ; Riganti, L ; Keller, C ; Janek, K ; Niewienda, A ; Fenoglio, C ; Sorosina, M ; Cantello, R ; Kloetzel, PM ; Stumpf, MPH ; Paul, F ; Ruprecht, K ; Galimberti, D ; Boneschi, FM ; Comi, C ; Dianzani, U ; Mishto, M (NATURE PORTFOLIO, 2017-03-09)
    Osteopontin is a pleiotropic cytokine that is involved in several diseases including multiple sclerosis. Secreted osteopontin is cleaved by few known proteases, modulating its pro-inflammatory activities. Here we show by in vitro experiments that secreted osteopontin can be processed by extracellular proteasomes, thereby producing fragments with novel chemotactic activity. Furthermore, osteopontin reduces the release of proteasomes in the extracellular space. The latter phenomenon seems to occur in vivo in multiple sclerosis, where it reflects the remission/relapse alternation. The extracellular proteasome-mediated inflammatory pathway may represent a general mechanism to control inflammation in inflammatory diseases.
  • Item
    Thumbnail Image
    From qualitative data to quantitative models: analysis of the phage shock protein stress response in Escherichia coli
    Toni, T ; Jovanovic, G ; Huvet, M ; Buck, M ; Stumpf, MPH (BMC, 2011-05-12)
    BACKGROUND: Bacteria have evolved a rich set of mechanisms for sensing and adapting to adverse conditions in their environment. These are crucial for their survival, which requires them to react to extracellular stresses such as heat shock, ethanol treatment or phage infection. Here we focus on studying the phage shock protein (Psp) stress response in Escherichia coli induced by a phage infection or other damage to the bacterial membrane. This system has not yet been theoretically modelled or analysed in silico. RESULTS: We develop a model of the Psp response system, and illustrate how such models can be constructed and analyzed in light of available sparse and qualitative information in order to generate novel biological hypotheses about their dynamical behaviour. We analyze this model using tools from Petri-net theory and study its dynamical range that is consistent with currently available knowledge by conditioning model parameters on the available data in an approximate Bayesian computation (ABC) framework. Within this ABC approach we analyze stochastic and deterministic dynamics. This analysis allows us to identify different types of behaviour and these mechanistic insights can in turn be used to design new, more detailed and time-resolved experiments. CONCLUSIONS: We have developed the first mechanistic model of the Psp response in E. coli. This model allows us to predict the possible qualitative stochastic and deterministic dynamic behaviours of key molecular players in the stress response. Our inferential approach can be applied to stress response and signalling systems more generally: in the ABC framework we can condition mathematical models on qualitative data in order to delimit e.g. parameter ranges or the qualitative system dynamics in light of available end-point or qualitative information.