School of Agriculture, Food and Ecosystem Sciences - Research Publications

Permanent URI for this collection

Search Results

Now showing 1 - 10 of 105
  • Item
    No Preview Available
    Allele specific binding of histone modifications and a transcription factor does not predict allele specific expression in correlated ChIP-seq peak-exon pairs.
    Prowse-Wilkins, CP ; Wang, J ; Garner, JB ; Goddard, ME ; Chamberlain, AJ (Springer Science and Business Media LLC, 2023-09-20)
    Allele specific expression (ASE) is widespread in many species including cows. Therefore, regulatory regions which control gene expression should show cis-regulatory variation which mirrors this differential expression within the animal. ChIP-seq peaks for histone modifications and transcription factors measure activity at functional regions and the height of some peaks have been shown to correlate across tissues with the expression of particular genes, suggesting these peaks are putative regulatory regions. In this study we identified ASE in the bovine genome in multiple tissues and investigated whether ChIP-seq peaks for four histone modifications and the transcription factor CTCF show allele specific binding (ASB) differences in the same tissues. We then investigate whether peak height and gene expression, which correlates across tissues, also correlates within the animal by investigating whether the direction of ASB in putative regulatory regions, mirrors that of the ASE in the genes they are putatively regulating. We found that ASE and ASB were widespread in the bovine genome but vary in extent between tissues. However, even when the height of a peak was positively correlated across tissues with expression of an exon, ASE of the exon and ASB of the peak were in the same direction only half the time. A likely explanation for this finding is that the correlations between peak height and exon expression do not indicate that the height of the peak causes the extent of exon expression, at least in some cases.
  • Item
    No Preview Available
    Gene expression and RNA splicing explain large proportions of the heritability for complex traits in cattle
    Xiang, R ; Fang, L ; Liu, S ; Macleod, IM ; Liu, Z ; Breen, EJ ; Gao, Y ; Liu, GE ; CattleGTEx Consortium, BA ; Mason, BA ; Chamberlain, AJ ; Wray, NR ; Goddard, ME (ELSEVIER, 2023-10-11)
    Many quantitative trait loci (QTLs) are in non-coding regions. Therefore, QTLs are assumed to affect gene regulation. Gene expression and RNA splicing are primary steps of transcription, so DNA variants changing gene expression (eVariants) or RNA splicing (sVariants) are expected to significantly affect phenotypes. We quantify the contribution of eVariants and sVariants detected from 16 tissues (n = 4,725) to 37 traits of ∼120,000 cattle (average magnitude of genetic correlation between traits = 0.13). Analyzed in Bayesian mixture models, averaged across 37 traits, cis and trans eVariants and sVariants detected from 16 tissues jointly explain 69.2% (SE = 0.5%) of heritability, 44% more than expected from the same number of random variants. This 69.2% includes an average of 24% from trans e-/sVariants (14% more than expected). Averaged across 56 lipidomic traits, multi-tissue cis and trans e-/sVariants also explain 71.5% (SE = 0.3%) of heritability, demonstrating the essential role of proximal and distal regulatory variants in shaping mammalian phenotypes.
  • Item
    Thumbnail Image
    Erratum to "Evaluation of updated Feed Saved breeding values developed in Australian Holstein dairy cattle" (JDS Commun. 3:114-119).
    Bolormaa, S ; MacLeod, IM ; Khansefid, M ; Marett, LC ; Wales, WJ ; Nieuwhof, GJ ; Baes, CF ; Schenkel, FS ; Goddard, ME ; Pryce, JE (American Dairy Science Association, 2022-09)
    [This corrects the article DOI: 10.3168/jdsc.2021-0150.].
  • Item
    Thumbnail Image
    Additive Genetic Variation in Schizophrenia Risk Is Shared by Populations of African and European Descent
    de Candia, TR ; Lee, SH ; Yang, J ; Browning, BL ; Gejman, PV ; Levinson, DF ; Mowry, BJ ; Hewitt, JK ; Goddard, ME ; O'Donovan, MC ; Purcell, SM ; Posthuma, D ; Visscher, PM ; Wray, NR ; Keller, MC (CELL PRESS, 2013-09-05)
    To investigate the extent to which the proportion of schizophrenia's additive genetic variation tagged by SNPs is shared by populations of European and African descent, we analyzed the largest combined African descent (AD [n = 2,142]) and European descent (ED [n = 4,990]) schizophrenia case-control genome-wide association study (GWAS) data set available, the Molecular Genetics of Schizophrenia (MGS) data set. We show how a method that uses genomic similarities at measured SNPs to estimate the additive genetic correlation (SNP correlation [SNP-rg]) between traits can be extended to estimate SNP-rg for the same trait between ethnicities. We estimated SNP-rg for schizophrenia between the MGS ED and MGS AD samples to be 0.66 (SE = 0.23), which is significantly different from 0 (p(SNP-rg = 0) = 0.0003), but not 1 (p(SNP-rg = 1) = 0.26). We re-estimated SNP-rg between an independent ED data set (n = 6,665) and the MGS AD sample to be 0.61 (SE = 0.21, p(SNP-rg = 0) = 0.0003, p(SNP-rg = 1) = 0.16). These results suggest that many schizophrenia risk alleles are shared across ethnic groups and predate African-European divergence.
  • Item
    Thumbnail Image
    Genetic variation in histone modifications and gene expression identifies regulatory variants in the mammary gland of cattle
    Prowse-Wilkins, CP ; Lopdell, TJ ; Xiang, R ; Vander Jagt, CJ ; Littlejohn, MD ; Chamberlain, AJ ; Goddard, ME (BMC, 2022-12-08)
    BACKGROUND: Causal variants for complex traits, such as eQTL are often found in non-coding regions of the genome, where they are hypothesised to influence phenotypes by regulating gene expression. Many regulatory regions are marked by histone modifications, which can be assayed by chromatin immunoprecipitation followed by sequencing (ChIP-seq). Sequence reads from ChIP-seq form peaks at putative regulatory regions, which may reflect the amount of regulatory activity at this region. Therefore, eQTL which are also associated with differences in histone modifications are excellent candidate causal variants. RESULTS: We assayed the histone modifications H3K4Me3, H3K4Me1 and H3K27ac and mRNA in the mammary gland of up to 400 animals. We identified QTL for peak height (histone QTL), exon expression (eeQTL), allele specific expression (aseQTL) and allele specific binding (asbQTL). By intersecting these results, we identify variants which may influence gene expression by altering regulatory regions of the genome, and may be causal variants for other traits. Lastly, we find that these variants are found in putative transcription factor binding sites, identifying a mechanism for the effect of many eQTL. CONCLUSIONS: We find that allele specific and traditional QTL analysis often identify the same genetic variants and provide evidence that many eQTL are regulatory variants which alter activity at regulatory regions of the bovine genome. Our work provides methodological and biological updates on how regulatory mechanisms interplay at multi-omics levels.
  • Item
    Thumbnail Image
    Sharing of either phenotypes or genetic variants can increase the accuracy of genomic prediction of feed efficiency
    Bolormaa, S ; MacLeod, IM ; Khansefid, M ; Marett, LC ; Wales, WJ ; Miglior, F ; Baes, CF ; Schenkel, FS ; Connor, EE ; Manzanilla-Pech, CI ; Stothard, P ; Herman, E ; Nieuwhof, GJ ; Goddard, ME ; Pryce, JE (BMC, 2022-09-06)
    BACKGROUND: Sharing individual phenotype and genotype data between countries is complex and fraught with potential errors, while sharing summary statistics of genome-wide association studies (GWAS) is relatively straightforward, and thus would be especially useful for traits that are expensive or difficult-to-measure, such as feed efficiency. Here we examined: (1) the sharing of individual cow data from international partners; and (2) the use of sequence variants selected from GWAS of international cow data to evaluate the accuracy of genomic estimated breeding values (GEBV) for residual feed intake (RFI) in Australian cows. RESULTS: GEBV for RFI were estimated using genomic best linear unbiased prediction (GBLUP) with 50k or high-density single nucleotide polymorphisms (SNPs), from a training population of 3797 individuals in univariate to trivariate analyses where the three traits were RFI phenotypes calculated using 584 Australian lactating cows (AUSc), 824 growing heifers (AUSh), and 2526 international lactating cows (OVE). Accuracies of GEBV in AUSc were evaluated by either cohort-by-birth-year or fourfold random cross-validations. GEBV of AUSc were also predicted using only the AUS training population with a weighted genomic relationship matrix constructed with SNPs from the 50k array and sequence variants selected from a meta-GWAS that included only international datasets. The genomic heritabilities estimated using the AUSc, OVE and AUSh datasets were moderate, ranging from 0.20 to 0.36. The genetic correlations (rg) of traits between heifers and cows ranged from 0.30 to 0.95 but were associated with large standard errors. The mean accuracies of GEBV in Australian cows were up to 0.32 and almost doubled when either overseas cows, or both overseas cows and AUS heifers were included in the training population. They also increased when selected sequence variants were combined with 50k SNPs, but with a smaller relative increase. CONCLUSIONS: The accuracy of RFI GEBV increased when international data were used or when selected sequence variants were combined with 50k SNP array data. This suggests that if direct sharing of data is not feasible, a meta-analysis of summary GWAS statistics could provide selected SNPs for custom panels to use in genomic selection programs. However, since this finding is based on a small cross-validation study, confirmation through a larger study is recommended.
  • Item
    Thumbnail Image
    BayesR3 enables fast MCMC blocked processing for largescale multi-trait genomic prediction and QTN mapping analysis.
    Breen, EJ ; MacLeod, IM ; Ho, PN ; Haile-Mariam, M ; Pryce, JE ; Thomas, CD ; Daetwyler, HD ; Goddard, ME (Springer Science and Business Media LLC, 2022-07-05)
    Bayesian methods, such as BayesR, for predicting the genetic value or risk of individuals from their genotypes, such as Single Nucleotide Polymorphisms (SNP), are often implemented using a Markov Chain Monte Carlo (MCMC) process. However, the generation of Markov chains is computationally slow. We introduce a form of blocked Gibbs sampling for estimating SNP effects from Markov chains that greatly reduces computational time by sampling each SNP effect iteratively n-times from conditional block posteriors. Subsequent iteration over all blocks m-times produces chains of length m × n. We use this strategy to solve large-scale genomic prediction and fine-mapping problems using the Bayesian MCMC mixed-effects genetic model, BayesR3. We validate the method using simulated data, followed by analysis of empirical dairy cattle data using high dimension milk mid infra-red spectra data as an example of "omics" data and show its use to increase the precision of mapping variants affecting milk, fat, and protein yields relative to a univariate analysis of milk, fat, and protein.
  • Item
    Thumbnail Image
    MetaGS: an accurate method to impute and combine SNP effects across populations using summary statistics.
    Jighly, A ; Benhajali, H ; Liu, Z ; Goddard, ME (Springer Science and Business Media LLC, 2022-06-02)
    BACKGROUND: Meta-analysis describes a category of statistical methods that aim at combining the results of multiple studies to increase statistical power by exploiting summary statistics. Different industries that use genomic prediction do not share their raw data due to logistic or privacy restrictions, which can limit the size of their reference populations and creates a need for a practical meta-analysis method. RESULTS: We developed a meta-analysis, named MetaGS, that duplicates the results of multi-trait best linear unbiased prediction (mBLUP) analysis without accessing raw data. MetaGS exploits the correlations among different populations to produce more accurate population-specific single nucleotide polymorphism (SNP) effects. The method improves SNP effect estimations for a given population depending on its relations to other populations. MetaGS was tested on milk, fat and protein yield data of Australian Holstein and Jersey cattle and it generated very similar genomic estimated breeding values to those produced using the mBLUP method for all traits in both breeds. One of the major difficulties when combining SNP effects across populations is the use of different variants for the populations, which limits the applications of meta-analysis in practice. We solved this issue by developing a method to impute missing summary statistics without using raw data. Our results showed that imputing summary statistics can be done with high accuracy (r > 0.9) even when more than 70% of the SNPs were missing with a minimal effect on prediction accuracy. CONCLUSIONS: We demonstrated that MetaGS can replace the mBLUP model when raw data cannot be shared, which can lead to more flexible collaborations compared to the single-trait BLUP model.
  • Item
    No Preview Available
    Phantom epistasis between unlinked loci
    Hemani, G ; Powell, JE ; Wang, H ; Shakhbazov, K ; Westra, H-J ; Esko, T ; Henders, AK ; McRae, AF ; Martin, NG ; Metspalu, A ; Franke, L ; Montgomery, GW ; Goddard, ME ; Gibson, G ; Yang, J ; Visscher, PM (NATURE PORTFOLIO, 2021-08-12)
  • Item
    Thumbnail Image
    Author Correction: Assortative mating biases marker-based heritability estimators.
    Border, R ; O'Rourke, S ; de Candia, T ; Goddard, ME ; Visscher, PM ; Yengo, L ; Jones, M ; Keller, MC (Springer Science and Business Media LLC, 2022-04-01)