Genetic basis of transcriptome diversity in Drosophila melanogaster

Published on Nov 3, 2015 in Proceedings of the National Academy of Sciences of the United States of America 9.58
DOI: 10.1073/pnas.1519159112
Wen Huang (NCSU: North Carolina State University), Estimated H-index: 26
Mary Anna Carbone (NCSU: North Carolina State University), Estimated H-index: 22
+ 5 Authors
Trudy F. C. Mackay (NCSU: North Carolina State University), Estimated H-index: 67
Understanding how DNA sequence variation is translated into variation for complex phenotypes has remained elusive but is essential for predicting adaptive evolution, for selecting agriculturally important animals and crops, and for personalized medicine. Gene expression may provide a link between variation in DNA sequence and organismal phenotypes, and its abundance can be measured efficiently and accurately. Here we quantified genome-wide variation in gene expression in the sequenced inbred lines of the Drosophila melanogaster Genetic Reference Panel (DGRP), increasing the annotated Drosophila transcriptome by 11%, including thousands of novel transcribed regions (NTRs). We found that 42% of the Drosophila transcriptome is genetically variable in males and females, including the NTRs, and is organized into modules of genetically correlated transcripts. We found that NTRs often were negatively correlated with the expression of protein-coding genes, which we exploited to annotate NTRs functionally. We identified regulatory variants for the mean and variance of gene expression, which have largely independent genetic control. Expression quantitative trait loci (eQTLs) for the mean, but not for the variance, of gene expression were concentrated near genes. Notably, the variance eQTLs often interacted epistatically with local variants in these genes to regulate gene expression. This comprehensive characterization of population-scale diversity of transcriptomes and its genetic basis in the DGRP is critically important for a systems understanding of quantitative trait variation.
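The distinction the abstract draws between variants that affect the mean and variants that affect the variance of expression can be illustrated with a small sketch. This is not the authors' pipeline: the function names and the toy numbers are hypothetical, and the Brown-Forsythe statistic is swapped in here only as one common way to compare variances between groups, assuming two homozygous genotype classes as is typical for inbred lines.

```python
from statistics import mean, median


def mean_effect(ref_expr, alt_expr):
    """Difference in mean expression between two genotype classes."""
    return mean(alt_expr) - mean(ref_expr)


def brown_forsythe(groups):
    """Brown-Forsythe statistic for unequal variances.

    Computes a one-way ANOVA F ratio on the absolute deviations of each
    observation from its own group's median.
    """
    # Absolute deviation of every observation from its group median.
    z = [[abs(x - median(g)) for x in g] for g in groups]
    n = sum(len(g) for g in z)
    k = len(z)
    grand = sum(sum(g) for g in z) / n
    # Between-group and within-group mean squares of the deviations.
    between = sum(len(g) * (mean(g) - grand) ** 2 for g in z) / (k - 1)
    within = sum((x - mean(g)) ** 2 for g in z for x in g) / (n - k)
    return between / within


# Made-up expression values for one transcript in lines carrying the
# reference or the alternate allele at a single variant.
ref = [4.8, 5.0, 5.2, 4.9, 5.1]
alt = [3.0, 7.0, 5.0, 2.5, 7.5]

print("mean effect:", mean_effect(ref, alt))
print("Brown-Forsythe F:", brown_forsythe([ref, alt]))
```

In this toy example the two genotype classes have nearly identical means but very different spreads, so a test on the mean alone would miss the variant while a variance test would flag it.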
  • References (57)
  • Citations (59)
#1 Fabio Morgante, H-Index: 3
#2 Peter Sørensen, H-Index: 31
Last. Trudy F. C. Mackay, H-Index: 67
view all 5 authors...
Individuals of the same genotype do not have the same phenotype for quantitative traits when reared under common macro-environmental conditions, a phenomenon called micro-environmental plasticity. Genetic variation in micro-environmental plasticity is assumed in models of the evolution of phenotypic variance, and is important in applied breeding and personalized medicine. Here, we quantified genetic variation for micro-environmental plasticity for three quantitative traits in the inbred, sequenc...
27 Citations
#1 Wen Huang (NCSU: North Carolina State University), H-Index: 26
#2 Andreas Massouras (EPFL: École Polytechnique Fédérale de Lausanne), H-Index: 7
Last. Trudy F. C. Mackay (NCSU: North Carolina State University), H-Index: 67
view all 51 authors...
The Drosophila melanogaster Genetic Reference Panel (DGRP) is a community resource of 205 sequenced inbred lines, derived to improve our understanding of the effects of naturally occurring genetic variation on molecular and organismal phenotypes. We used an integrated genotyping strategy to identify 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants. Our molecular population genomic analyses show higher deletion than insertion mutation rates and stronger purifying se...
297 Citations
#1 Andrew Anand Brown (University of Oslo), H-Index: 27
#2 Alfonso Buil (Swiss Institute of Bioinformatics), H-Index: 31
Last. Richard Durbin (Wellcome Trust Sanger Institute), H-Index: 110
view all 10 authors...
Every person has two copies of each gene: one is inherited from their mother and the other from their father. These two copies are often not identical because there can be many different variants of the same gene in the human population. Traits (such as height, body mass and risk of disease) vary from one person to the next—and for many traits this variation depends in part on the different gene variants that each person has inherited. Studies seeking to find the differences in DNA that can pred...
88 Citations
The role of epistasis in quantitative traits is controversial. This Review argues that findings from studies of these traits are consistent with pervasive epistasis and discusses how experimental designs in model organisms are used to uncover the underlying genetic interactions.
378 Citations
#1 Ronald M. Nelson (SLU: Swedish University of Agricultural Sciences), H-Index: 8
#2 Mats E. Pettersson, H-Index: 5
Last. Örjan Carlborg, H-Index: 36
view all 4 authors...
Here, we describe the results from the first variance heterogeneity Genome Wide Association Study (VGWAS) on yeast expression data. Using this forward genetics approach, we show that the genetic regulation of gene-expression in the budding yeast, Saccharomyces cerevisiae, includes mechanisms that can lead to variance heterogeneity in the expression between genotypes. Additionally, we performed a mean effect association study (GWAS). Comparing the mean and variance heterogeneity analyses, we find...
21 Citations
#1 Christopher D. Brown (UPenn: University of Pennsylvania), H-Index: 30
#2 Lara M. Mangravite (Sage Bionetworks), H-Index: 33
Last. Barbara E. Engelhardt (Duke University), H-Index: 23
view all 3 authors...
Genetic variants in cis-regulatory elements or trans-acting regulators frequently influence the quantity and spatiotemporal distribution of gene transcription. Recent interest in expression quantitative trait locus (eQTL) mapping has paralleled the adoption of genome-wide association studies (GWAS) for the analysis of complex traits and disease in humans. Under the hypothesis that many GWAS associations tag non-coding SNPs with small effects, and that these SNPs exert phenotypic control by modif...
113 Citations
#1 Shilpa Swarup (NCSU: North Carolina State University), H-Index: 3
#2 Wen Huang (NCSU: North Carolina State University), H-Index: 26
Last. Robert R. H. Anholt (NCSU: North Carolina State University), H-Index: 46
view all 4 authors...
Understanding the relationship between genetic variation and phenotypic variation for quantitative traits is necessary for predicting responses to natural and artificial selection and disease risk in human populations, but is challenging because of large sample sizes required to detect and validate loci with small effects. Here, we used the inbred, sequenced, wild-derived lines of the Drosophila melanogaster Genetic Reference Panel (DGRP) to perform three complementary genome-wide association (G...
58 Citations
#1 Amanda M. Hulse (A&M: Texas A&M University), H-Index: 2
#2 James J. Cai (A&M: Texas A&M University), H-Index: 22
Expression quantitative trait loci (eQTL) studies have established convincing relationships between genetic variants and gene expression. Most of these studies focused on the mean of gene expression level, but not the variance of gene expression level (i.e., gene expression variability). In the present study, we systematically explore genome-wide association between genetic variants and gene expression variability in humans. We adapt the double generalized linear model (dglm) to simultaneously f...
69 Citations
#1 Jeannie T. Lee (HHMI: Howard Hughes Medical Institute), H-Index: 67
Recent studies show that transcription of the mammalian genome is not only pervasive but also enormously complex. It is estimated that an average of 10 transcription units, the vast majority of which make long noncoding RNAs (lncRNAs), may overlap each traditional coding gene. These lncRNAs include not only antisense, intronic, and intergenic transcripts but also pseudogenes and retrotransposons. Do they universally have function, or are they merely transcriptional by-products of conventional co...
684 Citations
#1 Andreas Massouras (EPFL: École Polytechnique Fédérale de Lausanne), H-Index: 7
#2 Sebastian M. Waszak (EPFL: École Polytechnique Fédérale de Lausanne), H-Index: 22
Last. Bart Deplancke (EPFL: École Polytechnique Fédérale de Lausanne), H-Index: 42
view all 11 authors...
Understanding the relationship between genetic and phenotypic variation is one of the great outstanding challenges in biology. To meet this challenge, comprehensive genomic variation maps of human as well as of model organism populations are required. Here, we present a nucleotide resolution catalog of single-nucleotide, multi-nucleotide, and structural variants in 39 Drosophila melanogaster Genetic Reference Panel inbred lines. Using an integrative, local assembly-based approach for variant dis...
84 Citations
Cited By (59)
#1 Wen Huang (NCSU: North Carolina State University), H-Index: 26
#2 Mary Anna Carbone (NCSU: North Carolina State University), H-Index: 22
Last. Trudy F. C. Mackay (NCSU: North Carolina State University), H-Index: 67
view all 5 authors...
The genetics of phenotypic responses to changing environments remains elusive. Using whole genome quantitative gene expression as a model, we studied how the genetic architecture of regulatory variation in gene expression changed in a population of fully sequenced inbred Drosophila melanogaster strains when flies developed at different environments (25 °C and 18 °C). We found a substantial fraction of the transcriptome exhibited genotype by environment interaction, implicating en...
#1 Palle Duun Rohde (AAU: Aalborg University)
#2 Torsten Nygaard Kristensen (AAU: Aalborg University), H-Index: 40
Last. Anders Malmendal (RU: Roskilde University)
view all 5 authors...
Understanding the genotype-phenotype map and how variation at different levels of biological organization are associated are central topics in modern biology. Fast developments in sequencing technologies and other molecular omic tools enable researchers to obtain detailed information on variation at DNA level and on intermediate endophenotypes, such as RNA, proteins, metabolites. This facilitates our understanding of the link between genotypes and molecular and functional organismal phenotypes...
#1 Adam N. Spierer (Brown University), H-Index: 1
#2 Jim A. Mossman (Brown University), H-Index: 10
Last. David M. Rand (Brown University), H-Index: 42
view all 6 authors...
The winged insects of the order Diptera are colloquially named for their most recognizable phenotype: flight. These insects rely on flight for a number of important life history traits, like dispersal, foraging, and courtship. Despite the importance of flight, relatively little is known about the genetic architecture of variation for flight performance. Accordingly, we sought to uncover the genetic modifiers of flight using a measure of flies’ reaction and response to an abrupt drop in a vertica...
#1 Rebecca A. S. Palu (UofU: University of Utah), H-Index: 3
#2 Hans M. Dalton (UofU: University of Utah), H-Index: 1
Last. Clement Y. Chow (UofU: University of Utah), H-Index: 16
view all 3 authors...
Endoplasmic reticulum (ER) stress-induced apoptosis is a primary cause and modifier of degeneration in a number of genetic disorders. Understanding how genetic variation influences the ER stress response and subsequent activation of apoptosis could improve individualized therapies and predictions of outcomes for patients. In this study, we find that the uncharacterized, membrane-bound metallopeptidase CG14516 in Drosophila melanogaster, which we rename as SUPpressor of ER stress-induced DEATH (s...
1 Citations
#1 Logan J. Everett (NCSU: North Carolina State University), H-Index: 3
#2 Wen Huang (NCSU: North Carolina State University), H-Index: 26
Last. Genevieve St. Armour (NCSU: North Carolina State University), H-Index: 3
view all 13 authors...
A major challenge in modern biology is to understand how naturally occurring variation in DNA sequences affects complex organismal traits through networks of intermediate molecular phenotypes. This question is best addressed in a genetic mapping population in which all molecular polymorphisms are known and for which molecular endophenotypes and complex traits are assessed on the same genotypes. Here, we performed deep RNA sequencing of 200 Drosophila Genetic Reference Panel inbred lines with com...
3 Citations
#1 Christopher E. Ellison (RU: Rutgers University), H-Index: 15
#2 Meenakshi S Kagda (RU: Rutgers University)
Last. Weihuan Cao (RU: Rutgers University), H-Index: 2
view all 3 authors...
Co-evolution between transposable elements (TEs) and their hosts can be antagonistic, where TEs evolve to avoid silencing and the host responds by reestablishing TE suppression, or mutualistic, where TEs are co-opted to benefit their host. The TART-A TE functions as an important component of Drosophila telomeres, but has also reportedly inserted into the D. melanogaster nuclear export factor gene nxf2. We find that, rather than inserting into nxf2, TART-A has actually captured a portion of nxf...
#1 Annie Yim (MPG: Max Planck Society), H-Index: 3
#2 Prasanna Koti (MPG: Max Planck Society), H-Index: 2
Last. Bianca Habermann (AMU: Aix-Marseille University), H-Index: 50
view all 12 authors...
Mitochondria participate in metabolism and signaling. They adapt to the requirements of various cell types. Publicly available expression data permit to study expression dynamics of genes with mitochondrial function (mito-genes) in various cell types, conditions and organisms. Yet, we lack an easy way of extracting these data for mito-genes. Here, we introduce the visual data mining platform mitoXplorer, which integrates expression and mutation data of mito-genes with a manually curated mitochon...
1 Citations
#1 Michael V. Frochaux (EPFL: École Polytechnique Fédérale de Lausanne), H-Index: 3
#2 Maroun Bou Sleiman (EPFL: École Polytechnique Fédérale de Lausanne), H-Index: 8
Last. Bart Deplancke (EPFL: École Polytechnique Fédérale de Lausanne), H-Index: 42
view all 10 authors...
Background: Resistance to enteric pathogens is a complex trait at the crossroads of multiple biological processes. We have previously shown in the Drosophila Genetic Reference Panel (DGRP) that resistance to infection is highly heritable, but our understanding of how the effects of genetic variants affect different molecular mechanisms to determine gut immunocompetence is still limited.
1 Citations
#1 Christopher E. Ellison (RU: Rutgers University), H-Index: 15
#2 Weihuan Cao (RU: Rutgers University), H-Index: 2
Illumina sequencing has allowed for population-level surveys of transposable element (TE) polymorphism via split alignment approaches, which has provided important insight into the population dynamics of TEs. However, such approaches are not able to identify insertions of uncharacterized TEs, nor can they assemble the full sequence of inserted elements. Here, we use nanopore sequencing and Hi-C scaffolding to produce de novo genome assemblies for two wild strains of Drosophila melanogaster from ...
4 Citations
#1 Gregory L. Engel, H-Index: 1
#2 Kreager Taber (Middlebury College), H-Index: 1
Last. Amanda Crocker (Middlebury College), H-Index: 13
view all 4 authors...
Our understanding of the networks of genes and protein functions involved in Alcohol Use Disorder (AUD) remains incomplete, as do the mechanisms by which these networks lead to AUD phenotypes. The fruit fly (Drosophila melanogaster) is an efficient model for functional and mechanistic characterization of the genes involved in alcohol behavior. The fly offers many advantages as a model organism for investigating the molecular and cellular mechanisms of alcohol-related behaviors, and for understan...