A physical, genetic and functional sequence assembly of the barley genome

Published on Nov 29, 2012 in Nature (43.07)
DOI: 10.1038/nature11543
Klaus F. X. Mayer (Estimated H-index: 69)
Robbie Waugh (Estimated H-index: 67; James Hutton Institute)
+ 69 Authors
Nils Stein (Estimated H-index: 63; Leibniz Association)
Barley (Hordeum vulgare L.) is among the world's earliest domesticated and most important crop plants. It is diploid with a large haploid genome of 5.1 gigabases (Gb). Here we present an integrated and ordered physical, genetic and functional sequence resource that describes the barley gene-space in a structured whole-genome context. We developed a physical map of 4.98 Gb, with more than 3.90 Gb anchored to a high-resolution genetic map. Projecting a deep whole-genome shotgun assembly, complementary DNA and deep RNA sequence data onto this framework supports 79,379 transcript clusters, including 26,159 'high-confidence' genes with homology support from other plant genomes. Abundant alternative splicing, premature termination codons and novel transcriptionally active regions suggest that post-transcriptional processing forms an important regulatory layer. Survey sequences from diverse accessions reveal a landscape of extensive single-nucleotide variation. Our data provide a platform for both genome-assisted research and enabling contemporary crop improvement.
References (63)
#1 Nina Riehs-Kearnan (Austrian Academy of Sciences), H-Index: 2
#2 Jiradet Gloggnitzer (Austrian Academy of Sciences), H-Index: 4
Last: Karel Riha (Austrian Academy of Sciences), H-Index: 25
(5 authors in total)
Nonsense-mediated RNA decay (NMD) is an evolutionarily conserved RNA quality control mechanism that eliminates transcripts containing nonsense mutations. NMD has also been shown to affect the expression of numerous genes, and inactivation of this pathway is lethal in higher eukaryotes. However, despite relatively detailed knowledge of the molecular basis of NMD, our understanding of its physiological functions is still limited and the underlying causes of lethality are unknown. In this study, we...
71 Citations
#1 Cole Trapnell (Broad Institute), H-Index: 47
#2 Adam Roberts (University of California, Berkeley), H-Index: 22
Last: Lior Pachter (University of California, Berkeley), H-Index: 55
(10 authors in total)
Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identif...
6,139 Citations
#1 Maria Kalyna (University of Veterinary Medicine Vienna), H-Index: 21
#2 Craig G. Simpson (University of Veterinary Medicine Vienna), H-Index: 28
Last: John W. S. Brown (University of Veterinary Medicine Vienna), H-Index: 45
(13 authors in total)
Alternative splicing (AS) coupled to nonsense-mediated decay (NMD) is a post-transcriptional mechanism for regulating gene expression. We have used a high-resolution AS RT–PCR panel to identify endogenous AS isoforms which increase in abundance when NMD is impaired in the Arabidopsis NMD factor mutants, upf1-5 and upf3-1. Of 270 AS genes (950 transcripts) on the panel, 102 transcripts from 97 genes (32%) were identified as NMD targets. Extrapolating from these data around 13% of intron-containing...
238 Citations
#1 Jesse Poland (KSU: Kansas State University), H-Index: 41
#2 Patrick J. Brown (UIUC: University of Illinois at Urbana–Champaign), H-Index: 24
Last: Jean-Luc Jannink (Cornell University), H-Index: 51
(4 authors in total)
Advancements in next-generation sequencing technology have enabled whole genome re-sequencing in many species providing unprecedented discovery and characterization of molecular polymorphisms. There are limitations, however, to next-generation sequencing approaches for species with large complex genomes such as barley and wheat. Genotyping-by-sequencing (GBS) has been developed as a tool for association studies and genomics-assisted breeding in a range of species including those with complex gen...
807 Citations
Climate change is a major environmental stress threatening biodiversity and human civilization. The best hope to secure staple food for humans and animal feed by future crop improvement depends on wild progenitors. We examined 10 wild emmer wheat (Triticum dicoccoides Koern.) populations and 10 wild barley (Hordeum spontaneum K. Koch) populations in Israel, sampling them in 1980 and again in 2008, and performed phenotypic and genotypic analyses on the collected samples. We witnessed the profound...
102 Citations
#1 Samantha Rayson (University of Leeds), H-Index: 3
#2 Luis Arciga-Reyes (University of Leeds), H-Index: 2
Last: Brendan Davies (University of Leeds), H-Index: 31
(8 authors in total)
Nonsense-mediated mRNA decay (NMD) is a conserved mechanism that targets aberrant mRNAs for destruction. NMD has also been found to regulate the expression of large numbers of genes in diverse organisms, although the biological role for this is unclear and few evolutionarily conserved targets have been identified. Expression analyses of three Arabidopsis thaliana lines deficient in NMD reveal that the vast majority of NMD-targeted transcripts are associated with response to pathogens. Congruentl...
78 Citations
#1 Mitchell Guttman (MIT: Massachusetts Institute of Technology), H-Index: 41
#2 John L. Rinn (Broad Institute), H-Index: 76
It is clear that RNA has a diverse set of functions and is more than just a messenger between gene and protein. The mammalian genome is extensively transcribed, giving rise to thousands of non-coding transcripts. Whether all of these transcripts are functional is debated, but it is evident that there are many functional large non-coding RNAs (ncRNAs). Recent studies have begun to explore the functional diversity and mechanistic role of these large ncRNAs. Here we synthesize these studies to prov...
1,014 Citations
Recent genome-wide analyses revealed that eukaryotic genomes are almost entirely transcribed, generating a large number of short or long non-protein coding RNAs (non-coding RNAs; ncRNAs). Rapidly accumulating experimental evidence suggests that ncRNAs are not just transcriptional noise, but have biological roles in gene expression. In this review, we focus on the functions of nuclear-localized ncRNAs including the spliceosomal small nuclear RNAs. These nuclear ncRNAs play diverse regulatory role...
8 Citations
#1 Stefano Lonardi (UCR: University of California, Riverside), H-Index: 44
#2 Denisa Duma, H-Index: 4
Last: Timothy J. Close, H-Index: 68
(13 authors in total)
We propose a new sequencing protocol that combines recent advances in combinatorial pooling design and second-generation sequencing technology to efficiently approach de novo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when dealing with hundreds or thousands of DNA samples, such as genome-tiling gene-rich BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare ...
3 Citations
#1 Hee Jeong Jeong (KU: Korea University), H-Index: 4
#2 Y Jinkim (KU: Korea University), H-Index: 25
Last: Jeong Sheop Shin (KU: Korea University), H-Index: 27
(7 authors in total)
In Arabidopsis, the NMD-defective mutants upf1-5 and upf3-1 are characterized by dwarfism, curly leaves and late flowering. These phenotypes are similar to those of mutants showing constitutive pathogenesis-related (PR) gene expression, salicylic acid (SA) accumulation and, subsequently, resistance to pathogens. The disease symptoms of upf1-5 and upf3-1 mutants were observed following infection with the virulent pathogen Pst DC3000 with the aim of determining whether the loss of nonsense-mediated mRNA...
49 Citations
Cited By (941)
#1 Qi Shi (Ningxia University)
#2 Yueya Zhang (SJTU: Shanghai Jiao Tong University)
Last: Wenguo Cai (SJTU: Shanghai Jiao Tong University)
(6 authors in total)
Aux/IAA genes are early auxin-responsive genes and essential for auxin signaling transduction. There is little information about Aux/IAAs in the agriculturally important cereal, barley. Using in silico method, we identified and subsequently characterized 36 Aux/IAAs from the barley genome. Based on their genomic sequences and the phylogenic relationship with Arabidopsis and rice Aux/IAA, the 36 HvIAAs were categorized into two major groups and 14 subgroups. The indication of the presence or abse...
Cultivated grasses are an important source of food for domestic animals worldwide. Increased knowledge of their genomes can speed up the development of new cultivars with better quality and greater resistance to biotic and abiotic stresses. The most widely grown grasses are tetraploid ryegrass species (Lolium) and diploid and hexaploid fescue species (Festuca). In this work, we characterized repetitive DNA sequences and their contribution to genome size in five fescue and two ryegrass species as...
#1 Gianluca Bretani (University of Milan), H-Index: 1
#2 Laura Rossini (University of Milan), H-Index: 26
Last: A. Fricano, H-Index: 6
(9 authors in total)
Copy number variants (CNVs) are pervasive in several animal and plant genomes and contribute to shaping genetic diversity. In barley, there is evidence that changes in gene copy number underlie important agronomic traits. The recently released reference sequence of barley represents a valuable genomic resource for unveiling the incidence of CNVs that affect gene content and identifying sequence features associated with CNV formation. Using exome sequencing and read count data, we detected 16,605...
#1 Shyam Solanki (NDSU: North Dakota State University), H-Index: 1
#2 Gazala Ameen (NDSU: North Dakota State University), H-Index: 1
Last: Robert Brueggeman (NDSU: North Dakota State University), H-Index: 21
(6 authors in total)
In situ analysis of biomarkers such as DNA, RNA and proteins are important for research and diagnostic purposes. At the RNA level, plant gene expression studies rely on qPCR, RNAseq and probe-based in situ hybridization (ISH). However, for ISH experiments poor stability of RNA and RNA based probes commonly results in poor detection or poor reproducibility. Recently, the development and availability of the RNAscope RNA-ISH method addressed these problems by novel signal amplification and backgrou...
1 Citation
#1 Vinh-Trieu To (SJTU: Shanghai Jiao Tong University)
#2 Qi Shi (SJTU: Shanghai Jiao Tong University)
Last: Wenguo Cai (SJTU: Shanghai Jiao Tong University)
(7 authors in total)
The GRAS (named after the first three identified proteins within this family, GAI, RGA, and SCR) family contains plant-specific genes encoding transcriptional regulators that play a key role in gibberellin (GA) signaling, which regulates plant growth and development. Even though GRAS genes have been characterized in some plant species, little is known about the GRAS genes in barley (Hordeum vulgare L.). In this study, we observed 62 GRAS members from the barley genome, which were grouped into 1...
#1 William Ho (University of Melbourne), H-Index: 26
#2 Camilla B. Hill (University of Melbourne), H-Index: 12
Last: Ute Roessner (University of Melbourne), H-Index: 38
(8 authors in total)
Mechanisms underlying rootzone-localised responses to salinity during early stage of barley development remain elusive. Here, we detected the multi-root-omes (transcriptomes, metabolomes, lipidomes) of a domesticated barley cultivar (Clipper) and a landrace (Sahara) which maintain and restrict seedling root growth under salt stress, respectively. Novel generalized linear models were designed to determine differentially expressed genes (DEG) and abundant metabolites (DAM) specific to sa...
2 Citations
#1 Richard Ruey-Chyi Wang (USDA: United States Department of Agriculture)
#2 Xingfeng Li (SDAU: Shandong Agricultural University), H-Index: 2
Last: Aaron J. Thomas (USU: Utah State University), H-Index: 4
(7 authors in total)
Bluebunch wheatgrass (referred to as BBWG) [Pseudoroegneria spicata (Pursh) A. Love] is an important rangeland Triticeae grass used for forage, conservation, and restoration. This diploid has the basic St genome that occurs also in many polyploid Triticeae species, which serve as a gene reservoir for wheat improvement. Until now, the St genome in diploid Pseudoroegneria species has not been mapped. Using a double-cross mapping population, we mapped 230 expressed sequence tag derived simple seq...
#1 Xingquan Zeng, H-Index: 6
#2 Tong Xu, H-Index: 1
Last: Yuzhen Basang, H-Index: 2
(14 authors in total)
Hulless barley (Hordeum vulgare L. var. nudum) is a barley variety that has loose husk cover of the caryopses. Because of the ease in processing and edibility, hulless barley has been locally cultivated and used as human food. For example, in Tibetan Plateau, hulless barley is the staple food for human and essential livestock feed. Although the draft genome of hulless barley has been sequenced, the assembly remains fragmented. Here, we reported an improved high-quality assembly and annotation of...
#1 Malika Ourari (University of Béjaïa)
#2 Olivier Coriton (University of Rennes), H-Index: 20
Last: Abdelkader Aïnouche (University of Rennes), H-Index: 17
(8 authors in total)
We explored diversity, distribution and evolutionary dynamics of Ty1-Copia retrotransposons in the genomes of the Hordeum murinum polyploid complex and related taxa. Phylogenetic and fluorescent in situ hybridization (FISH) analyses of reverse transcriptase sequences identified four Copia families in these genomes: the predominant BARE1 (including three groups or subfamilies, A, B and C), and the less represented RIRE1, IKYA and TAR-1. Within the BARE1 family, BARE1-A elements and a subgroup of ...
#1 Kevin Magne (Université Paris-Saclay), H-Index: 2
#2 Shengbin Liu (Université Paris-Saclay)
Last: Pascal Ratet (Université Paris-Saclay), H-Index: 41
(8 authors in total)
In cultivated grasses, tillering, spike architecture and seed shattering represent major agronomical traits. In barley, maize and rice, the NOOT-BOP-COCH-LIKE (NBCL) genes play important roles in development, especially in ligule development, tillering and flower identity. However, compared with dicots, the role of grass NBCL genes is underinvestigated. To better understand the role of grass NBCLs and to overcome any effects of domestication that might conceal their original functions, we studie...