Poster Presentation 40th Annual Lorne Genome Conference 2019

Splicing conservation signals in plant long non-coding RNAs (#153)

Selene L Fernandez-Valverde 1,2, Jose Antonio Corona-Gómez 1, Irving J García-López 1, Peter Stadler 3,4,5
  1. Unidad de Genomica Avanzada, Langebio, CINVESTAV, Irapuato, Guanajuato, Mexico
  2. CONACYT, Consejo Nacional de Ciencia y Tecnología, Mexico City, Mexico
  3. Department of Computer Science, Bioinformatics Group, Interdisciplinary Center for Bioinformatics, University of Leipzig, Leipzig, Germany
  4. Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany
  5. Santa Fe Institute, Santa Fe, United States

Long non-coding RNAs (lncRNAs) have recently emerged as prominent regulators of gene expression in eukaryotes. Longer than 200 nt and with little to no protein-coding potential, lncRNAs often drive the modification and maintenance of gene activation or gene silencing states via chromatin conformation rearrangements. In plants, lncRNAs have been shown to participate in gene regulation, and are essential to processes such as vernalization (Csorba et al. 2014) and photomorphogenesis (Wang et al. 2014). Despite their prominent functions, only slightly more than a dozen lncRNAs have been experimentally and functionally characterised.

Little is known about the evolutionary patterns of lncRNAs in plants. The rates of divergence are much higher in lncRNAs than in protein-coding mRNAs, making it difficult to identify lncRNA conservation using traditional sequence comparison methods. One of the few studies that have tried to address this found only 4 lncRNAs with positional conservation and 15 conserved at the sequence level in Brassicaceae (Mohammadin et al. 2015).

Here, we characterised the splicing conservation of lncRNAs in Brassicaceae. We generated a whole-genome alignment of 16 Brassicaceae species and used it to identify syntenic lncRNA orthologues. Using a scoring system trained on transcriptomes from A. thaliana and B. oleracea, we identified splice sites across the whole alignment and measured their conservation. Our analysis revealed that 38% of all intergenic lncRNAs (~900) display splicing conservation in at least one exon, a proportion substantially higher than previous estimates of lncRNA conservation in this group. Our findings agree with similar studies in vertebrates (Nitsche et al. 2015), suggesting that splicing conservation can be evidence of stabilising selection and thus used to identify functional lncRNAs in plants.
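As a rough illustration of what measuring splice-site conservation across an alignment can look like, the sketch below counts how many aligned species carry a canonical splice dinucleotide (GT donor, AG acceptor) at a given alignment column. This is not the trained scoring system described above: the canonical-dinucleotide check is a deliberately simple stand-in, and the function names, the toy sequences, the species label `species_3` and the 0.5 threshold are all hypothetical.

```python
from typing import Dict, Iterable, Tuple

# Canonical intron-boundary dinucleotides (an illustrative simplification;
# a trained splice-site score would replace this exact-match test).
CANONICAL = {"donor": "GT", "acceptor": "AG"}


def site_conservation(alignment: Dict[str, str], column: int, kind: str) -> float:
    """Fraction of aligned species with the canonical dinucleotide at `column`.

    `alignment` maps a species name to its gapped, aligned sequence.
    `column` is the 0-based alignment column where the dinucleotide starts.
    """
    if not alignment:
        raise ValueError("alignment must contain at least one sequence")
    motif = CANONICAL[kind]
    hits = sum(
        1 for seq in alignment.values()
        if seq[column:column + 2].upper() == motif
    )
    return hits / len(alignment)


def has_conserved_splice_site(
    alignment: Dict[str, str],
    sites: Iterable[Tuple[int, str]],
    min_fraction: float = 0.5,  # hypothetical threshold
) -> bool:
    """True if any (column, kind) site reaches the conservation threshold."""
    return any(
        site_conservation(alignment, col, kind) >= min_fraction
        for col, kind in sites
    )


# Toy three-species alignment (made-up sequences, not real data).
toy = {
    "A_thaliana": "ACGGTAAGT",
    "B_oleracea": "ACGGTGAGT",
    "species_3":  "ACG--AAGT",
}

donor_fraction = site_conservation(toy, 3, "donor")
acceptor_fraction = site_conservation(toy, 6, "acceptor")
```

In this toy case the donor at column 3 is present in two of the three sequences because the third has a gap there, while the acceptor at column 6 is present in all three. A real analysis would work on whole-genome alignment blocks and score sites with a trained model rather than an exact motif match.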

  1. Nitsche A, Rose D, Fasold M, Reiche K, Stadler PF: Comparison of splice sites reveals that long noncoding RNAs are evolutionarily well conserved. RNA 2015.
  2. Mohammadin S, Edger PP, Pires JC, Schranz ME: Positionally-conserved but sequence-diverged: identification of long non-coding RNAs in the Brassicaceae and Cleomaceae. BMC Plant Biology 2015.
  3. Wang Y, Fan X, Lin F, He G, Terzaghi W, Zhu D, et al.: Arabidopsis noncoding RNA mediates control of photomorphogenesis by red light. Proc Natl Acad Sci USA 2014.
  4. Csorba T, Questa JI, Sun Q, Dean C: Antisense COOLAIR mediates the coordinated switching of chromatin states at FLC during vernalization. Proc Natl Acad Sci USA 2014.