Naturally occurring variations in sequence length creates microRNA isoforms that differ in argonaute effector complex specificity
© Ebhardt et al; licensee BioMed Central Ltd. 2010
Received: 22 February 2010
Accepted: 9 June 2010
Published: 9 June 2010
Micro(mi)RNAs are short RNA sequences, ranging from 16 to 35 nucleotides (miRBase; http://www.mirbase.org). The majority of the identified sequences are 21 or 22 nucleotides in length. Despite the range of sequence lengths for different miRNAs, individual miRNAs were thought to have a specific sequence of a particular length. A recent report describing a longer variant of a previously identified miRNA in Arabidopsis thaliana prompted this investigation for variations in the length of other miRNAs.
In this paper, we demonstrate that a fifth of annotated A. thaliana miRNAs recorded in miRBase V.14 have stable miRNA isoforms that are one or two nucleotides longer than their respective recorded miRNA. Further, we demonstrate that miRNA isoforms are co-expressed and often show differential argonaute complex association. We postulate that these extensions are caused by differential cleavage of the parent precursor miRNA.
Our systematic analysis of A. thaliana miRNAs reveals that miRNA length isoforms are relatively common. This finding not only has implications for miRBase and miRNA annotation, but also extends to miRNA validation experiments and miRNA localization studies. Further, we predict that miRNA isoforms are present in other plant species also.
Micro(mi)RNAs are important for gene regulation  and for cell fate decisions during development . Aberrant levels of miRNAs are seen in various disease states [3–6]. miRNAs are transcribed from one strand of their genomic loci into a primary miRNA transcript, which folds into a characteristic bulge with stem-loop conformation . In plants, the primary transcript is cleaved by a Dicer-like (DCL) RNase III enzyme, DCL1, into an approximately 19 bp duplex with a two-nucleotide (nt) overhang at either end . Of the two strands forming the duplex, one strand, designated miRNA*, is typically degraded while the other is incorporated into the argonaute (AGO)-containing effector complex [9, 10]. Co-immunoprecipitation experiments demonstrate an enrichment of miRNAs in AGO1, whereas AGO2 shows depletion of miRNAs compared with non-immunoprecipitated samples .
The biological significance of sequence length heterogeneity has been recently identified for a mature miRNA in Arabidopsis thaliana, in which ath-MIR168 is processed as miRNAs of 21 and 22 nucleotides in length from its two genomic loci. Vaucheret demonstrated that reducing the amount of 21 nt miRNA greatly reduces homeostasis and leads to developmental defects of the plant, especially in environmentally challenging conditions . In general, it is appreciated that there is variation in the lengths of different miRNAs, as the mature miRNAs listed in miRBase http://www.mirbase.org/ are between 16 and 35 nucleotides in length . In miRBase V.14 there are 209 small RNA sequences identified for in A. thaliana, of which 7%, 79%, 11% and 3% are 20, 21, 22 and 24 nt in length, respectively. The reason and function for this heterogeneity is unclear and we are unaware of any systematic investigation into non-uniform length distributions of individual miRNAs. Each annotated miRNA in miRBase is a single defined sequence, and there are no details on the possibility of variable sequence length. Sequence length variation may have been overlooked previously, as small variations in the sequence length might not have been thought to alter the function of individual miRNAs, as they are directed to their targets by base pairing.
Recent reports show however, that alterations in miRNA length can potentially lead to dramatic effects on miRNA function in organisms such as A. thaliana, in which the identity of the first 5' nucleotide of the miRNA is the major determinant for AGO protein association [11, 14]. Sequence-specific AGO association has been characterized for most A. thaliana AGO complexes [11, 14, 15]. Of these, AGO1 is the major AGO in the pathway of miRNA post-transcriptional gene silencing [16–19], whereas AGO4 functions in repeat-associated silencing of RNA accumulation and in regulating loci- specific DNA methylation [20, 21].
To investigate the frequency with which additional nucleotides on the 5' ends of miRNAs are observed, we queried several published A. thaliana small RNA datasets collected by pyrophosphate and Solexa/Iillumina http://www.illumina.com sequencing techniques. The approach of analyzing small RNA sequencing datasets has previously proven successful for the identification of post-transcriptional modifications in small RNAs [22–25]. Using similar methods, we queried all of the annotated miRNAs from A. thaliana (miRBase V.14) for 5' extensions of one to three nucleotides based on nucleotides present in the pre-miRNA hairpin.
The datasets investigated were from three small RNA sequencing studies including a small RNA transcriptome that responds to changing phosphate levels , an RNA analysis of the dicer (DCL2/DCL3/DCL4) triple mutant , and a study on RNAs that are co-immunoprecipitated with different AGO proteins . In total, these datasets contained 51,907,309 redundant small RNA sequences.
MiRNAs with SNE
ath-MIR156h exists in two isoforms across seven datasets
MiRNAs with a double nucleotide extension
In our first example, we demonstrated that both ath-MIR156h and ath-MIR156h+1 coexist within the plant at constant ratios, with each miRNA isoforms showing preferential AGO association. A second class of miRNAs identified possess two additional 5' nucleotides. An example of this class is ath-MIR775, which exists as both ath-MIR775 and ath-MIR775+2; the latter has two additional 5' U nucleotides, with both of these nucleotides present in the pre-miRNA hairpin. The parental miRNA and ath-MIR775+2 were found at comparable frequencies in all the datasets (1858 and 1587 occurrences, respectively, in the AGO association database). Conversely, there was a negligible occurrence of the +1 miRNA. There are two possible explanations for this exclusive occurrence of ath-MIR775 and ath-MIR775+2. Cleavage events generating the mature miRNAs might generate the two variable length miRNAs forms (0 and +2) exclusively. Alternatively, all three lengths (0, +1 and +2) might be generated, but with only the 0 and +2 forms being stabilized and protected from degradation.
Cloning frequencies of ath-MIR775 sequences in the argonaute (AGO) association datasets
Not all miRNAs are heterogeneously processed
Heterogeneity in mature miRNA lengths is not the rule, as many do not exhibit detectable amounts of variable length processing. Examples include ath-MIR168b which was observed 86,634 times in the four different AGO association datasets, whereas the ath-MIR168b+1 sequence was observed only 34 times. This observed frequency is within the 3% insertion/deletion error rate of pyrophosphate sequencing . To date, there have been no detailed analyses by Illumina or Solexa sequencing of the frequency of insertion and deletion errors. The presence of both types of miRNAs (variable and homogeneous lengths) suggests that the variable lengths of some miRNAs are not simply the result of 'ragged end' processing of all miRNAs, but are a specific process for a subset of miRNAs.
Overall frequency of variable length miRNAs
In addition to our presented in silico data, previous work using a genetic approach also suggests co-existence of miRNA and miRNA+1 and the importance of their co-expression. A recent report described and confirmed the occurrence of a long (22 nt) form of ath-miR168 . In addition, experiments by Vaucheret and data from other studies also reveals evidence of long miRNA variants; for example, careful examination of previously published miRNA northern blots found the presence of double bands for some miRNAs, such as for ath-miR169, ath-miR156 and ath-miR172 .
We have presented evidence arising from several small RNA sequencing experiments that supports the co-existence of mature miRNAs and their 5' extended forms in A. thaliana. Our results expand the previous genetic evidence of variability in miRNA sequence length  by revealing that nearly a fifth of miRNAs identified in A. thaliana have additional nucleotide(s) on their 5' ends. These 5' extended miRNAs are not simply misannotated, as both longer and shorter forms of the miRNAs co-exist. Additionally, we provide evidence that the 5' end variations can result in changes in the type of AGOs with which these miRNA isoforms preferentially associate. Differences in AGO associations suggest alterations in the biological functioning of the different observed forms of these miRNAs. These variable length miRNAs could essentially be considered miRNA isoforms and should be included in any annotation of miRNAs.
A Perl script mapped each mature miRNA to their respective hairpin, recorded the hairpin sequence, then appended one, two or three nucleotide(s) to the 5' of the mature miRNA. For miRNA, the Perl script recorded five sequences: hairpin, mature miRNA, +1 miRNA, +2 miRNA and +3 miRNA. All * sequences were ignored and not used for analysis. Scripts are available online under GPLV.2 http://www.bioinformatics.org/ebbie. The output file in FASTA format was used for an in silico northern blot, which probed all computer-generated small RNA sequences in various datasets (GEO:GSE17741 , GEO:GSE5343  and ath-sbs ) using a modified Ebbie-(mis)match-AGO v1  script. To determine AGO complex affiliation, the computer-generated small RNAs were similarly compared against the AGO1, AGO2, AGO4 and AGO5 small RNA datasets  using Ebbie-(mis)match-AGO v2. Computation was performed on an IBM system (Model x3850; IBM Computers, Markham, ON, Canada).
HAE is a Natural Sciences and Engineering Research Council of Canada (NSERC) Post-Doctoral Fellow. This project has been made possible through a grant from the Alberta Health Services and the Alberta Cancer Foundation to RPF.
- Carthew RW: Gene regulation by microRNAs. Curr Opin Genet Dev. 2006, 16: 203-208. 10.1016/j.gde.2006.02.012.View ArticlePubMedGoogle Scholar
- Jones-Rhoades MW, Bartel DP, Bartel B: MicroRNAS and their regulatory roles in plants. Ann Rev Plant Biol. 2006, 57: 19-53. 10.1146/annurev.arplant.57.032905.105218.View ArticleGoogle Scholar
- Hagen JW, Lai EC: microRNA control of cell-cell signaling during development and disease. Cell Cycle. 2008, 7: 2327-2332.PubMed CentralView ArticlePubMedGoogle Scholar
- Williams AE: Functional aspects of animal microRNAs. Cell Mol Life Sci. 2008, 65: 545-562. 10.1007/s00018-007-7355-9.View ArticlePubMedGoogle Scholar
- Yang B, Lu Y, Wang Z: Control of cardiac excitability by microRNAs. Cardiovasc Res. 2008, 79: 571-580. 10.1093/cvr/cvn181.View ArticlePubMedGoogle Scholar
- Ebhardt HA, Thi EP, Wang MB, Unrau PJ: Extensive 3' modification of plant small RNAs is modulated by helper component-proteinase expression. Proc Natl Acad Sci USA. 2005, 102: 13398-13403. 10.1073/pnas.0506597102.PubMed CentralView ArticlePubMedGoogle Scholar
- Bartel DP: MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004, 116: 281-297. 10.1016/S0092-8674(04)00045-5.View ArticlePubMedGoogle Scholar
- Voinnet O: Origin, biogenesis, and activity of plant microRNAs. Cell. 2009, 136: 669-687. 10.1016/j.cell.2009.01.046.View ArticlePubMedGoogle Scholar
- Ambros V, Bartel B, Bartel DP, Burge CB, Carrington JC, Chen X, Dreyfuss G, Eddy SR, Griffiths-Jones S, Marshall M, Matzke M, Ruvkun G, Tuschl T: A uniform system for microRNA annotation. RNA. 2003, 9: 277-279. 10.1261/rna.2183803.PubMed CentralView ArticlePubMedGoogle Scholar
- Meyers BC, Axtell MJ, Bartel B, Bartel DP, Baulcombe D, Bowman JL, Cao X, Carrington JC, Chen X, Green PJ, Griffiths-Jones S, Jacobsen SE, Mallory AC, Martienssen RA, Poethig RS, Qi Y, Vaucheret H, Voinnet O, Watanabe Y, Weigel D, Zhu JK: Criteria for annotation of plant microRNAs. Plant Cell. 2008, 20: 3186-3190. 10.1105/tpc.108.064311.PubMed CentralView ArticlePubMedGoogle Scholar
- Mi S, Cai T, Hu Y, Chen Y, Hodges E, Ni F, Wu L, Li S, Zhou H, Long C, Chen S, Hannon GJ, Qi Y: Sorting of small RNAs into Arabidopsis argonaute complexes is directed by the 5' terminal nucleotide. Cell. 2008, 133: 116-127. 10.1016/j.cell.2008.02.034.PubMed CentralView ArticlePubMedGoogle Scholar
- Vaucheret H: AGO1 homeostasis involves differential production of 21-nt and 22-nt miR168 species by MIR168a and MIR168b. PLoS One. 2009, 4: e6442-10.1371/journal.pone.0006442.PubMed CentralView ArticlePubMedGoogle Scholar
- Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008, 36: D154-8. 10.1093/nar/gkm952.PubMed CentralView ArticlePubMedGoogle Scholar
- Montgomery TA, Howell MD, Cuperus JT, Li D, Hansen JE, Alexander AL, Chapman EJ, Fahlgren N, Allen E, Carrington JC: Specificity of ARGONAUTE7-miR390 interaction and dual functionality in TAS3 trans-acting siRNA formation. Cell. 2008, 133: 128-141. 10.1016/j.cell.2008.02.033.View ArticlePubMedGoogle Scholar
- Havecker ER, Wallbridge LM, Hardcastle TJ, Bush MS, Kelly KA, Dunn RM, Schwach F, Doonan JH, Baulcombe DC: The Arabidopsis RNA-directed DNA methylation argonautes functionally diverge based on their expression and interaction with target loci. Plant Cell. 2010, 22: 321-334. 10.1105/tpc.109.072199.PubMed CentralView ArticlePubMedGoogle Scholar
- Vaucheret H, Vazquez F, Crete P, Bartel DP: The action of ARGONAUTE1 in the miRNA pathway and its regulation by the miRNA pathway are crucial for plant development. Genes Dev. 2004, 18: 1187-1197. 10.1101/gad.1201404.PubMed CentralView ArticlePubMedGoogle Scholar
- Qi Y, Denli AM, Hannon GJ: Biochemical specialization within Arabidopsis RNA silencing pathways. Mol Cell. 2005, 19: 421-428. 10.1016/j.molcel.2005.06.014.View ArticlePubMedGoogle Scholar
- Baumberger N, Baulcombe DC: Arabidopsis ARGONAUTE1 is an RNA Slicer that selectively recruits microRNAs and short interfering RNAs. Proc Natl Acad Sci USA. 2005, 102: 11928-11933. 10.1073/pnas.0505461102.PubMed CentralView ArticlePubMedGoogle Scholar
- Morel JB, Godon C, Mourrain P, Beclin C, Boutet S, Feuerbach F, Proux F, Vaucheret H: Fertile hypomorphic ARGONAUTE (ago1) mutants impaired in post-transcriptional gene silencing and virus resistance. Plant Cell. 2002, 14: 629-639. 10.1105/tpc.010358.PubMed CentralView ArticlePubMedGoogle Scholar
- Zilberman D, Cao X, Jacobsen SE: ARGONAUTE4 control of locus-specific siRNA accumulation and DNA and histone methylation. Science. 2003, 299: 716-719. 10.1126/science.1079695.View ArticlePubMedGoogle Scholar
- Qi Y, He X, Wang XJ, Kohany O, Jurka J, Hannon GJ: Distinct catalytic and non-catalytic roles of ARGONAUTE4 in RNA-directed DNA methylation. Nature. 2006, 443: 1008-1012. 10.1038/nature05198.View ArticlePubMedGoogle Scholar
- Ebhardt HA, Tsang HH, Dai DC, Liu Y, Bostan B, Fahlman RP: Meta-analysis of small RNA-sequencing errors reveals ubiquitous post-transcriptional RNA modifications. Nucleic Acids Res. 2009, 37: 2461-2470. 10.1093/nar/gkp093.PubMed CentralView ArticlePubMedGoogle Scholar
- de Hoon MJ, Taft RJ, Hashimoto T, Kanamori-Katayama M, Kawaji H, Kawano M, Kishima M, Lassmann T, Faulkner GJ, Mattick JS, Daub CO, Carninci P, Kawai J, Suzuki H, Hayashizaki Y: Cross-mapping and the identification of editing sites in mature microRNAs in high-throughput sequencing libraries. Genome Res. 2010, 20: 257-264. 10.1101/gr.095273.109.PubMed CentralView ArticlePubMedGoogle Scholar
- Iida K, Jin H, Zhu JK: Bioinformatics analysis suggests base modifications of tRNAs and miRNAs in Arabidopsis thaliana. BMC Genomics. 2009, 10: 155-10.1186/1471-2164-10-155.PubMed CentralView ArticlePubMedGoogle Scholar
- Pantano L, Estivill X, Marti E: SeqBuster, a bioinformatic tool for the processing and analysis of small RNAs datasets, reveals ubiquitous miRNA modifications in human embryonic cells. Nucleic Acids Res. 2010, 38 (5): e34-10.1093/nar/gkp1127.PubMed CentralView ArticlePubMedGoogle Scholar
- Hsieh LC, Lin SI, Shih AC, Chen JW, Lin WY, Tseng CY, Li WH, Chiou TJ: Uncovering small RNA-mediated responses to phosphate deficiency in Arabidopsis by deep sequencing. Plant Physiol. 2009, 151: 2120-2132. 10.1104/pp.109.147280.PubMed CentralView ArticlePubMedGoogle Scholar
- sbs database. [http://mpss.udel.edu/at_sbs/]
- Jones-Rhoades MW, Bartel DP: Computational identification of plant microRNAs and their targets, including a stress-induced miRNA. Mol Cell. 2004, 14: 787-799. 10.1016/j.molcel.2004.05.027.View ArticlePubMedGoogle Scholar
- Rajagopalan R, Vaucheret H, Trejo J, Bartel DP: A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes Dev. 2006, 20: 3407-3425. 10.1101/gad.1476406.PubMed CentralView ArticlePubMedGoogle Scholar
- Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, Wang SH, Wang Y, Weiner MP, Yu P, Begley RF, Rothberg JM: Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005, 437: 376-380.PubMed CentralPubMedGoogle Scholar
- Vaucheret H, Mallory AC, Bartel DP: AGO1 homeostasis entails coexpression of MIR168 and AGO1 and preferential stabilization of miR168 by AGO1. Mol Cell. 2006, 22: 129-136. 10.1016/j.molcel.2006.03.011.PubMed CentralView ArticlePubMedGoogle Scholar
- Lu C, Kulkarni K, Souret FF, MuthuValliappan R, Tej SS, Poethig RS, Henderson IR, Jacobsen SE, Wang W, Green PJ, Meyers BC: MicroRNAs and other sm all RNAs enriched in the Arabidopsis RNA-dependent RNA polymerase-2 mutant. Genome Res. 2006, 16: 1276-1288. 10.1101/gr.5530106.PubMed CentralView ArticlePubMedGoogle Scholar
- Freeman GH, Halton JH: Note on exact treatment of contingency goodness of fit and other problems of significance. Biometrika. 1951, 141-149.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.