An RNA motif is a description of a group of RNAs that have a related structure. RNA motifs consist of a pattern of features within the primary sequence and secondary structure of related RNAs. Thus, it extends the concept of a sequence motif to include RNA secondary structure. The term "RNA motif" can refer both to the pattern and to the RNA sequences that match it.
Descriptions of RNAs motifs
RNA motifs can be described in two main forms: a multiple sequence alignment or an explicit search pattern. An alignment is usually augmented with a consensus secondary structure, i.e. the structure that is common to all or most RNAs. The sequences in the alignment then implicitly define a pattern of conservation that can, for example, be used to find additional examples of the RNA. This search strategy is implemented by, among others, the Infernal software package. [1]
The Rfam database is a collection of multiple sequence alignments that define a large subset of reliably known RNA motifs and associated information. Its data can be used with the Infernal software to find examples of such RNAs in sequence databases, e.g. genome sequences.
Alternatively, RNA motifs can also be described using explicit search patterns, which define specific primary sequence patterns combined with constraints of where helices should form. Such patterns can be used to find matching subsequences in a large sequence database. Several software packages implement such a search, e.g. RNArobo [2] and RNAmotif.[3]
Discovery of novel RNA motifs
Many methods to discover novel RNAs use a comparative approach, in which different sequences are analyzed together in order to detect characteristic signals of a conserved RNA. When such methods are successful, the resulting novel conserved RNA can be viewed as an RNA motif, expressed using an alignment or a pattern. An early example is the RNA motif based around the T-box, which in 1993 was determined to be associated with aminoacyl-tRNA synthetase genes.[4] The mechanism by which this RNA motif regulates genes was later demonstrated, thus establishing the functional importance of the RNA motif. Later, in 1997, a conserved RNA motif called the B12-box was detected upstream of genes related to B12 metabolism.[5] This RNA motif was later found to correspond to a part of a riboswitch that binds the co-factor adenosylcobalamin, which is often called the cobalamin riboswitch. (Later variants were shown to bind other cobalamin derivatives.) Many other examples of RNA motifs whose functions were later determined are known, especially in the context of riboswitches.[6] However, other types of RNA motifs have been functionally characterized, such as bacterial sRNAs like the 6C RNA, which was discovered as a motif in 2007[7] and functionally characterized in 2016,[8] or ribozymes like the twister ribozyme, which was detected as an RNA motif and functionally characterized in the same publication.[9]
References
- ↑ Nawrocki EP, Eddy SR (November 2013). "Infernal 1.1: 100-fold faster RNA homology searches". Bioinformatics. 29 (22): 2933–5. doi:10.1093/bioinformatics/btt509. PMC 3810854. PMID 24008419.
- ↑ Rampášek L, Jimenez RM, Lupták A, Vinař T, Brejová B (May 2016). "RNA motif search with data-driven element ordering". BMC Bioinformatics. 17 (1): 216. doi:10.1186/s12859-016-1074-x. PMC 4870747. PMID 27188396.
- ↑ Macke TJ, Ecker DJ, Gutell RR, Gautheret D, Case DA, Sampath R (November 2001). "RNAMotif, an RNA secondary structure definition and search algorithm". Nucleic Acids Res. 29 (22): 4724–35. doi:10.1093/nar/29.22.4724. PMC 92549. PMID 11713323.
- ↑ Grundy FJ, Henkin TM (August 1993). "tRNA as a positive regulator of transcription antitermination in B. subtilis". Cell. 74 (3): 475–82. doi:10.1016/0092-8674(93)80049-k. PMID 8348614. S2CID 35490594.
- ↑ Franklund CV, Kadner RJ (June 1997). "Multiple transcribed elements control expression of the Escherichia coli btuB gene". J Bacteriol. 179 (12): 4039–42. doi:10.1128/jb.179.12.4039-4042.1997. PMC 179215. PMID 9190822.
- ↑ Sherlock ME, Breaker RR (June 2020). "Former orphan riboswitches reveal unexplored areas of bacterial metabolism, signaling, and gene control processes". RNA. 26 (6): 675–693. doi:10.1261/rna.074997.120. PMC 7266159. PMID 32165489.
- ↑ Weinberg Z, Barrick JE, Yao Z, Roth A, Kim JN, Gore J, Wang JX, Lee ER, Block KF, Sudarsan N, Neph S, Tompa M, Ruzzo WL, Breaker RR (2007). "Identification of 22 candidate structured RNAs in bacteria using the CMfinder comparative genomics pipeline". Nucleic Acids Res. 35 (14): 4809–19. doi:10.1093/nar/gkm487. PMC 1950547. PMID 17621584.
- ↑ Pahlke J, Dostálová H, Holátko J, Degner U, Bott M, Pátek M, Polen T (September 2016). "The small 6C RNA of Corynebacterium glutamicum is involved in the SOS response". RNA Biol. 13 (9): 848–60. doi:10.1080/15476286.2016.1205776. PMC 5014011. PMID 27362471.
- ↑ Roth A, Weinberg Z, Chen AG, Kim PB, Ames TD, Breaker RR (January 2014). "A widespread self-cleaving ribozyme class is revealed by bioinformatics". Nat Chem Biol. 10 (1): 56–60. doi:10.1038/nchembio.1386. PMC 3867598. PMID 24240507.