Long non-coding RNA

(Redirected from LncRNA)

Long non-coding RNAs (long ncRNAs, lncRNA) are a type of RNA, generally defined as transcripts more than 200 nucleotides that are not translated into protein.[2] This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as microRNAs (miRNAs), small interfering RNAs (siRNAs), Piwi-interacting RNAs (piRNAs), small nucleolar RNAs (snoRNAs), and other short RNAs.[3] Given that some lncRNAs have been reported to have the potential to encode small proteins or micro-peptides, the latest definition of lncRNA is a class of transcripts of over 200 nucleotides that have no or limited coding capacity.[4] However, John S. Mattick and colleagues suggested to change definition of long non-coding RNAs to transcripts more than 500 nt, which are mostly generated by Pol II.[5] That means that question of lncRNA exact definition is still under discussion in the field. Long intervening/intergenic noncoding RNAs (lincRNAs) are sequences of transcripts that do not overlap protein-coding genes.[6]

Different types of long non-coding RNAs.[1]

Long non-coding RNAs include intergenic lincRNAs, intronic ncRNAs, and sense and antisense lncRNAs, each type showing different genomic positions in relation to genes and exons.[1][3]

The definition of lncRNAs differs from that of other RNAs such as siRNAs, mRNAs, miRNAs, and snoRNAs because it is not connected to the function of the RNA. A lncRNA is any transcript that is not one of the other well-characterized RNAs and is longer than 200-500 nucleotides. Some scientists think that most lncRNAs do not have a biologically relevant function because they are transcripts of junk DNA.[7][8]

Abundance

edit

Long non-coding transcripts are found in many species. Large-scale complementary DNA (cDNA) sequencing projects such as FANTOM reveal the complexity of these transcripts in humans.[9] The FANTOM3 project identified ~35,000 non-coding transcripts that bear many signatures of messenger RNAs, including 5' capping, splicing, and poly-adenylation, but have little or no open reading frame (ORF).[9] This number represents a conservative lower estimate, since it omitted many singleton transcripts and non-polyadenylated transcripts (tiling array data shows more than 40% of transcripts are non-polyadenylated).[10] Identifying ncRNAs within these cDNA libraries is challenging since it can be difficult to distinguish protein-coding transcripts from non-coding transcripts. It has been suggested through multiple studies that testis,[11] and neural tissues express the greatest amount of long non-coding RNAs of any tissue type.[12] Using FANTOM5, 27,919 long ncRNAs have been identified in various human sources.[13]

Quantitatively, lncRNAs demonstrate ~10-fold lower abundance than mRNAs,[14][15] which is explained by higher cell-to-cell variation of expression levels of lncRNA genes in the individual cells, when compared to protein-coding genes.[16] In general, the majority (~78%) of lncRNAs are characterized as tissue-specific, as opposed to only ~19% of mRNAs.[14] Only 3.6% of human lncRNA genes are expressed in various biological contexts and 34% of lncRNA genes are expressed at high level (top 25% of both lncRNAs and mRNAs) in at least one biological context.[17] In addition to higher tissue specificity, lncRNAs are characterized by higher developmental stage specificity,[18] and cell subtype specificity in tissues such as human neocortex[19] and other parts of the brain, regulating correct brain development and function.[20] In 2022, a comprehensive integration of lncRNAs from existing databases, revealed that there are 95,243 lncRNA genes and 323,950 transcripts in humans.[21]

In comparison to mammals relatively few studies have focused on the prevalence of lncRNAs in plants. However an extensive study considering 37 higher plant species and six algae identified ~200,000 non-coding transcripts using an in-silico approach,[22] which also established the associated Green Non-Coding Database (GreeNC), a repository of plant lncRNAs.

Genomic organization

edit

In 2005 the landscape of the mammalian genome was described as numerous 'foci' of transcription that are separated by long stretches of intergenic space.[9] While some long ncRNAs are located within the intergenic stretches, the majority are overlapping sense and antisense transcripts that often include protein-coding genes,[23] giving rise to a complex hierarchy of overlapping isoforms.[24] Genomic sequences within these transcriptional foci are often shared within a number of coding and non-coding transcripts in the sense and antisense directions[25] For example, 3012 out of 8961 cDNAs previously annotated as truncated coding sequences within FANTOM2 were later designated as genuine ncRNA variants of protein-coding cDNAs.[9] While the abundance and conservation of these arrangements suggest they have biological relevance, the complexity of these foci frustrates easy evaluation.

The GENCODE consortium has collated and analysed a comprehensive set of human lncRNA annotations and their genomic organisation, modifications, cellular locations and tissue expression profiles.[12] Their analysis indicates human lncRNAs show a bias toward two-exon transcripts.[12]

Identification software

edit
Name Taxonomic group Web server Repository Input file Main model / algorithm Training set Year published Reference
PLEKv2 Plant, animal PLEKv2 Paper PLEKv2 source code FASTA CNN No 2024 [26]
DeepPlnc Plant DeepPlnc Server DeepPlnc FASTA Neural network Yes 2022 [27]
RNAsamba All RNAsamba RNAsamba FASTA Neural network Yes 2020 [28]
LGC Plant, animal LGC FASTA, BED, GTF Relationship between ORF length and GC content No 2019 [29]
CPAT Human, fly, mouse, zebrafish CPAT CPAT FASTA/BED Logistic regression Yes 2013 [30]
COME Plant, human, mouse, fly, worm COME COME GTF Random forest Yes 2017 [31]
CNCI Plant, animal NA FASTA, GTF Support vector machine No 2013 [32]
PLEK Vertebrate, Plant NA PLEK FASTA Support vector machine No 2014 [33]
FEELnc All NA FEELnc FASTA, GTF Random forest Yes 2017 [34]
PhyloCSF Vertebrate, fly, mosquito, yeast, worm NA FASTA Phylogenetic codon model Yes 2011 [35]
slncky All NA slncky FASTA, BED Evolutionary conservation Yes 2016 [36]

Translation

edit

There has been considerable debate about whether lncRNAs have been misannotated and do in fact encode proteins. Several lncRNAs have been found to in fact encode for peptides with biologically significant function.[37][38][39] Ribosome profiling studies have suggested that anywhere from 40% to 90% of annotated lncRNAs are in fact translated,[40][41] although there is disagreement about the correct method for analyzing ribosome profiling data.[42] Additionally, it is thought that many of the peptides produced by lncRNAs may be highly unstable and without biological function.[41]

Conservation

edit

Unlike protein coding genes, sequence of long non-coding RNAs has lower level of conservation. Initial studies into lncRNA conservation noted that as a class, they were enriched for conserved sequence elements,[43] depleted in substitution and insertion/deletion rates[44] and depleted in rare frequency variants,[45] indicative of purifying selection maintaining lncRNA function. However, further investigations into vertebrate lncRNAs revealed that while lncRNAs are conserved in sequence, they are not conserved in transcription.[46][47][11] In other words, even when the sequence of a human lncRNA is conserved in another vertebrate species, there is often no transcription of a lncRNA in the orthologous genomic region. Some argue that these observations suggest non-functionality of the majority of lncRNAs,[48][49][7] while others argue that they may be indicative of rapid species-specific adaptive selection.[50]

While the turnover of lncRNA transcription is much higher than initially expected, it is important to note that still, hundreds of lncRNAs are conserved at the sequence level. There have been several attempts to delineate the different categories of selection signatures seen amongst lncRNAs including: lncRNAs with strong sequence conservation across the entire length of the gene, lncRNAs in which only a portion of the transcript (e.g. 5′ end, splice sites) is conserved, and lncRNAs that are transcribed from syntenic regions of the genome but have no recognizable sequence similarity.[51][52][53] Additionally, there have been attempts to identify conserved secondary structures in lncRNAs, though these studies have currently given way to conflicting results.[54][55]

Functions

edit

Despite claims that the majority of long noncoding RNAs in mammals are likely to be functional,[56][57] it seems likely that most of them are transcriptional noise and only a relatively small proportion has been demonstrated to be biologically relevant.[7][8]

Some lncRNAs have been functionally annotated in LncRNAdb (a database of literature described lncRNAs),[58][59] with the majority of these being described in humans. Over 2600 human lncRNAs with experimental evidences have been community-curated in LncRNAWiki (a wiki-based, publicly editable and open-content platform for community curation of human lncRNAs).[60] According to the curation of functional mechanisms of lncRNAs based on the literatures, lncRNAs are extensively reported to be involved in ceRNA regulation, transcriptional regulation, and epigenetic regulation.[60] A further large-scale sequencing study provides evidence that many transcripts thought to be lncRNAs may, in fact, be translated into proteins.[61]

In the regulation of gene transcription

edit

In gene-specific transcription

edit

In eukaryotes, RNA transcription is a tightly regulated process. Noncoding RNAs act upon different aspects of this process, targeting transcriptional modulators, RNA polymerase (RNAP) II and even the DNA duplex to regulate gene expression.[62]

NcRNAs modulate transcription by several mechanisms, including functioning themselves as co-regulators, modifying transcription factor activity, or regulating the association and activity of co-regulators. For example, the noncoding RNA Evf-2 functions as a co-activator for the homeobox transcription factor Dlx2, which plays important roles in forebrain development and neurogenesis.[63][64] Sonic hedgehog induces transcription of Evf-2 from an ultra-conserved element located between the Dlx5 and Dlx6 genes during forebrain development.[63] Evf-2 then recruits the Dlx2 transcription factor to the same ultra-conserved element whereby Dlx2 subsequently induces expression of Dlx5. The existence of other similar ultra- or highly conserved elements within the mammalian genome that are both transcribed and fulfill enhancer functions suggest Evf-2 may be illustrative of a generalised mechanism that regulates developmental genes with complex expression patterns during vertebrate growth.[65][66] Indeed, the transcription and expression of similar non-coding ultraconserved elements was shown to be abnormal in human leukaemia and to contribute to apoptosis in colon cancer cells, suggesting their involvement in tumorigenesis in like fashion to protein-coding RNA.[67][68][69]

Local ncRNAs can also recruit transcriptional programmes to regulate adjacent protein-coding gene expression.

The RNA binding protein TLS binds and inhibits the CREB binding protein and p300 histone acetyltransferase activities on a repressed gene target, cyclin D1. The recruitment of TLS to the promoter of cyclin D1 is directed by long ncRNAs expressed at low levels and tethered to 5' regulatory regions in response to DNA damage signals.[70] Moreover, these local ncRNAs act cooperatively as ligands to modulate the activities of TLS. In the broad sense, this mechanism allows the cell to harness RNA-binding proteins, which make up one of the largest classes within the mammalian proteome, and integrate their function in transcriptional programs. Nascent long ncRNAs have been shown to increase the activity of CREB binding protein, which in turn increases the transcription of that ncRNA.[71] A study found that a lncRNA in the antisense direction of the Apolipoprotein A1 (APOA1) regulates the transcription of APOA1 through epigenetic modifications.[72]

Recent evidence has raised the possibility that transcription of genes that escape from X-inactivation might be mediated by expression of long non-coding RNA within the escaping chromosomal domains.[73]

Regulating basal transcription machinery

edit

NcRNAs also target general transcription factors required for the RNAP II transcription of all genes.[62] These general factors include components of the initiation complex that assemble on promoters or involved in transcription elongation. A ncRNA transcribed from an upstream minor promoter of the dihydrofolate reductase (DHFR) gene forms a stable RNA-DNA triplex within the major promoter of DHFR to prevent the binding of the transcriptional co-factor TFIIB.[74] This novel mechanism of regulating gene expression may represent a widespread method of controlling promoter usage, as thousands of RNA-DNA triplexes exist in eukaryotic chromosome.[75] The U1 ncRNA can induce transcription by binding to and stimulating TFIIH to phosphorylate the C-terminal domain of RNAP II.[76] In contrast the ncRNA 7SK is able to repress transcription elongation by, in combination with HEXIM1/2, forming an inactive complex that prevents PTEFb from phosphorylating the C-terminal domain of RNAP II,[76][77][78] repressing global elongation under stressful conditions. These examples, which bypass specific modes of regulation at individual promoters provide a means of quickly affecting global changes in gene expression.

The ability to quickly mediate global changes is also apparent in the rapid expression of non-coding repetitive sequences. The short interspersed nuclear (SINE) Alu elements in humans and analogous B1 and B2 elements in mice have succeeded in becoming the most abundant mobile elements within the genomes, comprising ~10% of the human and ~6% of the mouse genome, respectively.[79][80] These elements are transcribed as ncRNAs by RNAP III in response to environmental stresses such as heat shock,[81] where they then bind to RNAP II with high affinity and prevent the formation of active pre-initiation complexes.[82][83][84][85] This allows for the broad and rapid repression of gene expression in response to stress.[82][85]

A dissection of the functional sequences within Alu RNA transcripts has drafted a modular structure analogous to the organization of domains in protein transcription factors.[86] The Alu RNA contains two 'arms', each of which may bind one RNAP II molecule, as well as two regulatory domains that are responsible for RNAP II transcriptional repression in vitro.[85] These two loosely structured domains may even be concatenated to other ncRNAs such as B1 elements to impart their repressive role.[85] The abundance and distribution of Alu elements and similar repetitive elements throughout the mammalian genome may be partly due to these functional domains being co-opted into other long ncRNAs during evolution, with the presence of functional repeat sequence domains being a common characteristic of several known long ncRNAs including Kcnq1ot1, Xlsirt and Xist.[87][88][89][90]

In addition to heat shock, the expression of SINE elements (including Alu, B1, and B2 RNAs) increases during cellular stress such as viral infection[91] in some cancer cells[92] where they may similarly regulate global changes to gene expression. The ability of Alu and B2 RNA to bind directly to RNAP II provides a broad mechanism to repress transcription.[83][85] Nevertheless, there are specific exceptions to this global response where Alu or B2 RNAs are not found at activated promoters of genes undergoing induction, such as the heat shock genes.[85] This additional hierarchy of regulation that exempts individual genes from the generalised repression also involves a long ncRNA, heat shock RNA-1 (HSR-1). It was argued that HSR-1 is present in mammalian cells in an inactive state, but upon stress is activated to induce the expression of heat shock genes.[93] This activation involves a conformational alteration of HSR-1 in response to rising temperatures, permitting its interaction with the transcriptional activator HSF-1, which trimerizes and induces the expression of heat shock genes.[93] In the broad sense, these examples illustrate a regulatory circuit nested within ncRNAs whereby Alu or B2 RNAs repress general gene expression, while other ncRNAs activate the expression of specific genes.

Transcribed by RNA polymerase III

edit

Many of the ncRNAs that interact with general transcription factors or RNAP II itself (including 7SK, Alu and B1 and B2 RNAs) are transcribed by RNAP III,[94] uncoupling their expression from RNAP II, which they regulate. RNAP III also transcribes other ncRNAs, such as BC2, BC200 and some microRNAs and snoRNAs, in addition to housekeeping ncRNA genes such as tRNAs, 5S rRNAs and snRNAs.[94] The existence of an RNAP III-dependent ncRNA transcriptome that regulates its RNAP II-dependent counterpart is supported by the finding of a set of ncRNAs transcribed by RNAP III with sequence homology to protein-coding genes. This prompted the authors to posit a 'cogene/gene' functional regulatory network,[95] showing that one of these ncRNAs, 21A, regulates the expression of its antisense partner gene, CENP-F in trans.

In post-transcriptional regulation

edit

In addition to regulating transcription, ncRNAs also control various aspects of post-transcriptional mRNA processing. Similar to small regulatory RNAs such as microRNAs and snoRNAs, these functions often involve complementary base pairing with the target mRNA. The formation of RNA duplexes between complementary ncRNA and mRNA may mask key elements within the mRNA required to bind trans-acting factors, potentially affecting any step in post-transcriptional gene expression including pre-mRNA processing and splicing, transport, translation, and degradation.[96]

In splicing

edit

The splicing of mRNA can induce its translation and functionally diversify the repertoire of proteins it encodes. The Zeb2 mRNA requires the retention of a 5'UTR intron that contains an internal ribosome entry site for efficient translation.[97] The retention of the intron depends on the expression of an antisense transcript that complements the intronic 5' splice site.[97] Therefore, the ectopic expression of the antisense transcript represses splicing and induces translation of the Zeb2 mRNA during mesenchymal development. Likewise, the expression of an overlapping antisense Rev-ErbAa2 transcript controls the alternative splicing of the thyroid hormone receptor ErbAa2 mRNA to form two antagonistic isoforms.[98]

In translation

edit

NcRNA may also apply additional regulatory pressures during translation, a property particularly exploited in neurons where the dendritic or axonal translation of mRNA in response to synaptic activity contributes to changes in synaptic plasticity and the remodelling of neuronal networks. The RNAP III transcribed BC1 and BC200 ncRNAs, that previously derived from tRNAs, are expressed in the mouse and human central nervous system, respectively.[99][100] BC1 expression is induced in response to synaptic activity and synaptogenesis and is specifically targeted to dendrites in neurons.[101] Sequence complementarity between BC1 and regions of various neuron-specific mRNAs also suggest a role for BC1 in targeted translational repression.[102] Indeed, it was recently shown that BC1 is associated with translational repression in dendrites to control the efficiency of dopamine D2 receptor-mediated transmission in the striatum[103] and BC1 RNA-deleted mice exhibit behavioural changes with reduced exploration and increased anxiety.[104]

In siRNA-directed gene regulation

edit

In addition to masking key elements within single-stranded RNA, the formation of double-stranded RNA duplexes can also provide a substrate for the generation of endogenous siRNAs (endo-siRNAs) in Drosophila and mouse oocytes.[105] The annealing of complementary sequences, such as antisense or repetitive regions between transcripts, forms an RNA duplex that may be processed by Dicer-2 into endo-siRNAs. Also, long ncRNAs that form extended intramolecular hairpins may be processed into siRNAs, compellingly illustrated by the esi-1 and esi-2 transcripts.[106] Endo-siRNAs generated from these transcripts seem particularly useful in suppressing the spread of mobile transposon elements within the genome in the germline. However, the generation of endo-siRNAs from antisense transcripts or pseudogenes may also silence the expression of their functional counterparts via RISC effector complexes, acting as an important node that integrates various modes of long and short RNA regulation, as exemplified by the Xist and Tsix (see above).[107]

In epigenetic regulation

edit

Epigenetic modifications, including histone and DNA methylation, histone acetylation and sumoylation, affect many aspects of chromosomal biology, primarily including regulation of large numbers of genes by remodeling broad chromatin domains.[108][109] While it has been known for some time that RNA is an integral component of chromatin,[110][111] it is only recently that we are beginning to appreciate the means by which RNA is involved in pathways of chromatin modification.[112][113][114] For example, Oplr16 epigenetically induces the activation of stem cell core factors by coordinating intrachromosomal looping and recruitment of DNA demethylase TET2.[115]

In Drosophila, long ncRNAs induce the expression of the homeotic gene, Ubx, by recruiting and directing the chromatin modifying functions of the trithorax protein Ash1 to Hox regulatory elements.[114] Similar models have been proposed in mammals, where strong epigenetic mechanisms are thought to underlie the embryonic expression profiles of the Hox genes that persist throughout human development.[116][113] Indeed, the human Hox genes are associated with hundreds of ncRNAs that are sequentially expressed along both the spatial and temporal axes of human development and define chromatin domains of differential histone methylation and RNA polymerase accessibility.[113] One ncRNA, termed HOTAIR, that originates from the HOXC locus represses transcription across 40 kb of the HOXD locus by altering chromatin trimethylation state. HOTAIR is thought to achieve this by directing the action of Polycomb chromatin remodeling complexes in trans to govern the cells' epigenetic state and subsequent gene expression. Components of the Polycomb complex, including Suz12, EZH2 and EED, contain RNA binding domains that may potentially bind HOTAIR and probably other similar ncRNAs.[117][118][119] This example nicely illustrates a broader theme whereby ncRNAs recruit the function of a generic suite of chromatin modifying proteins to specific genomic loci, underscoring the complexity of recently published genomic maps.[109] Indeed, the prevalence of long ncRNAs associated with protein coding genes may contribute to localised patterns of chromatin modifications that regulate gene expression during development. For example, the majority of protein-coding genes have antisense partners, including many tumour suppressor genes that are frequently silenced by epigenetic mechanisms in cancer.[120] A recent study observed an inverse expression profile of the p15 gene and an antisense ncRNA in leukaemia.[120] A detailed analysis showed the p15 antisense ncRNA (CDKN2BAS) was able to induce changes to heterochromatin and DNA methylation status of p15 by an unknown mechanism, thereby regulating p15 expression.[120] Therefore, misexpression of the associated antisense ncRNAs may subsequently silence the tumour suppressor gene contributing towards cancer.

Imprinting

edit

Many emergent themes of ncRNA-directed chromatin modification were first apparent within the phenomenon of imprinting, whereby only one allele of a gene is expressed from either the maternal or the paternal chromosome. In general, imprinted genes are clustered together on chromosomes, suggesting the imprinting mechanism acts upon local chromosome domains rather than individual genes. These clusters are also often associated with long ncRNAs whose expression is correlated with the repression of the linked protein-coding gene on the same allele.[121] Indeed, detailed analysis has revealed a crucial role for the ncRNAs Kcnqot1 and Igf2r/Air in directing imprinting.[122]

Almost all the genes at the Kcnq1 loci are maternally inherited, except the paternally expressed antisense ncRNA Kcnqot1.[123] Transgenic mice with truncated Kcnq1ot fail to silence the adjacent genes, suggesting that Kcnqot1 is crucial to the imprinting of genes on the paternal chromosome.[124] It appears that Kcnqot1 is able to direct the trimethylation of lysine 9 (H3K9me3) and 27 of histone 3 (H3K27me3) to an imprinting centre that overlaps the Kcnqot1 promoter and actually resides within a Kcnq1 sense exon.[125] Similar to HOTAIR (see above), Eed-Ezh2 Polycomb complexes are recruited to the Kcnq1 loci paternal chromosome, possibly by Kcnqot1, where they may mediate gene silencing through repressive histone methylation.[125] A differentially methylated imprinting centre also overlaps the promoter of a long antisense ncRNA Air that is responsible for the silencing of neighbouring genes at the Igf2r locus on the paternal chromosome.[126][127] The presence of allele-specific histone methylation at the Igf2r locus suggests Air also mediates silencing via chromatin modification.[128]

Xist and X-chromosome inactivation

edit

The inactivation of a X-chromosome in female placental mammals is directed by one of the earliest and best characterized long ncRNAs, Xist.[129] The expression of Xist from the future inactive X-chromosome, and its subsequent coating of the inactive X-chromosome, occurs during early embryonic stem cell differentiation. Xist expression is followed by irreversible layers of chromatin modifications that include the loss of the histone (H3K9) acetylation and H3K4 methylation that are associated with active chromatin, and the induction of repressive chromatin modifications including H4 hypoacetylation, H3K27 trimethylation,[129] H3K9 hypermethylation and H4K20 monomethylation as well as H2AK119 monoubiquitylation. These modifications coincide with the transcriptional silencing of the X-linked genes.[130] Xist RNA also localises the histone variant macroH2A to the inactive X–chromosome.[131] There are additional ncRNAs that are also present at the Xist loci, including an antisense transcript Tsix, which is expressed from the future active chromosome and able to repress Xist expression by the generation of endogenous siRNA.[107] Together these ncRNAs ensure that only one X-chromosome is active in female mammals.

Telomeric non-coding RNAs

edit

Telomeres form the terminal region of mammalian chromosomes and are essential for stability and aging and play central roles in diseases such as cancer.[132] Telomeres have been long considered transcriptionally inert DNA-protein complexes until it was shown in the late 2000s that telomeric repeats may be transcribed as telomeric RNAs (TelRNAs)[133] or telomeric repeat-containing RNAs.[134] These ncRNAs are heterogeneous in length, transcribed from several sub-telomeric loci and physically localise to telomeres. Their association with chromatin, which suggests an involvement in regulating telomere specific heterochromatin modifications, is repressed by SMG proteins that protect chromosome ends from telomere loss.[134] In addition, TelRNAs block telomerase activity in vitro and may therefore regulate telomerase activity.[133] Although early, these studies suggest an involvement for telomeric ncRNAs in various aspects of telomere biology.

In regulation of DNA replication timing and chromosome stability

edit

Asynchronously replicating autosomal RNAs (ASARs) are very long (~200kb) non-coding RNAs that are non-spliced, non-polyadenylated, and are required for normal DNA replication timing and chromosome stability.[135][136][137] Deletion of any one of the genetic loci containing ASAR6, ASAR15, or ASAR6-141 results in the same phenotype of delayed replication timing and delayed mitotic condensation (DRT/DMC) of the entire chromosome. DRT/DMC results in chromosomal segregation errors that lead to increased frequency of secondary rearrangements and an unstable chromosome. Similar to Xist, ASARs show random monoallelic expression and exist in asynchronous DNA replication domains. Although the mechanism of ASAR function is still under investigation, it is hypothesized that they work via similar mechanisms as the Xist lncRNA, but on smaller autosomal domains resulting in allele specific changes in gene expression.

Incorrect reparation of DNA double-strand breaks (DSB) leading to chromosomal rearrangements is one of the oncogenesis's primary causes. A number of lncRNAs are crucial at the different stages of the main pathways of DSB repair in eukaryotic cells: nonhomologous end joining (NHEJ) and homology-directed repair (HDR). Gene mutations or variation in expression levels of such RNAs can lead to local DNA repair defects, increasing the chromosome aberration frequency. Moreover, it was demonstrated that some RNAs could stimulate long-range chromosomal rearrangements.[138]

In aging and disease

edit

The discovery that long ncRNAs function in various aspects of cell biology has led to research on their role in disease. Tens of thousands of lncRNAs are potentially associated with diseases based on the multi-omics evidence.[139] A handful of studies have implicated long ncRNAs in a variety of disease states and support an involvement and co-operation in neurological disease and cancer.

The first published report of an alteration in lncRNA abundance in aging and human neurological disease was provided by Lukiw et al.[140] in a study using short post-mortem interval tissues from patients with Alzheimer's disease and non-Alzheimer's dementia (NAD) ; this early work was based on the prior identification of a primate brain-specific cytoplasmic transcript of the Alu repeat family by Watson and Sutcliffe in 1987 known as BC200 (brain, cytoplasmic, 200 nucleotide).[141]

While many association studies have identified unusual expression of long ncRNAs in disease states, there is little understanding of their role in causing disease. Expression analyses that compare tumor cells and normal cells have revealed changes in the expression of ncRNAs in several forms of cancer. For example, in prostate tumours, PCGEM1 (one of two overexpressed ncRNAs) is correlated with increased proliferation and colony formation suggesting an involvement in regulating cell growth.[142] PRNCR1 was found to promote tumor growth in several malignancies like prostate cancer, breast cancer, non-small cell lung cancer, oral squamous cell carcinoma and colorectal cancer.[143] MALAT1 (also known as NEAT2) was originally identified as an abundantly expressed ncRNA that is upregulated during metastasis of early-stage non-small cell lung cancer and its overexpression is an early prognostic marker for poor patient survival rates.[142] LncRNAs such as HEAT2 or KCNQ1OT1 have been shown to be regulated in the blood of patients with cardiovascular diseases such as heart failure or coronary artery disease and, moreover, to predict cardiovascular disease events.[144][145] More recently, the highly conserved mouse homologue of MALAT1 was found to be highly expressed in hepatocellular carcinoma.[146] Intronic antisense ncRNAs with expression correlated to the degree of tumor differentiation in prostate cancer samples have also been reported.[147] Despite a number of long ncRNAs having aberrant expression in cancer, their function and potential role in tumourigenesis is relatively unknown. For example, the ncRNAs HIS-1 and BIC have been implicated in cancer development and growth control, but their function in normal cells is unknown.[148][149] In addition to cancer, ncRNAs also exhibit aberrant expression in other disease states. Overexpression of PRINS is associated with psoriasis susceptibility, with PRINS expression being elevated in the uninvolved epidermis of psoriatic patients compared with both psoriatic lesions and healthy epidermis.[150]

Genome-wide profiling revealed that many transcribed non-coding ultraconserved regions exhibit distinct profiles in various human cancer states.[68] An analysis of chronic lymphocytic leukaemia, colorectal carcinoma and hepatocellular carcinoma found that all three cancers exhibited aberrant expression profiles for ultraconserved ncRNAs relative to normal cells. Further analysis of one ultraconserved ncRNA suggested it behaved like an oncogene by mitigating apoptosis and subsequently expanding the number of malignant cells in colorectal cancers.[68] Many of these transcribed ultraconserved sites that exhibit distinct signatures in cancer are found at fragile sites and genomic regions associated with cancer. It seems likely that the aberrant expression of these ultraconserved ncRNAs within malignant processes results from important functions they fulfil in normal human development.

Recently, a number of association studies examining single nucleotide polymorphisms (SNPs) associated with disease states have been mapped to long ncRNAs. For example, SNPs that identified a susceptibility locus for myocardial infarction mapped to a long ncRNA, MIAT (myocardial infarction associated transcript).[151] Likewise, genome-wide association studies identified a region associated with coronary artery disease[152] that encompassed a long ncRNA, ANRIL.[153] ANRIL is expressed in tissues and cell types affected by atherosclerosis[154][155] and its altered expression is associated with a high-risk haplotype for coronary artery disease.[155][156] Lately there has been increasing evidence on the role of non-coding RNAs in the development and in the categorization of heart failure.[157]

The complexity of the transcriptome, and our evolving understanding of its structure may inform a reinterpretation of the functional basis for many natural polymorphisms associated with disease states. Many SNPs associated with certain disease conditions are found within non-coding regions and the complex networks of non-coding transcription within these regions make it particularly difficult to elucidate the functional effects of polymorphisms. For example, a SNP both within the truncated form of ZFAT and the promoter of an antisense transcript increases the expression of ZFAT not through increasing the mRNA stability, but rather by repressing the expression of the antisense transcript.[158]

The ability of long ncRNAs to regulate associated protein-coding genes may contribute to disease if misexpression of a long ncRNA deregulates a protein coding gene with clinical significance. In similar manner, an antisense long ncRNA that regulates the expression of the sense BACE1 gene, a crucial enzyme in Alzheimer's disease etiology, exhibits elevated expression in several regions of the brain in individuals with Alzheimer's disease[159] Alteration of the expression of ncRNAs may also mediate changes at an epigenetic level to affect gene expression and contribute to disease aetiology. For example, the induction of an antisense transcript by a genetic mutation led to DNA methylation and silencing of sense genes, causing β-thalassemia in a patient.[160]

Alongside their role in mediating pathological processes, long noncoding RNAs play a role in the immune response to vaccination, as identified for both the influenza vaccine and the yellow fever vaccine.[161]

Structure

edit

It took over two decades after the discovery of the first human long non-coding transcripts for the functional significance of lncRNA structures to be fully recognized. Early structural studies led to the proposal of several hypotheses for classifying lncRNA architectures. One hypothesis suggests that lncRNAs may feature a compact tertiary structure, similar to ribozymes like the ribosome or self-splicing introns. Another possibility is that lncRNAs could have structured protein-binding sites arranged in a decentralized scaffold, lacking a compact core. A third hypothesis posits that lncRNAs might exhibit a largely unstructured architecture, with loosely organized protein-binding domains interspersed with long regions of disordered single-stranded RNA.[162]

Studying the tertiary structure of lncRNAs by conventional methods such as X- ray crystallography, cryo-EM and nuclear magnetic resonance (NMR) is unfortunately still hampered by their size and conformational dynamics, and by the fact that for now we still know too little about their mechanism to reconstruct stable and functionally-active lncRNA-ribonucleoprotein complexes. But some pioneering studies, showed that lncRNAs can already be studied by low-resolution single-particle and in-solution methods, such as atomic force microscopy (AFM) and small-angle X-ray scattering (SAXS), in some cases even in complexes with small molecule modulators.[163]

For instance, lncRNA MEG3 was shown to regulate transcription factor p53 thanks to its compact structured core.[164] Moreover, lncRNA Braveheart (Bvht) was shown to have a well-defined, albeit flexible 3D structure that is remodeled upon binding CNBP (Cellular Nucleic-acid Binding Protein) which recognizes distal domains in the RNA.[165] Finally, Xist a master regulator of X chromosome inactivation was shown to specifically bind a small molecule compound, which alters the conformation of Xist RepA motif and displaces two known interacting protein factors (PRC2 and SPEN) from the RNA. By such mechanism of action, the compound abrogates the initiation of X-chromosome inactivation.[166]

See also

edit

References

edit
  1. ^ a b Fernandes JC, Acuña SM, Aoki JI, Floeter-Winter LM, Muxel SM (February 2019). "Long Non-Coding RNAs in the Regulation of Gene Expression: Physiology and Disease". Non-Coding RNA. 5 (1): 17. doi:10.3390/ncrna5010017. PMC 6468922. PMID 30781588.
  2. ^ Perkel JM (June 2013). "Visiting "noncodarnia"". BioTechniques (paper). 54 (6): 301, 303–4. doi:10.2144/000114037. PMID 23750541. "We're calling long noncoding RNAs a class, when actually the only definition is that they are longer than 200 bp," says Ana Marques, a Research Fellow at the University of Oxford who uses evolutionary approaches to understand lncRNA function.
  3. ^ a b Ma L, Bajic VB, Zhang Z (June 2013). "On the classification of long non-coding RNAs". RNA Biology. 10 (6): 925–933. doi:10.4161/rna.24604. PMC 4111732. PMID 23696037.
  4. ^ Ma, Lina; Zhang, Zhang (September 2023). "The contribution of databases towards understanding the universe of long non-coding RNAs". Nature Reviews. Molecular Cell Biology. 24 (9): 601–602. doi:10.1038/s41580-023-00612-z. ISSN 1471-0080. PMID 37147495. S2CID 258528357.
  5. ^ Mattick, John S.; Amaral, Paulo P.; Carninci, Piero; Carpenter, Susan; Chang, Howard Y.; Chen, Ling-Ling; Chen, Runsheng; Dean, Caroline; Dinger, Marcel E.; Fitzgerald, Katherine A.; Gingeras, Thomas R.; Guttman, Mitchell; Hirose, Tetsuro; Huarte, Maite; Johnson, Rory (June 2023). "Long non-coding RNAs: definitions, functions, challenges and recommendations". Nature Reviews Molecular Cell Biology. 24 (6): 430–447. doi:10.1038/s41580-022-00566-8. ISSN 1471-0080. PMC 10213152. PMID 36596869.
  6. ^ Ransohoff JD, Wei Y, Khavari PA (March 2018). "The functions and unique features of long intergenic non-coding RNA". Nature Reviews. Molecular Cell Biology. 19 (3): 143–157. doi:10.1038/nrm.2017.104. PMC 5889127. PMID 29138516.
  7. ^ a b c Palazzo AF, Lee ES (2015-01-26). "Non-coding RNA: what is functional and what is junk?". Frontiers in Genetics. 6: 2. doi:10.3389/fgene.2015.00002. PMC 4306305. PMID 25674102.
  8. ^ a b Ponting CP, Haerty W (2022). "Genome-Wide Analysis of Human Long Noncoding RNAs: A Provocative Review". Annual Review of Genomics and Human Genetics. 23: 153–172. doi:10.1146/annurev-genom-112921-123710. hdl:20.500.11820/ede40d70-b99c-42b0-a378-3b9b7b256a1b. PMID 35395170.
  9. ^ a b c d Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, et al. (September 2005). "The transcriptional landscape of the mammalian genome". Science. 309 (5740): 1559–1563. Bibcode:2005Sci...309.1559F. doi:10.1126/science.1112014. PMID 16141072. S2CID 8712839.
  10. ^ Cheng J, Kapranov P, Drenkow J, Dike S, Brubaker S, Patel S, Long J, Stern D, Tammana H, Helt G, Sementchenko V, Piccolboni A, Bekiranov S, Bailey DK, Ganesh M, Ghosh S, Bell I, Gerhard DS, Gingeras TR (May 2005). "Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution". Science. 308 (5725): 1149–1154. Bibcode:2005Sci...308.1149C. doi:10.1126/science.1108625. PMID 15790807. S2CID 13047538.
  11. ^ a b Necsulea A, Soumillon M, Warnefors M, Liechti A, Daish T, Zeller U, Baker JC, Grützner F, Kaessmann H (January 2014). "The evolution of lncRNA repertoires and expression patterns in tetrapods". Nature. 505 (7485): 635–640. Bibcode:2014Natur.505..635N. doi:10.1038/nature12943. PMID 24463510. S2CID 1179101.
  12. ^ a b c Derrien T, Johnson R, Bussotti G, Tanzer A, Djebali S, Tilgner H, Guernec G, Martin D, Merkel A, Knowles DG, Lagarde J, Veeravalli L, Ruan X, Ruan Y, Lassmann T, Carninci P, Brown JB, Lipovich L, Gonzalez JM, Thomas M, Davis CA, Shiekhattar R, Gingeras TR, Hubbard TJ, Notredame C, Harrow J, Guigó R (September 2012). "The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression". Genome Research. 22 (9): 1775–1789. doi:10.1101/gr.132159.111. PMC 3431493. PMID 22955988.
  13. ^ Hon CC, Ramilowski JA, Harshbarger J, Bertin N, Rackham OJ, Gough J, Denisenko E, Schmeier S, Poulsen TM, Severin J, Lizio M, Kawaji H, Kasukawa T, Itoh M, Burroughs AM, Noma S, Djebali S, Alam T, Medvedeva YA, Testa AC, Lipovich L, Yip CW, Abugessaisa I, Mendez M, Hasegawa A, Tang D, Lassmann T, Heutink P, Babina M, Wells CA, Kojima S, Nakamura Y, Suzuki H, Daub CO, de Hoon MJ, Arner E, Hayashizaki Y, Carninci P, Forrest AR (March 2017). "An atlas of human long non-coding RNAs with accurate 5′ ends". Nature. 543 (7644): 199–204. Bibcode:2017Natur.543..199H. doi:10.1038/nature21374. PMC 6857182. PMID 28241135.
  14. ^ a b Cabili MN, Trapnell C, Goff L, Koziol M, Tazon-Vega B, Regev A, Rinn JL (September 2011). "Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses". Genes & Development. 25 (18): 1915–1927. doi:10.1101/gad.17446611. PMC 3185964. PMID 21890647.
  15. ^ Ravasi T, Suzuki H, Pang KC, Katayama S, Furuno M, Okunishi R, Fukuda S, Ru K, Frith MC, Gongora MM, Grimmond SM, Hume DA, Hayashizaki Y, Mattick JS (January 2006). "Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome". Genome Research. 16 (1): 11–19. doi:10.1101/gr.4200206. PMC 1356124. PMID 16344565.
  16. ^ Yunusov D, Anderson L, DaSilva LF, Wysocka J, Ezashi T, Roberts RM, Verjovski-Almeida S (September 2016). "HIPSTR and thousands of lncRNAs are heterogeneously expressed in human embryos, primordial germ cells and stable cell lines". Scientific Reports. 6: 32753. Bibcode:2016NatSR...632753Y. doi:10.1038/srep32753. PMC 5015059. PMID 27605307.
  17. ^ Li, Zhao; Liu, Lin; Jiang, Shuai; Li, Qianpeng; Feng, Changrui; Du, Qiang; Zou, Dong; Xiao, Jingfa; Zhang, Zhang; Ma, Lina (2021-01-08). "LncExpDB: an expression database of human long non-coding RNAs". Nucleic Acids Research. 49 (D1): D962–D968. doi:10.1093/nar/gkaa850. ISSN 1362-4962. PMC 7778919. PMID 33045751.
  18. ^ Yan L, Yang M, Guo H, Yang L, Wu J, Li R, Liu P, Lian Y, Zheng X, Yan J, Huang J, Li M, Wu X, Wen L, Lao K, Li R, Qiao J, Tang F (September 2013). "Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells". Nature Structural & Molecular Biology. 20 (9): 1131–1139. doi:10.1038/nsmb.2660. PMID 23934149. S2CID 29209966.
  19. ^ Liu SJ, Nowakowski TJ, Pollen AA, Lui JH, Horlbeck MA, Attenello FJ, He D, Weissman JS, Kriegstein AR, Diaz AA, Lim DA (April 2016). "Single-cell analysis of long non-coding RNAs in the developing human neocortex". Genome Biology. 17: 67. doi:10.1186/s13059-016-0932-1. PMC 4831157. PMID 27081004.
  20. ^ Aliperti V, Skonieczna J, Cerase A (June 2021). "Long Non-Coding RNA (lncRNA) Roles in Cell Biology, Neurodevelopment and Neurological Disorders". Non-Coding RNA. 7 (2): 36. doi:10.3390/ncrna7020036. PMC 8293397. PMID 34204536.
  21. ^ Li, Zhao; Liu, Lin; Feng, Changrui; Qin, Yuxin; Xiao, Jingfa; Zhang, Zhang; Ma, Lina (2023-01-06). "LncBook 2.0: integrating human long non-coding RNAs with multi-omics annotations". Nucleic Acids Research. 51 (D1): D186–D191. doi:10.1093/nar/gkac999. ISSN 1362-4962. PMC 9825513. PMID 36330950.
  22. ^ Paytuví Gallart A, Hermoso Pulido A, Anzar Martínez de Lagrán I, Sanseverino W, Aiese Cigliano R (January 2016). "GREENC: a Wiki-based database of plant lncRNAs". Nucleic Acids Research. 44 (D1): D1161–6. doi:10.1093/nar/gkv1215. PMC 4702861. PMID 26578586.
  23. ^ Kapranov P, Cheng J, Dike S, Nix DA, Duttagupta R, Willingham AT, Stadler PF, Hertel J, Hackermüller J, Hofacker IL, Bell I, Cheung E, Drenkow J, Dumais E, Patel S, Helt G, Ganesh M, Ghosh S, Piccolboni A, Sementchenko V, Tammana H, Gingeras TR (June 2007). "RNA maps reveal new RNA classes and a possible function for pervasive transcription". Science. 316 (5830): 1484–1488. Bibcode:2007Sci...316.1484K. doi:10.1126/science.1138341. PMID 17510325. S2CID 25609839.
  24. ^ Kapranov P, Willingham AT, Gingeras TR (June 2007). "Genome-wide transcription and the implications for genomic organization". Nature Reviews Genetics. 8 (6): 413–423. doi:10.1038/nrg2083. PMID 17486121. S2CID 6465064.
  25. ^ Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, et al. (June 2007). "Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project". Nature. 447 (7146): 799–816. Bibcode:2007Natur.447..799B. doi:10.1038/nature05874. PMC 2212820. PMID 17571346.
  26. ^ Li A, Zhou H, Xiong S, Li J, Mallik S, Fei R, Liu Y, Zhou H, Wang X, Hei X, Wang L (October 2024). "PLEKv2: predicting lncRNAs and mRNAs based on intrinsic sequence features and the coding-net model". BMC Genomics. 25 (1): 756. doi:10.1186/s12864-024-10662-y. PMC 11295476. PMID 39095710.
  27. ^ Ritu, Gupta S, Kumar N, Shankar R (September 2022). "DeepPlnc: Bi-modal deep learning for highly accurate plant lncRNA discovery". Genomics. 114 (5): 110443. doi:10.1016/j.ygeno.2022.110443. PMID 35931273.
  28. ^ Camargo AP, Sourkov V, Pereira GA, Carazzolle MF (March 2020). "RNAsamba: neural network-based assessment of the protein-coding potential of RNA sequences". NAR Genomics and Bioinformatics. 2 (1): lqz024. doi:10.1093/nargab/lqz024. PMC 7671399. PMID 33575571.
  29. ^ Wang G, Yin H, Li B, Yu C, Wang F, Xu X, Cao J, Bao Y, Wang L, Abbasi AA, Bajic VB, Ma L, Zhang Z (January 2019). "Characterization and identification of long non-coding RNAs based on feature relationship". Bioinformatics. 41 (Database issue): D246–D251. doi:10.1093/bioinformatics/btz008. hdl:10754/631240. PMID 30649200.
  30. ^ Wang L, Park HJ, Dasari S, Wang S, Kocher JP, Li W (April 2013). "CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model". Nucleic Acids Research. 41 (6): e74. doi:10.1093/nar/gkt006. PMC 3616698. PMID 23335781.
  31. ^ Hu L, Xu Z, Hu B, Lu ZJ (January 2017). "COME: a robust coding potential calculation tool for lncRNA identification and characterization based on multiple features". Nucleic Acids Research. 45 (1): e2. doi:10.1093/nar/gkw798. PMC 5224497. PMID 27608726.
  32. ^ Sun L, Luo H, Bu D, Zhao G, Yu K, Zhang C, Liu Y, Chen R, Zhao Y (September 2013). "Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts". Nucleic Acids Research. 41 (17): e166. doi:10.1093/nar/gkt646. PMC 3783192. PMID 23892401.
  33. ^ Li A, Zhang J, Zhou Z (September 2014). "PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme". BMC Bioinformatics. 15 (1): 311. doi:10.1186/1471-2105-15-311. PMC 4177586. PMID 25239089.
  34. ^ Wucher V, Legeai F, Hédan B, Rizk G, Lagoutte L, Leeb T, et al. (May 2017). "FEELnc: a tool for long non-coding RNA annotation and its application to the dog transcriptome". Nucleic Acids Research. 45 (8): e57. doi:10.1093/nar/gkw1306. PMC 5416892. PMID 28053114.
  35. ^ Lin MF, Jungreis I, Kellis M (July 2011). "PhyloCSF: a comparative genomics method to distinguish protein coding and non-coding regions". Bioinformatics. 27 (13): i275–i282. doi:10.1093/bioinformatics/btr209. PMC 3117341. PMID 21685081.
  36. ^ Chen J, Shishkin AA, Zhu X, Kadri S, Maza I, Guttman M, Hanna JH, Regev A, Garber M (Feb 2016). "Evolutionary analysis across mammals reveals distinct classes of long non-coding RNAs". Genome Biology. 17 (19): 19. doi:10.1186/s13059-016-0880-9. PMC 4739325. PMID 26838501.
  37. ^ Anderson DM, Anderson KM, Chang CL, Makarewich CA, Nelson BR, McAnally JR, Kasaragod P, Shelton JM, Liou J, Bassel-Duby R, Olson EN (February 2015). "A micropeptide encoded by a putative long noncoding RNA regulates muscle performance". Cell. 160 (4): 595–606. doi:10.1016/j.cell.2015.01.009. PMC 4356254. PMID 25640239.
  38. ^ Matsumoto A, Pasut A, Matsumoto M, Yamashita R, Fung J, Monteleone E, Saghatelian A, Nakayama KI, Clohessy JG, Pandolfi PP (January 2017). "mTORC1 and muscle regeneration are regulated by the LINC00961-encoded SPAR polypeptide". Nature. 541 (7636): 228–232. Bibcode:2017Natur.541..228M. doi:10.1038/nature21034. PMID 28024296. S2CID 205253245.
  39. ^ Pauli A, Norris ML, Valen E, Chew GL, Gagnon JA, Zimmerman S, Mitchell A, Ma J, Dubrulle J, Reyon D, Tsai SQ, Joung JK, Saghatelian A, Schier AF (February 2014). "Toddler: an embryonic signal that promotes cell movement via Apelin receptors". Science. 343 (6172): 1248636. doi:10.1126/science.1248636. PMC 4107353. PMID 24407481.
  40. ^ Ingolia NT, Lareau LF, Weissman JS (November 2011). "Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes". Cell. 147 (4): 789–802. doi:10.1016/j.cell.2011.10.002. PMC 3225288. PMID 22056041.
  41. ^ a b Ji Z, Song R, Regev A, Struhl K (December 2015). "Many lncRNAs, 5'UTRs, and pseudogenes are translated and some are likely to express functional proteins". eLife. 4: e08890. doi:10.7554/eLife.08890. PMC 4739776. PMID 26687005.
  42. ^ Guttman M, Russell P, Ingolia NT, Weissman JS, Lander ES (July 2013). "Ribosome profiling provides evidence that large noncoding RNAs do not encode proteins". Cell. 154 (1): 240–251. doi:10.1016/j.cell.2013.06.009. PMC 3756563. PMID 23810193.
  43. ^ Guttman M, Amit I, Garber M, French C, Lin MF, Feldser D, Huarte M, Zuk O, Carey BW, Cassady JP, Cabili MN, Jaenisch R, Mikkelsen TS, Jacks T, Hacohen N, Bernstein BE, Kellis M, Regev A, Rinn JL, Lander ES (March 2009). "Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals". Nature. 458 (7235): 223–227. Bibcode:2009Natur.458..223G. doi:10.1038/nature07672. PMC 2754849. PMID 19182780.
  44. ^ Ponjavic J, Ponting CP, Lunter G (May 2007). "Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs". Genome Research. 17 (5): 556–565. doi:10.1101/gr.6036807. PMC 1855172. PMID 17387145.
  45. ^ Haerty W, Ponting CP (May 2013). "Mutations within lncRNAs are effectively selected against in fruitfly but not in human". Genome Biology. 14 (5): R49. doi:10.1186/gb-2013-14-5-r49. PMC 4053968. PMID 23710818.
  46. ^ Washietl S, Kellis M, Garber M (April 2014). "Evolutionary dynamics and tissue specificity of human long noncoding RNAs in six mammals". Genome Research. 24 (4): 616–628. doi:10.1101/gr.165035.113. PMC 3975061. PMID 24429298.
  47. ^ Kutter C, Watt S, Stefflova K, Wilson MD, Goncalves A, Ponting CP, Odom DT, Marques AC (2012). "Rapid turnover of long noncoding RNAs and the evolution of gene expression". PLOS Genetics. 8 (7): e1002841. doi:10.1371/journal.pgen.1002841. PMC 3406015. PMID 22844254.
  48. ^ Brosius J (May 2005). "Waste not, want not—transcript excess in multicellular eukaryotes". Trends in Genetics. 21 (5): 287–288. doi:10.1016/j.tig.2005.02.014. PMID 15851065.
  49. ^ Struhl K (February 2007). "Transcriptional noise and the fidelity of initiation by RNA polymerase II". Nature Structural & Molecular Biology. 14 (2): 103–105. doi:10.1038/nsmb0207-103. PMID 17277804. S2CID 29398526.
  50. ^ Kapusta A, Feschotte C (October 2014). "Volatile evolution of long noncoding RNA repertoires: mechanisms and biological implications". Trends in Genetics. 30 (10): 439–452. doi:10.1016/j.tig.2014.08.004. PMC 4464757. PMID 25218058.
  51. ^ Chen J, Shishkin AA, Zhu X, Kadri S, Maza I, Guttman M, Hanna JH, Regev A, Garber M (February 2016). "Evolutionary analysis across mammals reveals distinct classes of long non-coding RNAs". Genome Biology. 17: 19. doi:10.1186/s13059-016-0880-9. PMC 4739325. PMID 26838501.
  52. ^ Ulitsky I (October 2016). "Evolution to the rescue: using comparative genomics to understand long non-coding RNAs". Nature Reviews Genetics. 17 (10): 601–614. doi:10.1038/nrg.2016.85. PMID 27573374. S2CID 13833164.
  53. ^ Hezroni H, Koppstein D, Schwartz MG, Avrutin A, Bartel DP, Ulitsky I (May 2015). "Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species". Cell Reports. 11 (7): 1110–1122. doi:10.1016/j.celrep.2015.04.023. PMC 4576741. PMID 25959816.
  54. ^ Johnsson P, Lipovich L, Grandér D, Morris KV (March 2014). "Evolutionary conservation of long non-coding RNAs; sequence, structure, function". Biochimica et Biophysica Acta (BBA) - General Subjects. 1840 (3): 1063–1071. doi:10.1016/j.bbagen.2013.10.035. PMC 3909678. PMID 24184936.
  55. ^ Rivas E, Clements J, Eddy SR (January 2017). "A statistical test for conserved RNA structure shows lack of evidence for structure in lncRNAs". Nature Methods. 14 (1): 45–48. doi:10.1038/nmeth.4066. PMC 5554622. PMID 27819659.
  56. ^ Mercer TR, Dinger ME, Mattick JS (March 2009). "Long non-coding RNAs: insights into functions". Nature Reviews Genetics. 10 (3): 155–159. doi:10.1038/nrg2521. PMID 19188922. S2CID 18441501.
  57. ^ Dinger ME, Amaral PP, Mercer TR, Mattick JS (November 2009). "Pervasive transcription of the eukaryotic genome: functional indices and conceptual implications". Briefings in Functional Genomics & Proteomics. 8 (6): 407–423. doi:10.1093/bfgp/elp038. PMID 19770204.
  58. ^ Amaral PP, Clark MB, Gascoigne DK, Dinger ME, Mattick JS (January 2011). "lncRNAdb: a reference database for long noncoding RNAs". Nucleic Acids Research. 39 (Database issue): D146–51. doi:10.1093/nar/gkq1138. PMC 3013714. PMID 21112873.
  59. ^ Quek XC, Thomson DW, Maag JL, Bartonicek N, Signal B, Clark MB, Gloss BS, Dinger ME (January 2015). "lncRNAdb v2.0: expanding the reference database for functional long noncoding RNAs". Nucleic Acids Research. 43 (Database issue): D168–73. doi:10.1093/nar/gku988. PMC 4384040. PMID 25332394.
  60. ^ a b Liu, Lin; Li, Zhao; Liu, Chang; Zou, Dong; Li, Qianpeng; Feng, Changrui; Jing, Wei; Luo, Sicheng; Zhang, Zhang; Ma, Lina (2022-01-07). "LncRNAWiki 2.0: a knowledgebase of human long non-coding RNAs with enhanced curation model and database system". Nucleic Acids Research. 50 (D1): D190–D195. doi:10.1093/nar/gkab998. ISSN 1362-4962. PMC 8728265. PMID 34751395.
  61. ^ Smith JE, Alvarez-Dominguez JR, Kline N, Huynh NJ, Geisler S, Hu W, Coller J, Baker KE (June 2014). "Translation of small open reading frames within unannotated RNA transcripts in Saccharomyces cerevisiae". Cell Reports. 7 (6): 1858–1866. doi:10.1016/j.celrep.2014.05.023. PMC 4105149. PMID 24931603.
  62. ^ a b Goodrich JA, Kugel JF (August 2006). "Non-coding-RNA regulators of RNA polymerase II transcription". Nature Reviews Molecular Cell Biology. 7 (8): 612–616. doi:10.1038/nrm1946. PMID 16723972. S2CID 22274894.
  63. ^ a b Feng J, Bi C, Clark BS, Mady R, Shah P, Kohtz JD (June 2006). "The Evf-2 noncoding RNA is transcribed from the Dlx-5/6 ultraconserved region and functions as a Dlx-2 transcriptional coactivator". Genes & Development. 20 (11): 1470–1484. doi:10.1101/gad.1416106. PMC 1475760. PMID 16705037.
  64. ^ Panganiban G, Rubenstein JL (October 2002). "Developmental functions of the Distal-less/Dlx homeobox genes". Development. 129 (19): 4371–4386. doi:10.1242/dev.129.19.4371. PMID 12223397.
  65. ^ Pennacchio LA, Ahituv N, Moses AM, Prabhakar S, Nobrega MA, Shoukry M, Minovitsky S, Dubchak I, Holt A, Lewis KD, Plajzer-Frick I, Akiyama J, De Val S, Afzal V, Black BL, Couronne O, Eisen MB, Visel A, Rubin EM (November 2006). "In vivo enhancer analysis of human conserved non-coding sequences". Nature. 444 (7118): 499–502. Bibcode:2006Natur.444..499P. doi:10.1038/nature05295. PMID 17086198. S2CID 4307332.
  66. ^ Visel A, Prabhakar S, Akiyama JA, Shoukry M, Lewis KD, Holt A, Plajzer-Frick I, Afzal V, Rubin EM, Pennacchio LA (February 2008). "Ultraconservation identifies a small subset of extremely constrained developmental enhancers". Nature Genetics. 40 (2): 158–160. doi:10.1038/ng.2007.55. PMC 2647775. PMID 18176564.
  67. ^ Pibouin L, Villaudy J, Ferbus D, Muleris M, Prospéri MT, Remvikos Y, Goubin G (February 2002). "Cloning of the mRNA of overexpression in colon carcinoma-1: a sequence overexpressed in a subset of colon carcinomas". Cancer Genetics and Cytogenetics. 133 (1): 55–60. doi:10.1016/S0165-4608(01)00634-3. PMID 11890990.
  68. ^ a b c Calin GA, Liu CG, Ferracin M, Hyslop T, Spizzo R, Sevignani C, Fabbri M, Cimmino A, Lee EJ, Wojcik SE, Shimizu M, Tili E, Rossi S, Taccioli C, Pichiorri F, Liu X, Zupo S, Herlea V, Gramantieri L, Lanza G, Alder H, Rassenti L, Volinia S, Schmittgen TD, Kipps TJ, Negrini M, Croce CM (September 2007). "Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas". Cancer Cell. 12 (3): 215–229. doi:10.1016/j.ccr.2007.07.027. PMID 17785203.
  69. ^ Zhang, Troy; Yu, Hui; Bai, Yongsheng; Guo, Yan (2023). "Mutation density analyses on long noncoding RNA reveal comparable patterns to protein-coding RNA and prognostic value". Computational and Structural Biotechnology Journal. 21: 4887–4894. doi:10.1016/j.csbj.2023.09.027. ISSN 2001-0370. PMC 10582829. PMID 37860228.
  70. ^ Wang X, Arai S, Song X, Reichart D, Du K, Pascual G, Tempst P, Rosenfeld MG, Glass CK, Kurokawa R (July 2008). "Induced ncRNAs allosterically modify RNA-binding proteins in cis to inhibit transcription". Nature. 454 (7200): 126–130. Bibcode:2008Natur.454..126W. doi:10.1038/nature06992. PMC 2823488. PMID 18509338.
  71. ^ Adelman K, Egan E (March 2017). "Non-coding RNA: More uses for genomic junk". Nature. 543 (7644): 183–185. Bibcode:2017Natur.543..183A. doi:10.1038/543183a. PMID 28277509.
  72. ^ Halley P, Kadakkuzha BM, Faghihi MA, Magistri M, Zeier Z, Khorkova O, Coito C, Hsiao J, Lawrence M, Wahlestedt C (January 2014). "Regulation of the apolipoprotein gene cluster by a long noncoding RNA". Cell Reports. 6 (1): 222–230. doi:10.1016/j.celrep.2013.12.015. PMC 3924898. PMID 24388749.
  73. ^ Reinius B, Shi C, Hengshuo L, Sandhu KS, Radomska KJ, Rosen GD, Lu L, Kullander K, Williams RW, Jazin E (November 2010). "Female-biased expression of long non-coding RNAs in domains that escape X-inactivation in mouse". BMC Genomics. 11: 614. doi:10.1186/1471-2164-11-614. PMC 3091755. PMID 21047393.
  74. ^ Martianov I, Ramadass A, Serra Barros A, Chow N, Akoulitchev A (February 2007). "Repression of the human dihydrofolate reductase gene by a non-coding interfering transcript". Nature. 445 (7128): 666–670. doi:10.1038/nature05519. PMID 17237763. S2CID 3012142.
  75. ^ Lee JS, Burkholder GD, Latimer LJ, Haug BL, Braun RP (February 1987). "A monoclonal antibody to triplex DNA binds to eucaryotic chromosomes". Nucleic Acids Research. 15 (3): 1047–1061. doi:10.1093/nar/15.3.1047. PMC 340507. PMID 2434928.
  76. ^ a b Kwek KY, Murphy S, Furger A, Thomas B, O'Gorman W, Kimura H, Proudfoot NJ, Akoulitchev A (November 2002). "U1 snRNA associates with TFIIH and regulates transcriptional initiation". Nature Structural Biology. 9 (11): 800–805. doi:10.1038/nsb862. PMID 12389039. S2CID 22982547.
  77. ^ Yang S, Tutton S, Pierce E, Yoon K (November 2001). "Specific double-stranded RNA interference in undifferentiated mouse embryonic stem cells". Molecular and Cellular Biology. 21 (22): 7807–7816. doi:10.1128/MCB.21.22.7807-7816.2001. PMC 99950. PMID 11604515.
  78. ^ Yik JH, Chen R, Nishimura R, Jennings JL, Link AJ, Zhou Q (October 2003). "Inhibition of P-TEFb (CDK9/Cyclin T) kinase and RNA polymerase II transcription by the coordinated actions of HEXIM1 and 7SK snRNA". Molecular Cell. 12 (4): 971–982. doi:10.1016/S1097-2765(03)00388-5. PMID 14580347.
  79. ^ Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. (February 2001). "Initial sequencing and analysis of the human genome". Nature. 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi:10.1038/35057062. hdl:2027.42/62798. PMID 11237011.
  80. ^ Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, Agarwal P, et al. (December 2002). "Initial sequencing and comparative analysis of the mouse genome". Nature. 420 (6915): 520–562. Bibcode:2002Natur.420..520W. doi:10.1038/nature01262. PMID 12466850.
  81. ^ Liu WM, Chu WM, Choudary PV, Schmid CW (May 1995). "Cell stress and translational inhibitors transiently increase the abundance of mammalian SINE transcripts". Nucleic Acids Research. 23 (10): 1758–1765. doi:10.1093/nar/23.10.1758. PMC 306933. PMID 7784180.
  82. ^ a b Allen E, Xie Z, Gustafson AM, Sung GH, Spatafora JW, Carrington JC (December 2004). "Evolution of microRNA genes by inverted duplication of target gene sequences in Arabidopsis thaliana". Nature Genetics. 36 (12): 1282–1290. doi:10.1038/ng1478. PMID 15565108. S2CID 11997028.
  83. ^ a b Espinoza CA, Allen TA, Hieb AR, Kugel JF, Goodrich JA (September 2004). "B2 RNA binds directly to RNA polymerase II to repress transcript synthesis". Nature Structural & Molecular Biology. 11 (9): 822–829. doi:10.1038/nsmb812. PMID 15300239. S2CID 22199826.
  84. ^ Espinoza CA, Goodrich JA, Kugel JF (April 2007). "Characterization of the structure, function, and mechanism of B2 RNA, an ncRNA repressor of RNA polymerase II transcription". RNA. 13 (4): 583–596. doi:10.1261/rna.310307. PMC 1831867. PMID 17307818.
  85. ^ a b c d e f Mariner PD, Walters RD, Espinoza CA, Drullinger LF, Wagner SD, Kugel JF, Goodrich JA (February 2008). "Human Alu RNA is a modular transacting repressor of mRNA transcription during heat shock". Molecular Cell. 29 (4): 499–509. doi:10.1016/j.molcel.2007.12.013. PMID 18313387.
  86. ^ Shamovsky I, Nudler E (February 2008). "Modular RNA heats up". Molecular Cell. 29 (4): 415–417. doi:10.1016/j.molcel.2008.02.001. PMID 18313380.
  87. ^ Mattick JS (October 2003). "Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms". BioEssays. 25 (10): 930–939. CiteSeerX 10.1.1.476.7561. doi:10.1002/bies.10332. PMID 14505360.
  88. ^ Mohammad F, Pandey RR, Nagano T, Chakalova L, Mondal T, Fraser P, Kanduri C (June 2008). "Kcnq1ot1/Lit1 noncoding RNA mediates transcriptional silencing by targeting to the perinucleolar region". Molecular and Cellular Biology. 28 (11): 3713–3728. doi:10.1128/MCB.02263-07. PMC 2423283. PMID 18299392.
  89. ^ Wutz A, Rasmussen TP, Jaenisch R (February 2002). "Chromosomal silencing and localization are mediated by different domains of Xist RNA". Nature Genetics. 30 (2): 167–174. doi:10.1038/ng820. PMID 11780141. S2CID 28643222.
  90. ^ Zearfoss NR, Chan AP, Kloc M, Allen LH, Etkin LD (April 2003). "Identification of new Xlsirt family members in the Xenopus laevis oocyte". Mechanisms of Development. 120 (4): 503–509. doi:10.1016/S0925-4773(02)00459-8. PMID 12676327. S2CID 16781978.
  91. ^ Singh K, Carey M, Saragosti S, Botchan M (1985). "Expression of enhanced levels of small RNA polymerase III transcripts encoded by the B2 repeats in simian virus 40-transformed mouse cells". Nature. 314 (6011): 553–556. Bibcode:1985Natur.314..553S. doi:10.1038/314553a0. PMID 2581137. S2CID 4359937.
  92. ^ Tang RB, Wang HY, Lu HY, Xiong J, Li HH, Qiu XH, Liu HQ (February 2005). "Increased level of polymerase III transcribed Alu RNA in hepatocellular carcinoma tissue". Molecular Carcinogenesis. 42 (2): 93–96. doi:10.1002/mc.20057. PMID 15593371. S2CID 10513502.
  93. ^ a b Shamovsky I, Nudler E (October 2006). "Gene control by large noncoding RNAs". Science's STKE. 2006 (355): pe40. doi:10.1126/stke.3552006pe40. PMID 17018852. S2CID 41151259.
  94. ^ a b Dieci G, Fiorino G, Castelnuovo M, Teichmann M, Pagano A (December 2007). "The expanding RNA polymerase III transcriptome". Trends in Genetics. 23 (12): 614–622. doi:10.1016/j.tig.2007.09.001. hdl:11381/1706964. PMID 17977614.
  95. ^ Pagano JM, Farley BM, McCoig LM, Ryder SP (March 2007). "Molecular basis of RNA recognition by the embryonic polarity determinant MEX-5". The Journal of Biological Chemistry. 282 (12): 8883–8894. doi:10.1074/jbc.M700079200. PMID 17264081.
  96. ^ Yoon JH, Abdelmohsen K, Gorospe M (October 2013). "Posttranscriptional gene regulation by long noncoding RNA". Journal of Molecular Biology. 425 (19): 3723–3730. doi:10.1016/j.jmb.2012.11.024. PMC 3594629. PMID 23178169.
  97. ^ a b Beltran M, Puig I, Peña C, García JM, Alvarez AB, Peña R, Bonilla F, de Herreros AG (March 2008). "A natural antisense transcript regulates Zeb2/Sip1 gene expression during Snail1-induced epithelial-mesenchymal transition". Genes & Development. 22 (6): 756–769. doi:10.1101/gad.455708. PMC 2275429. PMID 18347095.
  98. ^ Munroe SH, Lazar MA (November 1991). "Inhibition of c-erbA mRNA splicing by a naturally occurring antisense RNA". The Journal of Biological Chemistry. 266 (33): 22083–22086. doi:10.1016/S0021-9258(18)54535-X. PMID 1657988.
  99. ^ Tiedge H, Chen W, Brosius J (June 1993). "Primary structure, neural-specific expression, and dendritic location of human BC200 RNA". The Journal of Neuroscience. 13 (6): 2382–2390. doi:10.1523/JNEUROSCI.13-06-02382.1993. PMC 6576500. PMID 7684772.
  100. ^ Tiedge H, Fremeau RT, Weinstock PH, Arancio O, Brosius J (March 1991). "Dendritic location of neural BC1 RNA". Proceedings of the National Academy of Sciences of the United States of America. 88 (6): 2093–2097. Bibcode:1991PNAS...88.2093T. doi:10.1073/pnas.88.6.2093. PMC 51175. PMID 1706516.
  101. ^ Muslimov IA, Banker G, Brosius J, Tiedge H (June 1998). "Activity-dependent regulation of dendritic BC1 RNA in hippocampal neurons in culture". The Journal of Cell Biology. 141 (7): 1601–1611. doi:10.1083/jcb.141.7.1601. PMC 1828539. PMID 9647652.
  102. ^ Wang H, Iacoangeli A, Lin D, Williams K, Denman RB, Hellen CU, Tiedge H (December 2005). "Dendritic BC1 RNA in translational control mechanisms". The Journal of Cell Biology. 171 (5): 811–821. doi:10.1083/jcb.200506006. PMC 1828541. PMID 16330711.
  103. ^ Centonze D, Rossi S, Napoli I, Mercaldo V, Lacoux C, Ferrari F, Ciotti MT, De Chiara V, Prosperetti C, Maccarrone M, Fezza F, Calabresi P, Bernardi G, Bagni C (August 2007). "The brain cytoplasmic RNA BC1 regulates dopamine D2 receptor-mediated transmission in the striatum". The Journal of Neuroscience. 27 (33): 8885–8892. doi:10.1523/JNEUROSCI.0548-07.2007. PMC 6672174. PMID 17699670.
  104. ^ Lewejohann L, Skryabin BV, Sachser N, Prehn C, Heiduschka P, Thanos S, Jordan U, Dell'Omo G, Vyssotski AL, Pleskacheva MG, Lipp HP, Tiedge H, Brosius J, Prior H (September 2004). "Role of a neuronal small non-messenger RNA: behavioural alterations in BC1 RNA-deleted mice". Behavioural Brain Research. 154 (1): 273–289. CiteSeerX 10.1.1.572.8071. doi:10.1016/j.bbr.2004.02.015. PMID 15302134. S2CID 18840384.
  105. ^ Golden DE, Gerbasi VR, Sontheimer EJ (August 2008). "An inside job for siRNAs". Molecular Cell. 31 (3): 309–312. doi:10.1016/j.molcel.2008.07.008. PMC 2675693. PMID 18691963.
  106. ^ Czech B, Malone CD, Zhou R, Stark A, Schlingeheyde C, Dus M, Perrimon N, Kellis M, Wohlschlegel JA, Sachidanandam R, Hannon GJ, Brennecke J (June 2008). "An endogenous small interfering RNA pathway in Drosophila". Nature. 453 (7196): 798–802. Bibcode:2008Natur.453..798C. doi:10.1038/nature07007. PMC 2895258. PMID 18463631.
  107. ^ a b Ogawa Y, Sun BK, Lee JT (June 2008). "Intersection of the RNA interference and X-inactivation pathways". Science. 320 (5881): 1336–1341. Bibcode:2008Sci...320.1336O. doi:10.1126/science.1157676. PMC 2584363. PMID 18535243.
  108. ^ Kiefer JC (April 2007). "Epigenetics in development". Developmental Dynamics. 236 (4): 1144–1156. doi:10.1002/dvdy.21094. PMID 17304537. S2CID 23292265.
  109. ^ a b Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, Alvarez P, Brockman W, Kim TK, Koche RP, Lee W, Mendenhall E, O'Donovan A, Presser A, Russ C, Xie X, Meissner A, Wernig M, Jaenisch R, Nusbaum C, Lander ES, Bernstein BE (August 2007). "Genome-wide maps of chromatin state in pluripotent and lineage-committed cells". Nature. 448 (7153): 553–560. Bibcode:2007Natur.448..553M. doi:10.1038/nature06008. PMC 2921165. PMID 17603471.
  110. ^ Nickerson JA, Krochmalnic G, Wan KM, Penman S (January 1989). "Chromatin architecture and nuclear RNA". Proceedings of the National Academy of Sciences of the United States of America. 86 (1): 177–181. Bibcode:1989PNAS...86..177N. doi:10.1073/pnas.86.1.177. PMC 286427. PMID 2911567.
  111. ^ Rodríguez-Campos A, Azorín F (November 2007). "RNA is an integral component of chromatin that contributes to its structural organization". PLOS ONE. 2 (11): e1182. Bibcode:2007PLoSO...2.1182R. doi:10.1371/journal.pone.0001182. PMC 2063516. PMID 18000552.
  112. ^ Chen X, Xu H, Yuan P, Fang F, Huss M, Vega VB, Wong E, Orlov YL, Zhang W, Jiang J, Loh YH, Yeo HC, Yeo ZX, Narang V, Govindarajan KR, Leong B, Shahab A, Ruan Y, Bourque G, Sung WK, Clarke ND, Wei CL, Ng HH (June 2008). "Integration of external signaling pathways with the core transcriptional network in embryonic stem cells". Cell. 133 (6): 1106–1117. doi:10.1016/j.cell.2008.04.043. PMID 18555785. S2CID 1768190.
  113. ^ a b c Rinn JL, Kertesz M, Wang JK, Squazzo SL, Xu X, Brugmann SA, Goodnough LH, Helms JA, Farnham PJ, Segal E, Chang HY (June 2007). "Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs". Cell. 129 (7): 1311–1323. doi:10.1016/j.cell.2007.05.022. PMC 2084369. PMID 17604720.
  114. ^ a b Sanchez-Elsner T, Gou D, Kremmer E, Sauer F (February 2006). "Noncoding RNAs of trithorax response elements recruit Drosophila Ash1 to Ultrabithorax". Science. 311 (5764): 1118–1123. Bibcode:2006Sci...311.1118S. doi:10.1126/science.1117705. PMID 16497925. S2CID 16423723. (Retracted, see doi:10.1126/science.344.6187.981-a, PMID 24876484,  Retraction Watch. If this is an intentional citation to a retracted paper, please replace {{retracted|...}} with {{retracted|...|intentional=yes}}.)
  115. ^ Jia L, Wang Y, Wang C, Du Z, Zhang S, Wen X, Zhang S (2020). "Oplr16 serves as a novel chromatin factor to control stem cell fate by modulating pluripotency-specific chromosomal looping and TET2-mediated DNA demethylation". Nucleic Acids Research. 48 (7): 3935–3948. doi:10.1093/nar/gkaa097. PMC 7144914. PMID 32055844.  This article incorporates text from this source, which is available under the CC BY 4.0 license.
  116. ^ Mazo A, Hodgson JW, Petruk S, Sedkov Y, Brock HW (August 2007). "Transcriptional interference: an unexpected layer of complexity in gene regulation". Journal of Cell Science. 120 (Pt 16): 2755–2761. doi:10.1242/jcs.007633. PMID 17690303. S2CID 16059065.
  117. ^ Cerase A, Tartaglia GG (September 2020). "Long non-coding RNA-polycomb intimate rendezvous". Open Biology. 10 (9): 200126. doi:10.1098/rsob.200126. PMC 7536065. PMID 32898472.
  118. ^ Denisenko O, Shnyreva M, Suzuki H, Bomsztyk K (October 1998). "Point mutations in the WD40 domain of Eed block its interaction with Ezh2". Molecular and Cellular Biology. 18 (10): 5634–5642. doi:10.1128/MCB.18.10.5634. PMC 109149. PMID 9742080.
  119. ^ Katayama S, Tomaru Y, Kasukawa T, Waki K, Nakanishi M, Nakamura M, Nishida H, Yap CC, Suzuki M, Kawai J, Suzuki H, Carninci P, Hayashizaki Y, Wells C, Frith M, Ravasi T, Pang KC, Hallinan J, Mattick J, Hume DA, Lipovich L, Batalov S, Engström PG, Mizuno Y, Faghihi MA, Sandelin A, Chalk AM, Mottagui-Tabar S, Liang Z, Lenhard B, Wahlestedt C (September 2005). "Antisense transcription in the mammalian transcriptome". Science. 309 (5740): 1564–1566. Bibcode:2005Sci...309.1564R. doi:10.1126/science.1112009. PMID 16141073. S2CID 34559885.
  120. ^ a b c Yu W, Gius D, Onyango P, Muldoon-Jacobs K, Karp J, Feinberg AP, Cui H (January 2008). "Epigenetic silencing of tumour suppressor gene p15 by its antisense RNA". Nature. 451 (7175): 202–206. Bibcode:2008Natur.451..202Y. doi:10.1038/nature06468. PMC 2743558. PMID 18185590.
  121. ^ Pauler FM, Koerner MV, Barlow DP (June 2007). "Silencing by imprinted noncoding RNAs: is transcription the answer?". Trends in Genetics. 23 (6): 284–292. doi:10.1016/j.tig.2007.03.018. PMC 2847181. PMID 17445943.
  122. ^ Braidotti G, Baubec T, Pauler F, Seidl C, Smrzka O, Stricker S, Yotova I, Barlow DP (2004). "The Air noncoding RNA: an imprinted cis-silencing transcript". Cold Spring Harbor Symposia on Quantitative Biology. 69: 55–66. doi:10.1101/sqb.2004.69.55. PMC 2847179. PMID 16117633.
  123. ^ Mitsuya K, Meguro M, Lee MP, Katoh M, Schulz TC, Kugoh H, Yoshida MA, Niikawa N, Feinberg AP, Oshimura M (July 1999). "LIT1, an imprinted antisense RNA in the human KvLQT1 locus identified by screening for differentially expressed transcripts using monochromosomal hybrids". Human Molecular Genetics. 8 (7): 1209–1217. doi:10.1093/hmg/8.7.1209. PMID 10369866.
  124. ^ Mancini-Dinardo D, Steele SJ, Levorse JM, Ingram RS, Tilghman SM (May 2006). "Elongation of the Kcnq1ot1 transcript is required for genomic imprinting of neighboring genes". Genes & Development. 20 (10): 1268–1282. doi:10.1101/gad.1416906. PMC 1472902. PMID 16702402.
  125. ^ a b Umlauf D, Goto Y, Cao R, Cerqueira F, Wagschal A, Zhang Y, Feil R (December 2004). "Imprinting along the Kcnq1 domain on mouse chromosome 7 involves repressive histone methylation and recruitment of Polycomb group complexes". Nature Genetics. 36 (12): 1296–1300. doi:10.1038/ng1467. PMID 15516932. S2CID 19084498.
  126. ^ Sleutels F, Zwart R, Barlow DP (February 2002). "The non-coding Air RNA is required for silencing autosomal imprinted genes". Nature. 415 (6873): 810–813. Bibcode:2002Natur.415..810S. doi:10.1038/415810a. PMID 11845212. S2CID 4420245.
  127. ^ Zwart R, Sleutels F, Wutz A, Schinkel AH, Barlow DP (September 2001). "Bidirectional action of the Igf2r imprint control element on upstream and downstream imprinted genes". Genes & Development. 15 (18): 2361–2366. doi:10.1101/gad.206201. PMC 312779. PMID 11562346.
  128. ^ Fournier C, Goto Y, Ballestar E, Delaval K, Hever AM, Esteller M, Feil R (December 2002). "Allele-specific histone lysine methylation marks regulatory regions at imprinted mouse genes". The EMBO Journal. 21 (23): 6560–6570. doi:10.1093/emboj/cdf655. PMC 136958. PMID 12456662.
  129. ^ a b Wutz A, Gribnau J (October 2007). "X inactivation Xplained". Current Opinion in Genetics & Development. 17 (5): 387–393. doi:10.1016/j.gde.2007.08.001. PMID 17869504.
  130. ^ Morey C, Navarro P, Debrand E, Avner P, Rougeulle C, Clerc P (February 2004). "The region 3′ to Xist mediates X chromosome counting and H3 Lys-4 dimethylation within the Xist gene". The EMBO Journal. 23 (3): 594–604. doi:10.1038/sj.emboj.7600071. PMC 1271805. PMID 14749728.
  131. ^ Costanzi C, Pehrson JR (June 1998). "Histone macroH2A1 is concentrated in the inactive X chromosome of female mammals". Nature. 393 (6685): 599–601. Bibcode:1998Natur.393..599C. doi:10.1038/31275. PMID 9634239. S2CID 205001095.
  132. ^ Blasco MA (October 2007). "Telomere length, stem cells and aging". Nature Chemical Biology. 3 (10): 640–649. doi:10.1038/nchembio.2007.38. PMID 17876321.
  133. ^ a b Schoeftner S, Blasco MA (February 2008). "Developmentally regulated transcription of mammalian telomeres by DNA-dependent RNA polymerase II". Nature Cell Biology. 10 (2): 228–236. doi:10.1038/ncb1685. PMID 18157120. S2CID 5890629.
  134. ^ a b Azzalin CM, Reichenbach P, Khoriauli L, Giulotto E, Lingner J (November 2007). "Telomeric repeat containing RNA and RNA surveillance factors at mammalian chromosome ends". Science. 318 (5851): 798–801. Bibcode:2007Sci...318..798A. doi:10.1126/science.1147182. PMID 17916692. S2CID 20693275.
  135. ^ Donley N, Stoffregen EP, Smith L, Montagna C, Thayer MJ (April 2013). Bartolomei MS (ed.). "Asynchronous replication, mono-allelic expression, and long range Cis-effects of ASAR6". PLOS Genetics. 9 (4): e1003423. doi:10.1371/journal.pgen.1003423. PMC 3617217. PMID 23593023.
  136. ^ Donley N, Smith L, Thayer MJ (January 2015). Bartolomei MS (ed.). "ASAR15, A cis-acting locus that controls chromosome-wide replication timing and stability of human chromosome 15". PLOS Genetics. 11 (1): e1004923. doi:10.1371/journal.pgen.1004923. PMC 4287527. PMID 25569254.
  137. ^ Heskett MB, Smith LG, Spellman P, Thayer MJ (June 2020). "Reciprocal monoallelic expression of ASAR lncRNA genes controls replication timing of human chromosome 6". RNA. 26 (6): 724–738. doi:10.1261/rna.073114.119. PMC 7266157. PMID 32144193.
  138. ^ Murashko MM, Stasevich EM, Schwartz AM, Kuprash DV, Uvarova AN, Demin DE (April 2021). Blanco FJ (ed.). "The Role of RNA in DNA Breaks, Repair and Chromosomal Rearrangements". Biomolecules. 11 (4): 550. doi:10.3390/biom11040550. PMC 8069526. PMID 33918762.
  139. ^ Ma L, Cao J, Liu L, Du Q, Li Z, Zou D, Bajic VB, and Zhang Z (Jan 2019). "LncBook: a curated knowledgebase of human long non-coding RNAs". Nucleic Acids Research. 47 (Database issue): D128–D134. doi:10.1093/nar/gky960. PMC 6323930. PMID 30329098.
  140. ^ Lukiw WJ, Handley P, Wong L, Crapper McLachlan DR (June 1992). "BC200 RNA in normal human neocortex, non-Alzheimer dementia (NAD), and senile dementia of the Alzheimer type (AD)". Neurochemical Research. 17 (6): 591–597. doi:10.1007/bf00968788. PMID 1603265. S2CID 39305428.
  141. ^ Watson JB, Sutcliffe JG (September 1987). "Primate brain-specific cytoplasmic transcript of the Alu repeat family". Molecular and Cellular Biology. 7 (9): 3324–3327. doi:10.1128/MCB.7.9.3324. PMC 367971. PMID 2444875.
  142. ^ a b Fu X, Ravindranath L, Tran N, Petrovics G, Srivastava S (March 2006). "Regulation of apoptosis by a prostate-specific and prostate cancer-associated noncoding gene, PCGEM1". DNA and Cell Biology. 25 (3): 135–141. doi:10.1089/dna.2006.25.135. PMID 16569192.
  143. ^ Bardhan A, Banerjee A, Basu K, Pal DK, Ghosh A (January 2022). "PRNCR1: a long non-coding RNA with a pivotal oncogenic role in cancer". Human Genetics. 141 (1): 15–29. doi:10.1007/s00439-021-02396-8. PMC 8561087. PMID 34727260.
  144. ^ Vausort M, Wagner DR, Devaux Y (September 2014). "Long noncoding RNAs in patients with acute myocardial infarction". Circulation Research. 115 (7): 668–677. doi:10.1161/CIRCRESAHA.115.303836. PMID 25035150. S2CID 26576988.
  145. ^ Boeckel JN, Perret MF, Glaser SF, Seeger T, Heumüller AW, Chen W, et al. (January 2019). "Identification and regulation of the long non-coding RNA Heat2 in heart failure". Journal of Molecular and Cellular Cardiology. 126: 13–22. doi:10.1016/j.yjmcc.2018.11.004. PMID 30445017. S2CID 53569324.
  146. ^ Lin R, Maeda S, Liu C, Karin M, Edgington TS (February 2007). "A large noncoding RNA is a marker for murine hepatocellular carcinomas and a spectrum of human carcinomas". Oncogene. 26 (6): 851–858. doi:10.1038/sj.onc.1209846. PMID 16878148. S2CID 9657308.
  147. ^ Reis EM, Nakaya HI, Louro R, Canavez FC, Flatschart AV, Almeida GT, Egidio CM, Paquola AC, Machado AA, Festa F, Yamamoto D, Alvarenga R, da Silva CC, Brito GC, Simon SD, Moreira-Filho CA, Leite KR, Camara-Lopes LH, Campos FS, Gimba E, Vignal GM, El-Dorry H, Sogayar MC, Barcinski MA, da Silva AM, Verjovski-Almeida S (August 2004). "Antisense intronic non-coding RNA levels correlate to the degree of tumor differentiation in prostate cancer". Oncogene. 23 (39): 6684–6692. doi:10.1038/sj.onc.1207880. PMID 15221013. S2CID 260632006.
  148. ^ Eis PS, Tam W, Sun L, Chadburn A, Li Z, Gomez MF, Lund E, Dahlberg JE (March 2005). "Accumulation of miR-155 and BIC RNA in human B cell lymphomas". Proceedings of the National Academy of Sciences of the United States of America. 102 (10): 3627–3632. Bibcode:2005PNAS..102.3627E. doi:10.1073/pnas.0500613102. PMC 552785. PMID 15738415.
  149. ^ Li J, Witte DP, Van Dyke T, Askew DS (April 1997). "Expression of the putative proto-oncogene His-1 in normal and neoplastic tissues". The American Journal of Pathology. 150 (4): 1297–1305. PMC 1858164. PMID 9094986.
  150. ^ Sonkoly E, Bata-Csorgo Z, Pivarcsi A, Polyanka H, Kenderessy-Szabo A, Molnar G, Szentpali K, Bari L, Megyeri K, Mandi Y, Dobozy A, Kemeny L, Szell M (June 2005). "Identification and characterization of a novel, psoriasis susceptibility-related noncoding RNA gene, PRINS" (PDF). The Journal of Biological Chemistry. 280 (25): 24159–24167. doi:10.1074/jbc.M501704200. PMID 15855153.
  151. ^ Ishii N, Ozaki K, Sato H, Mizuno H, Saito S, Takahashi A, Miyamoto Y, Ikegawa S, Kamatani N, Hori M, Saito S, Nakamura Y, Tanaka T (2006). "Identification of a novel non-coding RNA, MIAT, that confers risk of myocardial infarction". Journal of Human Genetics. 51 (12): 1087–1099. doi:10.1007/s10038-006-0070-9. PMID 17066261.
  152. ^ McPherson R, Pertsemlidis A, Kavaslar N, Stewart A, Roberts R, Cox DR, Hinds DA, Pennacchio LA, Tybjaerg-Hansen A, Folsom AR, Boerwinkle E, Hobbs HH, Cohen JC (June 2007). "A common allele on chromosome 9 associated with coronary heart disease". Science. 316 (5830): 1488–1491. Bibcode:2007Sci...316.1488M. doi:10.1126/science.1142447. PMC 2711874. PMID 17478681.
  153. ^ Pasmant E, Laurendeau I, Héron D, Vidaud M, Vidaud D, Bièche I (April 2007). "Characterization of a germ-line deletion, including the entire INK4/ARF locus, in a melanoma-neural system tumor family: identification of ANRIL, an antisense noncoding RNA whose expression coclusters with ARF". Cancer Research. 67 (8): 3963–3969. doi:10.1158/0008-5472.CAN-06-2004. PMID 17440112.
  154. ^ Broadbent HM, Peden JF, Lorkowski S, Goel A, Ongen H, Green F, Clarke R, Collins R, Franzosi MG, Tognoni G, Seedorf U, Rust S, Eriksson P, Hamsten A, Farrall M, Watkins H (March 2008). "Susceptibility to coronary artery disease and diabetes is encoded by distinct, tightly linked SNPs in the ANRIL locus on chromosome 9p". Human Molecular Genetics. 17 (6): 806–814. doi:10.1093/hmg/ddm352. PMID 18048406.
  155. ^ a b Jarinova O, Stewart AF, Roberts R, Wells G, Lau P, Naing T, Buerki C, McLean BW, Cook RC, Parker JS, McPherson R (October 2009). "Functional analysis of the chromosome 9p21.3 coronary artery disease risk locus". Arteriosclerosis, Thrombosis, and Vascular Biology. 29 (10): 1671–1677. doi:10.1161/ATVBAHA.109.189522. PMID 19592466.
  156. ^ Liu Y, Sanoff HK, Cho H, Burd CE, Torrice C, Mohlke KL, Ibrahim JG, Thomas NE, Sharpless NE (April 2009). "INK4/ARF transcript expression is associated with chromosome 9p21 variants linked to atherosclerosis". PLOS ONE. 4 (4): e5027. Bibcode:2009PLoSO...4.5027L. doi:10.1371/journal.pone.0005027. PMC 2660422. PMID 19343170.
  157. ^ Liu Y, Sanoff HK, Cho H, Burd CE, Torrice C, Mohlke KL, Ibrahim JG, Thomas NE, Sharpless NE (April 2024). "Distinct Plasma Extracellular Vesicle Transcriptomes in Acute Decompensated Heart Failure Subtypes: A Liquid Biopsy Approach". Circulation. 149 (14): 1147–1149. doi:10.1161/CIRCULATIONAHA.123.065513. PMC 10987042. PMID 38557125.
  158. ^ Shirasawa S, Harada H, Furugaki K, Akamizu T, Ishikawa N, Ito K, Ito K, Tamai H, Kuma K, Kubota S, Hiratani H, Tsuchiya T, Baba I, Ishikawa M, Tanaka M, Sakai K, Aoki M, Yamamoto K, Sasazuki T (October 2004). "SNPs in the promoter of a B cell-specific antisense transcript, SAS-ZFAT, determine susceptibility to autoimmune thyroid disease". Human Molecular Genetics. 13 (19): 2221–2231. doi:10.1093/hmg/ddh245. PMID 15294872.
  159. ^ Faghihi MA, Modarresi F, Khalil AM, Wood DE, Sahagan BG, Morgan TE, Finch CE, St Laurent G, Kenny PJ, Wahlestedt C (July 2008). "Expression of a noncoding RNA is elevated in Alzheimer's disease and drives rapid feed-forward regulation of beta-secretase". Nature Medicine. 14 (7): 723–730. doi:10.1038/nm1784. PMC 2826895. PMID 18587408.
  160. ^ Tufarelli C, Stanley JA, Garrick D, Sharpe JA, Ayyub H, Wood WG, Higgs DR (June 2003). "Transcription of antisense RNA leading to gene silencing and methylation as a novel cause of human genetic disease". Nature Genetics. 34 (2): 157–165. doi:10.1038/ng1157. PMID 12730694. S2CID 7226446.
  161. ^ de Lima DS, Cardozo LE, Maracaja-Coutinho V, Suhrbier A, Mane K, Jeffries D, et al. (August 2019). "Long noncoding RNAs are involved in multiple immunological pathways in response to vaccination". Proceedings of the National Academy of Sciences of the United States of America. 116 (34): 17121–17126. Bibcode:2019PNAS..11617121D. doi:10.1073/pnas.1822046116. PMC 6708379. PMID 31399544.
  162. ^ Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y. (November 2012). "Sizing up long non-coding RNAs: Do lncRNAs have secondary and tertiary structure?". BioArchitecture. 2 (6): 189–199. doi:10.4161/bioa.22592. ISSN 1949-0992. PMC 3527312. PMID 23267412.
  163. ^ Chillón, Isabel; Marcia, Marco (2020-10-12). "The molecular structure of long non-coding RNAs: emerging patterns and functional implications". Critical Reviews in Biochemistry and Molecular Biology. 55 (6): 662–690. doi:10.1080/10409238.2020.1828259. ISSN 1040-9238. PMID 33043695.
  164. ^ Uroda, Tina; Anastasakou, Eleni; Rossi, Annalisa; Teulon, Jean-Marie; Pellequer, Jean-Luc; Annibale, Paolo; Pessey, Ombeline; Inga, Alberto; Chillón, Isabel; Marcia, Marco (September 2019). "Conserved Pseudoknots in lncRNA MEG3 Are Essential for Stimulation of the p53 Pathway". Molecular Cell. 75 (5): 982–995.e9. doi:10.1016/j.molcel.2019.07.025. ISSN 1097-2765. PMC 6739425. PMID 31444106.
  165. ^ Kim, Doo Nam; Thiel, Bernhard C.; Mrozowich, Tyler; Hennelly, Scott P.; Hofacker, Ivo L.; Patel, Trushar R.; Sanbonmatsu, Karissa Y. (2020-01-09). "Zinc-finger protein CNBP alters the 3-D structure of lncRNA Braveheart in solution". Nature Communications. 11 (1): 148. Bibcode:2020NatCo..11..148K. doi:10.1038/s41467-019-13942-4. ISSN 2041-1723. PMC 6952434. PMID 31919376.
  166. ^ Aguilar, Rodrigo; Spencer, Kerrie B.; Kesner, Barry; Rizvi, Noreen F.; Badmalia, Maulik D.; Mrozowich, Tyler; Mortison, Jonathan D.; Rivera, Carlos; Smith, Graham F.; Burchard, Julja; Dandliker, Peter J.; Patel, Trushar R.; Nickbarg, Elliott B.; Lee, Jeannie T. (2022-03-30). "Targeting Xist with compounds that disrupt RNA structure and X inactivation". Nature. 604 (7904): 160–166. Bibcode:2022Natur.604..160A. doi:10.1038/s41586-022-04537-z. ISSN 0028-0836. PMC 11549687. PMID 35355011.