Physiological, Biochemical, and Molecular-Genetic Events Governing the Recognition and Resolution of RNA/DNA Hybrids
- Robert J. Crouch, PhD, Head, Section on Formation of RNA
- Susana M. Cerritelli, PhD, Staff Scientist
- Kiran Sakhuja, MS, MSc, Biologist
- Caitlin Darling, BS, Postbaccalaurate Fellow
- Andrea Mota, BS, Postbaccalaureate Fellow
- Alexander Robinson, BS, Postbaccalaureate Fellow
- Matthew Roy, BS, Postbaccalaurate Fellow
Ribonucleases H (RNases H) are considered essential enzymes in multicellular organisms, thereby placing the genes that encode the enzymes in the housekeeping category. Damaged DNA is a leading cause of many human diseases and disorders. We study the formation and resolution of RNA/DNA hybrids, which occur during DNA replication and RNA transcription. Such hybrid molecules may lead to increased DNA damage, but may also play critical roles in normal cellular processes. We are interested in how RNA/DNA hybrids are resolved and in the role that RNases H play in their elimination. Two classes of RNases H, Class I and Class II, are present in most organisms.
Patients with mutations in the RNASEH1 gene exhibit typical mitochondrial myopathy symptoms (neuromuscular disorder). We were the first to show that RNase H1 is essential for the maintenance of mitochondrial DNA. More than 1,000 proteins are targeted to mitochondria, many of which, when mutated, are known causes of mitochondrial myopathies. Mice deleted for the Rnaseh1 gene arrest embryonic development at day 8.5 because of failure to amplify mitochondrial DNA.
Aicardi-Goutières syndrome (AGS), a severe neurological disorder with symptoms appearing at or soon after birth, can be caused by defective human RNase H2. As many as 38 Mendelian genotypes may result in a type I interferonopathy, including mutations in each of the genes encoding the subunits of the heterotrimeric RNase H2, the hallmark of which is activation of the innate immune response.
Differences between Class I and Class II RNases H
Over the years, many of our investigations focused on RNase H1. RNase H1 recognizes the 2′-OH of four consecutive ribonucleotides (rNMPs), while the DNA strand is distorted to fit into a pocket of the enzyme. Thus, the enzyme requires more than one ribonucleotide for cleavage of RNA in RNA/DNA hybrids. In both prokaryotes and eukaryotes, RNases H1 consist of a single polypeptide. In contrast, in eukaryotes RNase H2 is a complex of three distinct polypeptides but a single polypeptide in prokaryotes. The catalytic subunit of the hetero-trimeric RNase H2 of eukaryotes is similar in its primary amino-acid sequence to the prokaryotic enzyme. RNase H2 can recognize and cleave both RNA/DNA hybrids and a single ribonucleotide embedded in DNA, making an incision at the 5′ of the ribonucleotide, or the transition from the ribonucleotide in the case of RNA–primed DNA synthesis (e.g., rrrrrDDDD in DNA—italics indicate transition from ribonucleotide to deoxyribonucleotide) [References 1 & 2].
Several types of RNA/DNA hybrid structures can be formed, and they are processed differently. Simple RNA/DNA hybrids consist of one strand of RNA paired with one strand of DNA. The HIV–AIDS reverse transcriptase (RT) forms such hybrids when copying its genomic RNA into DNA. The RT also has an RNase H domain that is structurally and functionally similar to the class I cellular RNase H and is necessary for several steps of viral DNA synthesis. R-loop hybrids (three-stranded nucleic acid structures) have two separated DNA strands, with one hybridized to RNA while the other is in a single-strand form. Such structures sometimes form during transcription and can lead to chromosomal breakage. However, they are also part of the normal process of switching (recombination) from one form of immunoglobulin to another, resulting in distinct isoforms of antibodies. Another form of hybrid are single or multiple ribonucleotides incorporated into DNA during replication [Reference 1]. The first two types of hybrids are substrates for class I and II RNases H. The third is uniquely recognized by type 2 RNases H, when the stretch of ribonucleotides embedded in DNA is less than four nucleotides long.
Dual activities of RNase H2; Aicardi-Goutières syndrome
Eukaryotic RNases H2 recognize and resolve RNA hybridized or covalently attached to DNA (two chemically distinct structures), using the same catalytic mechanism for hydrolysis. RNase H2 mutations that reduce catalytic activity, or fail to properly interact with in vivo substrates, cause Aicardi-Goutières syndrome (AGS). Mutations in seven genes are known to cause AGS, with more than 50% of AGS patients having mutations in any of the three subunits of RNase H2. We previously expressed (in Escherichia coli) and purified human RNases H2 with mutations corresponding to several of those seen in AGS patients; one such mutation, RNASEH2A–G37S (G37S), has significant loss of RNase H2 activity. Using the 3D structure of the human enzyme that we had determined, we could locate all known mutations in RNase H2 that cause AGS. The wide distribution of the mutations suggests that modest changes in stability and interaction with other unknown proteins, as well as loss of catalysis, can all cause AGS. A mutation near the catalytic center of G37S found in some AGS patients results in low RNase H2 activity for both embedded ribonucleotides in DNA and RNA/DNA hybrids [Reference 1]. We are developing mouse models of AGS to clarify which defects are associated with each RNase H2 activity.
Mice bearing the G37S mutation in homozygous form are perinatal lethal, i.e., either dead at birth or die within a few hours of birth [Reference 1]. Mutations in another gene, TREX1 (which encodes a nuclease), also cause AGS, and it has been shown that homozygous knockout (KO) mice are viable but die after a few weeks owing to a cardiomyopathy, which can be prevented by blocking either an innate or adaptive immune response. In contrast, the G37S–mutant perinatal lethality and the fact that RNase H2 KO mice die during early embryogenesis suggest a more severe defect than that seen in TREX1–KO mice. Damaged DNA that finds its way into the cytoplasm can be sensed by the cGAS protein, producing the small molecule cGAMP, which interacts with the Sting protein, an important protein for the DNA–sensing in the innate immune pathway. Mice that are homozygous for G37S and deleted for the cGAS or Sting genes are mostly perinatal lethal but no longer exhibit increases in ISGs (interferon-stimulated genes). Interestingly, a small fraction of the double G37S–Sting KO are viable, indicating only limited involvement of ISGs in perinatal lethality. Further studies are under way, which we expect will lead us to the cause of lethality.
To distinguish among the defects that persistent RNA/DNA hybrids and single ribonucleotides joined to DNA cause in vivo, Hyongi Chon, a former postdoctoral fellow, rationally designed a modified RNase H2 to make an enzyme unable to cleave single ribonucleotides embedded in DNA but that retained RNA/DNA hydrolytic activity. The mutant enzyme, which we called RED (ribonucleotide-excision deficient), resolves RNA/DNA hybrids, which are substrates of both RNase H1 and RNase H2. Unlike the mouse and human RNases H2, RNase H2 activity is not required in the yeast Saccharomyces cerevisiae. Employing the ease of genetic mutation studies in yeast, we demonstrated that, in yeast producing the RNase H2RED, the enzyme acted in vivo by leaving embedded ribonucleotides (rNMPs) in DNA but was potent in removing RNA in RNA/DNA hybrids.
Embryonic lethality of mice Rnaseh2b–KO strains has been attributed to accumulation of rNMPs in DNA, but lethality could be the result of loss of RNA/DNA hydrolysis or a combination of both rNMP and RNA/DNA hydrolysis defects [References 1 & 2]. To distinguish among the possible causes of embryonic lethality, we generated a mouse that produces the RNase H2RED enzyme. Mouse embryonic fibroblasts (MEFs) derived from Rnaseh2RED mice have the same high level of rNMPs as seen in Rnaseh2b–KO MEFs [Reference 2]. Interestingly, the Rnaseh2RED mice die around the same time as the Rnaseh2b–KO mice. Therefore, lethality of the KO and RED RNase H2 mouse strains may be caused by increased rNMPs in genomic DNA. Rnaseh2aG37S/RED embryos also arrest at approximately the same stage as Rnaseh2aRED/RED embryos because of better association of RNase H2RED than RNase H2G37S with DNA substrate containing embedded rNMPs. The result is important because some RNase H2–AGS patients have similar compound heterozygous mutations in which there may be a dominant mutated enzyme.
Our studies on RNase H2-RED have permitted us and others to assign specific substrates to each of the two activities and determine which functions are related to various phenotypes seen when RNase H2 is absent [References 1–3]. One of our goals is to produce an RNase H2 with robust incision at single rNMPs in DNA but with poor RNA/DNA hybrid cleavage (hybrid-defective [HD]) to complement the RNase H2-RED enzyme. We have identified amino-acid changes that appear to produce such RNase H2-HD and are in the early stages of studies in both cell cultures and mice.
The rNMPs embedded in DNA could be converted to abasic sites in which the flanking dNMPs would be connected by a ribose phosphate rather than an rNMP. In collaboration with the Storici and Tell groups, we examined the abilities of eukaryotic RNases H2 to cleave substrates containing a single ribose abasic (only sugar phosphate) site in duplex DNA. Prokaryotic RNases HII, but not eukaryotic RNases H2, can recognize and cleave at abasic rNMPs in duplex DNA. Little was known about abasic sites in RNA until, in collaboration with Vivian Cheung’s group, we discovered that abasic sites are present in RNAs of yeast and human cells, and likely in all organisms. The abasic sites are located in or near R-loops. The Cheung group had previously shown that MPG (N-methylpurine DNA glycosylase) and APE1 (apurinic/apyrimidinic endonuclease) interact with R-loops. MPG is a glycosylase that removes the base of dNMPs or rNMPs (as reported in our papers with Cheung), which, in turn, can be cleaved by the APE1. Abasic sites in DNA are repaired by Ape1 excision using the complementary DNA strand as template for repair. RNAs have no template to correct for the absence of a base. The association of abasic sites with R-loops is intriguing and suggests that RNAs complexed with DNA are protected from RNase H activities. Indeed, we found that RNA transcripts, having formed an R-loop, pause and, in one case, serve as an enhancer for APOE, the most common gene associated with Alzheimer’s disease. The RNA of the R-loop has an adenosine that becomes methylated and attracts MPG glycosylase, leading to an abasic site in the RNA, probably stabilizing the R-loop and protecting it from RNase H resolution [Reference 4].
Loss of RNase H1 in early B cell development activates the mitochondrial unfolded protein response without affecting the nuclear R-loops.
We made a knockout of the mouse Rnaseh1 gene and discovered that two isoforms of RNase H1 are produced from a single mRNA by a leaky scanning method for differential translation. One protein is localized to the nucleus and a second is targeted to mitochondria. Nuclear DNA replication begins at fertilization, with mitochondrial DNA (mtDNA) beginning amplification several days later. We observed early embryonic death shortly after mtDNA replication should have begun, thereby linking the absence of mtDNA replication with lethality. We were curious to see the contribution effects in a system less complicated than embryonic development. We chose mouse B cell development because (1) B cells are not required for viability when mice are housed in a germ-free environment; (2) B cell development occurs in only a few rounds of cell duplication; (3) resting B cells are in G0, providing a population of cells that respond together when stimulated; (4) many useful tools for analyses and manipulation are available; and (5) RNase H has potential, known substrates in B cell development.
We generated an Rnaseh1 conditional KO mouse strain in which we can specifically knocked out the gene, using a CRE-lox method with the Mb1 (the Mb1 gene encodes the Ig-α signaling subunit of the B cell antigen receptor) promoter–driving CRE. Transcripts of Mb1 are initiated from the earliest stage of B cell development and persist until plasmacytes are formed. B cells develop to the resting stage, at which point they can be stimulated to undergo isotype switching by class switch recombination (CSR), ultimately producing circulating antibodies. We found that Mb1-CRE KO of the Rnaseh1 gene resulted in little or no circulating antibodies but did produce resting B cells, although yielding half as many B cells as did wild-type (WT) mice. Stimulation of these B cells initiated transitioning from G0 to G1 phase of the cell cycle, but essentially they never entered S-phase. The resting B cells had no RNase H1 activity, no mtDNA, and their mitochondria exhibited abnormal morphology. We performed RNA-Seq analyses of resting and 24 h–stimulated mutant and WT B cells to discover genes related to loss of mtDNA and/or a nuclear DNA–damage response. We observed reductions in the following pathways: the cell-cycle, the immune system, DNA replication, and in mitochondrion, RNA processing, and ribosomes. The 50% yield of resting B cells in the KO strain must occur during cell amplification in bone marrow. The loss of RNase H1 was initiated just prior to cell amplification and might limit the number of cell cycles. It is also possible that defects affecting the time of residence of the B cells in the bone marrow niche are affected. Loss of the Nidogen1 gene (NID1) results in a 50% reduction of resting B cells compared with normal, the same reduction as in our KO mice. We noticed a significant difference between WT and mutant resting B cells for the NID1 transcripts.
The list of genes with the highest difference between resting and stimulated KO mice are Atf5, Gdf15, Atf3, Hspa9, and Ddit3. Atf5, Atf3, and Ddit3, all hallmarks of the unfolded mitochondrial response (UPRmt). The activation of the UPRmt indicates that loss of mtDNA takes precedence over nuclear DNA–damage response, just as we observed in embryonic development when the Rnaseh1 gene was deleted in the male and female gametes. We checked the presence of R-loops by DRIP-Seq and, surprisingly, found no alteration in R-loops, indicating the lack of a role RNase H1 in the processing of these structures.
The RNA exosome and RNases H cooperate to suppress R-loop–mediated genome instability.
In addition to RNase H processing, there are other mechanisms to resolve R-loops and/or prevent their formation. The RNA exosome is a major 3′-5′ RNA degradation, multi-subunit complex in eukaryotes, which eliminates cryptic and defective transcripts, preventing their engagement with DNA and suppressing R-loop formation.
In the yeast S. cerevisiae, we are studying the interaction and cooperation between RNases H and the RNA exosome in preventing R-loop–mediated genome instability. We found that over-expression of RNase H1 partially suppressed the growth defects of exosome-deficient mutants, suggesting that some of the problems in these cells are caused by harmful R-loops. We observed that cells defective in both RNase H and exosome activities are hypersensitive to the drug hydroxyurea, which induces replicative stress by reducing the cellular dNTP supply, and that replicative stress in these conditions is eliminated by expressing the RNase H2-RED variant that processes R-loops.
We are using a genetic system that correlates R-loop accumulation with homologous recombination to further elucidate the interplay between RNases H and the exosome in R-loop processing. In this system, yeast strains contain a fragment of the mouse switch Mu sequence (from IgH locus) in an orientation that either facilitates or prevents R-loop formation. We found that RNAs with the fragment in both orientations accumulated in exosome-deficient mutants, indicating that these RNAs are targeted for degradation by the exosome. Moreover, strains with the fragment in the orientation that favors R-loop formation showed increased accumulation of RNA/DNA hybrids and recombination rates when both the exosome and RNases H were defective, compared with cells lacking only RNases H. We conclude that RNase H and the exosome pathways converge to promote genome stability by suppressing the harmful effects of R-loops.
- Uehara R, Cerritelli SM, Hasin N, Sakhuja K, London M, Iranzo J, Chon H, Grinberg A, Crouch RJ. Two RNase H2 mutants with differential rNMP processing activity reveal a threshold of ribonucleotide tolerance in DNA for embryonic development. Cell Rep 2018 25:1135–1145.
- Cerritelli SM, Iranzo J, Sharma S, Chabes A, Crouch RJ, Tollervey D, El Hage A. High density of unrepaired genomic ribonucleotides leads to Topoisomerase 1-mediated severe growth defects in absence of ribonucleotide reductase. Nucleic Acids Res 2020 48:4274–4297.
- Liu Y, Rodriguez Y, Ross RL, Zhao R, Watts JA, Grunseich C, Bruzel A, Li D, Burdick JT, Prasad R, Crouch RJ, Limbach PA, Wilson SH, Cheung, VG. RNA abasic sites in yeast and human cells. Proc Natl Acad Sci USA 2020 117:20689–20695.
- Watts JA, Grunseich C, Rodriguez Y, Liu Yaojuan, Li D, Burdick JT, Bruzel, Crouch RJ. Mahley RW, Wilson S, Cheung V. A common transcriptional mechanism involving R-loop and RNA abasic site regulates an enhancer RNA of APOE. Nucleic Acids Res 2022 50(21)12497–12514.
- Frederic Chedin, PhD, University of California Davis, Davis, CA
- Vivian G. Cheung, MD, University of Michigan Life Sciences Institute, Ann Arbor, MI
- Louis Dye, BS, Microscopy and Imaging Core, NICHD, Bethesda, MD
- Aziz El Hage, PhD, University of Edinburgh, Edinburgh, UK
- James Ibem, PhD, Molecular Genomics Core, NICHD, Bethesda, MD
- Francesca Storici, PhD, Georgia Institute of Technology, Atlanta, GA
- Gianluca Tell, PhD, Università degli Studi di Udine, Udine, Italy
- David Tollervey, PhD, University of Edinburgh, Edinburgh, UK
For more information, email firstname.lastname@example.org or visit https://www.nichd.nih.gov/research/atNICHD/Investigators/crouch.