- Review
- Open access
- Published:
In vivo dynamics and regulation of DNA G-quadruplex structures in mammals
Cell & Bioscience volume 13, Article number: 117 (2023)
Abstract
G-quadruplex (G4) is a four-stranded helical DNA secondary structure formed by guanine-rich sequence folding, and G4 has been computationally predicted to exist in a wide range of species. Substantial evidence has supported the formation of endogenous G4 (eG4) in living cells and revealed its regulatory dynamics and critical roles in several important biological processes, making eG4 a regulator of gene expression perturbation and a promising therapeutic target in disease biology. Here, we reviewed the methods for prediction of potential G4 sequences (PQS) and detection of eG4s. We also highlighted the factors affecting the dynamics of eG4s and the effects of eG4 dynamics. Finally, we discussed the future applications of eG4 dynamics in disease therapy.
Introduction
DNA generally exists as a double helix, and guanine-rich DNA sequences can form four-stranded secondary structures, which are called G-quadruplexes (G4s) (Fig. 1A). A canonical G4 structure is formed by the assembly of two or more G-tetrads, and each G-tetrad consists of four guanines linked by Hoogsteen hydrogen bonds (Fig. 1B) [1]. G4, with its unique secondary structure, is involved in a variety of important biological processes such as gene transcription, translation regulation, telomere extension, and chromatin modification.
The dynamics of endogenous G4s (eG4s), such as folding, unfolding, and their topological changes under different conditions in vivo can affect biological processes, and the detection of eG4s is of great importance for the exploration of their functions. Many methods, including low- or high-throughput methods, have been developed to identify potential G4 sequences (PQS) in silico or in vitro. Fortunately, the advent of cell imaging technology allows the visualization of eG4 folding and unfolding in the context of cell chromatin. With the development of sequencing technology, we can comprehensively identify genome-wide eG4s by ChIP-seq/CUT&Tag methods. The detection methods of eG4s can be used to determine factors affecting eG4 dynamics, including PQS itself and various intracellular factors, and to study eG4 functions.
This review summarizes eG4 detection methods, eG4 characteristics, and self-factors and intracellular factors affecting eG4 dynamics. The effects of eG4 dynamics on telomere extension, transcription, and genomic stability are highlighted. Gene expression can be regulated by eG4 dynamics, suggesting the strong potential of eG4 in biomedical applications.
Detection and characterization of endogenous G4s
The typical G4 sequence usually consists of four G-tracts of 2–4 bases and three loops connecting adjacent G-tracts. G4 sequences with G-tracts of 2–4 base lengths can form G4 structures with 2–4 G-tetrad layers. Such a sequence feature is used to predict PQS [2, 3] by four algorithms, including regular expression matching algorithms [4, 5], scoring algorithms [6, 7], sliding window algorithms [8, 9], and machine learning algorithms [10]. The algorithms for G4 prediction have been continuously improved, which increases our understanding of eG4s, but the false positive rate of existing algorithms is still high, and so far no software can predict dynamic eG4s, which hinders the study of eG4 function.
Biophysical and chemical methods are used to detect G4 structures in vitro. Classical biophysical methods, including circular dichroism (CD) [11] and nuclear magnetic resonance (NMR) [12], are used to detect the structure of G4s in vitro and to study the dynamics of G4s at different temperatures [13], PEG (polyethylene glycol) concentrations [13], and K+ concentrations [14]. The DNA polymerase stop method and the dimethyl sulfate (DMS) and piperidine cleavage method are used to successfully locate the G4s in telomere DNA sequences [15, 16]. Considering that DNA G4s impede polymerase, the high-throughput G4-seq method [17] has been further developed to identify genome-wide DNA G4s in vitro. However, these methods can only be used to detect G4s in vitro, not eG4s in vivo.
Cell imaging provides a strategy to detect and characterize eG4s. The earliest researchers attempted to use G4 antibodies to image intracellular eG4s by immunofluorescence technique. ScFv Sty49 is the first G4-specific antibody that successfully visualized eG4s in ciliate macronuclei [18]. Other antibodies such as scFv hf2 [19], GQ1 zinc finger protein [20], and G4 DARPins [21] are used to detect DNA G4s in vitro, but it turns out that they are unsuitable for whole-cell immunofluorescence assay [21]. There are two reasons for the insensitivity of the above antibodies to eG4s in the cellular chromatin environment. On the one hand, the intracellular chromatin structure covers the immune epitopes of eG4s. On the other hand, the dynamic eG4s can only be folded for a short time under specific cellular conditions [22]. The scFv antibody BG4 [23] generated by phage display and the mouse monoclonal antibody 1H6 [24] generated by immunizing mice with a stable G4 structure have been reported to be used for whole-cell immunofluorescence assay to detect the folding or unfolding of eG4s in vivo. In particular, the cross-reactivity of the G4 antibody 1H6 to immobilized single-stranded T-rich DNA must be considered when using this antibody [25]. Since the generation of the two antibodies depends on the fixed structure of the G4, not all the possible topological structures of eG4s can be detected.
In addition to antibodies, a series of small-molecule fluorescent chemical probes can also be used to detect eG4s. For example (Table 1), probe BMVC [26], squarylium dye TSQ1 [27], and single-molecule fluorescent probe SiR-PyPDS [28] can be used to detect DNA eG4s. ATPD dye [29], NaphthoTASQ [30], and DAOTA-M2 [31] can be used to detect DNA eG4s and RNA eG4s. These probes all have membrane permeability, low cytotoxicity, and high selectivity for eG4s. They can produce strong and specific fluorescence upon binding to eG4s. Small-molecule probes that detect eG4s should not induce the folding of eG4s, but only detect those eG4s that are pre-existing in cells. Currently, TSQ1 and low concentrations of SiR-PyPDS are reported not to induce eG4 folding in vivo [27, 28]. To advance this field, newly developed eG4s probes should ensure that they do not interfere with the dynamics of eG4s and thus detect the actual eG4s rather than spurious eG4s induced by the probe in vivo.
Using cell imaging, researchers can observe the dynamics of eG4s in vivo, study the effect of the cellular environment on the dynamics of eG4s, and better link eG4s to their biological functions. However, neither G4 antibodies nor small molecule probes are sufficient to detect all eG4s in vivo. The inability to obtain genome-wide eG4s limits the application of this cell imaging technology.
The combination of cell imaging and next-generation sequencing makes it possible to detect genome-wide eG4s at high resolution. For example, G4 ChIP-seq uses G4 antibodies to pull down eG4-forming regions in vivo [32]. By using fixed cell chromatin as the operation object, the detection results are closer to eG4s in the real chromatin environment, thus expanding the application of the G4 ChIP-seq technique to detect dynamic eG4s in different treatments and different cell lines. For example, when BG4 antibodies were used to isolate eG4-containing fragments from chromatin fragments of human NHEK cells and immortalized HaCaT cells and to construct NGS sequencing libraries, approximately 10,000 DNA eG4s were detected from NHEK cell chromatin, while the number of eG4s detected from immortalized HaCaT cell chromatin was only 1/10 of that from NHEK cells [33]. In addition, BG4-based identification of eG4s and Tn5-based CUT&Tag approaches have been integrated as G4 CUT&Tag [34]. Unlike the G4 ChIP-seq method, which enriches eG4-containing fragments from fixed chromatin fragments, eG4s are bound by BG4 in native cells in the G4 CUT&Tag method. Meanwhile, the Tn5 transposase can generate eG4-containing genomic DNA fragments with sequencing adapters, while most of the extraneous chromatin is not interrupted [35]. Therefore, compared with G4 ChIP-seq, G4 CUT&Tag has simpler operation, higher signal-to-noise ratio, and higher resolution and sensitivity for eG4s detection in vivo. The application of G4 CUT&Tag demonstrates the variability of eG4 dynamics in different cells and the potential association between eG4s and enhancer elements [34, 36, 37]. By mapping eG4s at the genomic level, the distribution of eG4s in cells becomes clearer, and the relationship between eG4 dynamics and eG4 function is further strengthened. The G4 detection methods are listed in Table 2.
Factors affecting eG4 dynamics
During the folding of eG4s, many self-factors will affect the topology. The sequence of eG4 affects its own folding and conformation. The length of the G-tract sequence will affect the conformation of G4s. It has been reported that the human telomere sequence d[TAGG(TTAGGG)3] forms hybrid G4s in K+ solution, and the silkworm telomere sequence d[TAGG(TTAGG)3] with one G missing in each G-tract sequence forms antiparallel G4s in vitro [38]. The loop sequence also affects the topological conformation of G4s. G4s with longer loop sequences tend to form antiparallel structures, whereas G4s with shorter loop sequences tend to form parallel structures [39]. G4 sequences containing two single-base loop sequences will fold into parallel structures [39]. For example, G4 sequences myc-2345, myc-1245, VEGF, HIF-1α, and c-kit21T all have two single-base loop sequences, and NMR detection shows that the above G4 sequences all fold into parallel structures [40, 41], which may be because the short loop sequence is not enough to connect two guanines in the same G-tetrad, but it can connect two guanines in different G-tetrads, as a result, the two G-tract sequences connected by the short loop sequences are parallel.
In addition to the G4 sequence itself (including the G-tract sequence and the loop sequence), flanking sequences and cations are also involved in the folding of eG4s. Cations, including Ca2+, Pb2+, Sr2+, NH4+, Na+, and K+, can coordinate between G-tetrads and participate in the folding of G4s [42,43,44,45,46]. Among these cations, K+, which is the most abundant metal cation in mammalian cells [47], has the highest affinity for G-tetrads [48, 49]. The K+ environment is favorable for the folding of intermolecular parallel G4s, whereas the Na+ environment is favorable for the folding of intramolecular antiparallel G4s. This is because different cations coordinate differently. For example, K+ coordinates with guanines in the form of eightfold, while Na+ coordinates with guanines in the form of square-planar [39]. The interaction between flanking sequences and loop sequences also affects the folding of eG4 structures. Different flanking sequences interact with loop sequences in different ways, thus forming different base pairs and cap structures and affecting the intra-loop and inter-loop interactions, ultimately affecting the structures of eG4s. For example, under the condition of K+-induced folding, the Tel24 sequence can form two conformations: hybrid-1 (dominant) and hybrid-2. When thymine is used to replace A24 (adenine at position 24) on the 3' flanking sequence, Tel24 G4 with hybrid-2 conformation is significantly increased, which is because A24 on the 3' flanking sequence can be paired with T13 base on the loop sequence to form a cap structure, and similarly, T1 on the 5' flanking sequence can be paired with A20 on the loop sequence to form a cap structure. When A24 is replaced by T, the 3' flanking sequence can no longer interact with the loop sequence, so A7 and A8 on the loop sequence are paired to form a cap structure, and similarly, T1 on the 5' flanking sequence is paired with A14 on the loop sequence to form a cap structure, which is conducive to the folding of Tel24 G4 into the hybrid-2 conformation [14].
The structures of eG4s are also affected by interacting proteins in vivo. During DNA replication, dsDNA is unwound into ssDNA by helicases and stabilized by ssDNA-binding proteins. During transcription, the promoter TATA box interacts with TFIIH to melt the promoter. As a kind of nucleic acid structures, eG4s will inevitably be regulated by interacting proteins. The proteins that can interact with eG4s can be divided into two types according to their functions: one is the protein that can unfold eG4s (such as G4 helicase), and the other is the protein that can bind and stabilize eG4s. These two types of proteins together regulate the dynamics of eG4s in vivo.
Most helicases, such as RecQ-like and DEAH box helicase families, can unfold eG4s. Bloom syndrome protein (BLM) and Werner syndrome ATP-dependent helicase (WRN), two of the most important members of the RecQ-like helicase family, can unfold DNA eG4s [50, 51], thereby maintaining genomic stability during DNA replication, repair, and recombination. The mechanism of unfolding eG4s of BLM and WRN is similar, their helicase-RNaseD domain binds to the ssDNA 3' terminal, and the RecQ domain cooperatively binds to eG4s. Finally, eG4s are unfolded in the 3' to 5' direction in an ATP-dependent manner [52]. Both DHX36 and DHX9 of the DEAH-box helicase family have been reported to efficiently unfold eG4s [53,54,55]. DHX36 unfolds eG4s via the DEAH family helicase mechanism [56]. DHX36 first binds to the 3' end of the nucleic acid chain and unfolds eG4s through a process dependent on ATP shift to the 5' end. In addition, there are Pif1, FANCJ, DNA2, XPD, and ChlR1 helicases, which unfold DNA eG4s from 5' to 3' in an ATP-dependent manner by binding to the 5' tail of DNA [57,58,59,60,61]. In particular, the activity of ChlR1 unfolded antiparallel G4s was much higher than that of unfolded parallel G4s. Most of these enzymes that unfold DNA eG4s are involved in maintaining genomic stability during DNA replication. This reflects the elimination of one of the negative effects of eG4 folding in vivo.
In addition to these helicases, many proteins without helicase activity can also unfold eG4. Cellular nucleic acid-binding protein (CNBP) can bind to eG4s through the zinc finger domain, the CCHC domain, and the RG-rich domain. Recent studies indicate that CNBP can unfold DNA G4s in a variety of proto-oncogene promoters in vitro, and CNBP can regulate gene transcription in vivo by unfolding promoter eG4s [62]. Another transcription factor, MAZ, can promote the unfolding of HRAS promoter G4s and their conversion into a B-type double helix, and MAZ is thought to activate HRAS transcription by unfolding eG4s [63]. Some proteins can only unfold the eG4s with a specific topology. For example, the ssDNA-binding protein POT1 can only unfold the antiparallel eG4s [64].
Some G4-interacting proteins can induce folding and stabilization of eG4s after binding to eG4. The RBD1 and RBD2 domains of nucleolin have been reported to have a high affinity for the eG4 in the c-Myc gene promoter, and thus nucleolin can induce folding and stabilization of eG4, which may be the mechanism by which nucleolin is involved in down-regulating the transcription level of the c-Myc gene [65]. LARK can recognize and bind eG4s through the RRM1 and RRM2 domains, thereby inducing folding and stabilization of eG4s, and it can bind many DNA eG4s, such as the eG4s in the promoters of BmPOUM2, c-MYC, HIF-1a, and c-kit [66]. The multifunctional DNA repair enzyme APE1 can induce the folding of eG4s and stabilize eG4s, and its loss will not be conducive to the formation of eG4s in vivo [67].
The development of systematic methods for the identification of G4-interacting proteins has promoted the improvement of this field. The genome-wide shRNA screen based on G4 stable ligands contributes to the identification of G4-interacting proteins [68]. This method can systematically identify genes that interact with G4 directly (G4-interacting proteins) or indirectly (proteins involved in G4-dependent pathways). The RNA helicase DDX42 was identified as a G4-interacting protein by the above method and showed a high affinity for RNA G4 in vitro. The co-binding-mediated protein profiling (CMPP) strategy based on the G4 small-molecule photocrosslinking probe can efficiently capture G4-interacting proteins in living cells [69]. SMARCA4 was identified as a G4-interacting protein by CMPP. The high overlap between the genome-wide binding sites of SMARCA4 and DNA eG4s confirmed the results of CMPP. Since SMARCA4 inhibits the formation of the R-loop structure, which is highly co-localized with G4 in vivo [70], SMARCA4 may have the function of unfolding eG4s.
Currently, the regulation of eG4 dynamics by G4-interacting proteins is largely unknown. How G4-interacting proteins regulate the dynamics of specific eG4s at the right time to affect specific biological processes remains poorly understood. We speculated that different proteins might recognize and bind different types of eG4s, or that different G4s might have different mechanisms to recruit specific G4-interacting proteins. However, the above speculation needs to be verified by G4-interacting protein ChIP-seq and other new technologies. Some G4-interacting proteins found in humans are listed in Table 3.
In addition to proteins, many cellular factors also influence the dynamics of eG4s in vivo. The folding of eG4s requires the formation of Hoogsteen hydrogen bonds between guanine bases, which makes the folding of genomic DNA eG4s compete with the formation of other nucleic acid structures with Watson–Crick hydrogen bonds. In the processes of DNA replication, transcription, and damage repair, double-strand separation leads to the disruption of Watson–Crick hydrogen bonds, which is conducive to the folding of DNA eG4s. The folding of eG4s is highly likely to occur during DNA replication (Fig. 2A), because the replication mechanism causes the double-stranded DNA to unwind and form a single-stranded DNA, and the single-stranded DNA without the interference of the Watson–Crick hydrogen bond is easier to fold into eG4s [71]. During the cell cycle, it has been reported that eG4 folding reaches its highest level in the S phase with DNA replication [72]. BG4 immunofluorescence shows that in the synchronous cell population, the folding level of eG4 was the lowest in the G0/G1 phase without DNA replication. At the G1/S checkpoint, where DNA replication occurred, the number of eG4s increased 2.5-fold compared to the G0/G1 phase. In the S phase, the maximum number of eG4s was 4.8 times higher than in G0/G1. When aphidicolin was used to inhibit DNA replication, the number of eG4s was reduced by more than twofold, indicating that the folding of eG4s is associated with DNA replication [23]. In a similar study, the fluorescent probe SiR-PyPDS was used to detect the eG4s in the synchronous cell population, and the results showed that during the cell cycle, only 3 eG4s were observed in the G0/G1 phase, 103 eG4s in the G1/S phase, and 208 eG4s in the S phase, indicating that DNA replication in the S phase of the cell cycle promoted the folding of the eG4s in vivo [28]. During gene transcription, heterozygous double-stranded RNA–DNA is formed from newly synthesized RNA and template DNA, which may contribute to the folding of DNA eG4s on non-template chains (Fig. 2B) [73].
Nucleosome depletion may contribute to the folding of eG4s (Fig. 2C). Several studies on the detection and localization of DNA eG4s in chromatin have reported that eG4s are mainly formed in nucleosome-depleted regions and the regulatory regions of transcriptionally active genes [32, 33, 74]. With stem cell differentiation (CNCC, ESC, and NSC) and increased cell specificity, nucleosome-depleted regions disappeared, resulting in the disappearance of eG4s [75].
In addition, telomerase is involved in the unfolding of eG4s in the process of telomere extension. After the expression of telomerase TR subunit and TERT subunit was inhibited by the RNAi method, the accumulation of eG4s was observed in the telomere region, indicating that telomerase or telomerase-associated protein is involved in the unfolding of telomere eG4s in vivo [76].
Effects of eG4 dynamics
The dynamics of eG4s can regulate gene expression, influence telomere homeostasis, and cause genomic instability.
The dynamics of eG4s can influence gene transcription. Promoter regions are rich in G-rich sequences that form eG4s [77]. The promoter regions of several oncogenes such as Rb [78], RET [79], VEGF [80], c-Myc [81], and bcl-2 [82] could be folded to form G4s in vitro, which was supported by G4 ChIP-seq results of oncogenes such as PTEN, MYC, KRAS, and TSEN34 [33]. Chromatin DNA eG4s were reported to be enriched at the promoters of highly transcribed genes [33, 34]. Transcriptional downregulation of MYC, KRAS, and KIT genes was observed after cells were treated with G4 stable ligands [83,84,85]. These results further support the relationship between transcription and eG4s.
The eG4 dynamics can regulate gene transcription by altering the binding between the transcription factors and the promoters (Fig. 3A). Early analyses based on computational methods have shown that transcription factor-binding motifs are enriched in some promoter G4 motifs [86]. The transcription factor nucleolin can bind and stabilize G4 in the MYC promoter in vitro [65]. After their folding, eG4s recruit the transcription factor nucleolin to bind to the c-MYC promoter, thereby inhibiting gene expression [87]. NM23-H2 can bind and unfold the MYC promoter eG4s [88]. Based on these findings, it has been speculated that nucleolin and NM23-H2 may be involved in stabilizing and unfolding MYC promoter eG4s, respectively, to regulate MYC transcription. A number of transcription factors, including CNBP [89], SP1 [90], and LARK [66], interact with G4 in vitro. A recent study showed that after their folding, the eG4s in the BmPOUM2 gene promoter recruit the transcription factor LARK to activate the transcription of the BmPOUM2 gene in the silkworm [66]. The role of some eG4s in the coding region of the gene is similar to that of promoter eG4s. For example, the folding of eG4s in the first exon of the human hTERT gene prevents the binding of the CCCTC-binding factor to the hTERT gene, resulting in increased transcription of the hTERT gene [91].
The eG4s in genes regulate gene transcription in several ways. The folding of DNA eG4s on the gene template chain tends to inhibit the transcriptional elongation of the genes (Fig. 3B). For example, in human embryonic kidney cells and Escherichia coli, the folding of eG4s on the template chain hinders the elongation of RNA polymerase, thereby reducing the transcription level of the corresponding gene [92, 93]. Folding of DNA eG4s on the non-template chain can regulate the transcription level of the gene (Fig. 3C). On the one hand, the non-template chain eG4s hinder the renaturation between the template chain and the non-template chain, which helps to improve the stability of the R-loop structure formed by the template chain and the RNA. The stable R-loop structure causes the polymerase to stall, preventing normal RNA polymerase-mediated transcription. Insertion of the G-rich sequence into the non-template chain resulted in a decrease in T7 polymerase transcription levels, suggesting that eG4s may contribute to R-loop-mediated transcription inhibition [91]. On the other hand, the non-template chain eG4s may be beneficial for transcription restart because they can keep the template chain and non-template chain unwound. Human genes containing more G4 sequences on the non-template chain within 500 bp downstream of the transcription start sites showed higher transcription levels and RNA PolII occupancy rates [94]. Folding of intermolecular eG4s derived from the non-template chain and RNA leads to premature termination of transcription, thereby reducing the level of gene transcription (Fig. 3D). RNA and non-template DNA chains generated from the transcription of the mitochondrial gene CSBII fold into stable DNA-RNA heterozygous eG4s to promote transcription termination [95]. Insertion of the DNA-RNA heterozygous eG4-forming sequence into the reporter plasmid resulted in the inhibition of plasmid transcription [96].
In addition to the direct regulation methods mentioned above, eG4s can also regulate gene transcription through epigenetic modifications. Chromatin is mainly composed of DNA and histones. On the one hand, the dynamics of eG4s regulate DNA methylation (Fig. 3E). DNA methyltransferase catalyzes the formation of 5-methylcytosine at CpG dinucleotide sites. DNA methyltransferase prefers to bind DNA G4 over double-helix DNA in vitro [97]. When folded, DNA eG4s recruit DNA methyltransferase 1 (DNMT1) and inhibit its activity, which is confirmed by the high hypomethylation of the DNMT1 binding site of eG4s in human leukemic cells [98]. DNA methylation reduces the accessibility of DNA and thus inhibits gene expression, while eG4 folding avoids DNA methylation-induced transcriptional downregulation through the above mechanism. On the other hand, the dynamics of eG4s are involved in the regulation of nucleosome histone modification (Fig. 3F). After folding, eG4s in the hTERT gene promoter recruit the REST-LSD1 (RE1-silencing transcription factor and lysine-specific histone demethylase 1A) complex by binding to NME2, so that the histone H3 Lys4 near the promoter is demethylated, which makes the chromatin structure compact and eventually inhibits gene transcription [99]. The CDKN1A gene has a similar mechanism. After folding, eG4s in the CDKN1A promoter bind TRF2 to recruit the REST-LSD1 complex, resulting in histone demethylation, which ultimately leads to transcriptional downregulation [100].
Endogenous G4s are also found in telomeres [23, 101] located at the end of chromosomes, and the maintenance of telomere homeostasis depends on the folding of eG4s (Fig. 4A). After binding to eG4s in telomeres, the long non-coding RNA TERRA can be used as a platform for telomere-protein binding [102]. In addition, FUS protein binds to TERRA eG4s and telomere eG4s to recruit histone methyltransferases that maintain the state of telomere heterochromatin. The methyltransferase methylates the histone of telomere nucleosomes to compact the structure of telomere chromatin and maintain the state of heterochromatin. Meanwhile, telomere eG4s prevent DNA polymerases from replicating (Fig. 4B). Many proteins, such as the CTC1-STN1-TEN1 (CST) complex [103] and regulator of telomere elongation helicase 1 (RTEL1) [104], can unfold telomere eG4s, thereby preventing telomere eG4-induced reduction in telomere replication rate, telomere shortening, and abnormal telomere formation [105]. When the CST complex is depleted and eG4s is stabilized by PDS, telomeres are lost in vivo [103]. In addition, different topologies formed by eG4 folding can regulate telomere extension by telomerase (Fig. 4C), a reverse transcriptase composed of RNA and protein. In cancer cells and stem cells with high division and proliferation capacities, telomerase uses its own RNA component as a template to reverse transcribe and extend the 3' end of chromosomal telomere DNA to prevent DNA replication-induced telomere shortening. Folding of the antiparallel intramolecular telomere DNA eG4s restricts the binding of the telomere 3' end to telomerase and inhibits telomere extension [106]. The parallel intermolecular telomere DNA eG4s formed in the S phase of the cell cycle are the substrate and localization site of telomerase, which contributes to telomere extension [107].
In addition, the folding of DNA eG4s increases the likelihood of DNA mutations, leading to genomic instability. Caenorhabditis elegans strains with deletions in the DOG-1 gene (whose product is a homolog of the FANCJ helicase and can unfold DNA eG4s) accumulate gene mutations with base deletions in the G-rich regions that form eG4s [108]. This may be because the folded eG4s interfere with the progression of DNA replication forks, thus causing replication stalling and DNA double-strand breaks, which may lead to gene mutations in the process of DNA double-strand break repair [109, 110], which is supported by the following studies. A previous study found that deletion of the FANCJ helicase or the presence of G4 stable ligands leads to replication stalling at eG4s [111]. A recently developed DSB detection technique called i-BLESS has detected DNA double-strand breaks at eG4s [112]. Genetic analysis of Saccharomyces cerevisiae shows that the G4 helicase Pif1 can unfold eG4s and inhibit DNA damage and chromosome recombination caused by eG4s [113]. The FANCJ helicase and the ssDNA-binding protein RPA can jointly promote the unfolding of DNA eG4s to ensure DNA replication in the S phase of the cell cycle [58]. BLM, WRN, and FANCJ jointly mediate DNA eG4 unfolding, thereby reducing the negative effects of eG4 folding in vivo and maintaining genome stability to some extent [114].
Regulation of eG4 dynamics in therapeutic applications
Because eG4 dynamics are involved in many biological processes in cells as described above, they affect some obvious characteristics of organisms. As mentioned above, eG4s are enriched in proto-oncogene promoters and telomeres, and thus eG4s are associated with cancer.
First, the overexpression of proto-oncogenes will lead to abnormal cell proliferation and cause the transformation of normal cells into cancer cells. The G4-stabilizing ligands or eG4 stabilizers can inhibit the upregulation of proto-oncogenes at their promoters, and they have great potential for cancer treatment (Fig. 5). A previous study has shown that the tri-substituted naphthalene diimide G4-targeting ligand CM03 exhibits anticancer activity on the pancreatic ductal adenocarcinoma cell line and the patient-derived xenograft (PDX) model. RNA-seq analysis shows that CM03 down-regulates many PQS-rich genes associated with cell proliferation, metastasis, and drug resistance in cancer cells [115]. In addition, the tetra-substituted naphthalene diimide derivative MM41 has been reported to exhibit anticancer activity against the MIA PaCa-2 pancreatic cancer PDX model and to reduce tumor growth by 80% in tumor-bearing mice [116]. This may be due to the fact that MM41 can bind and stabilize the promoter eG4s of proto-oncogenes such as BCL-2 and k-RAS, resulting in approximately 40% and 30% downregulation of their transcription, respectively. The G4-stabilizing ligand PDS has also been reported to inhibit the transcriptional expression of several oncogenes [117].
Second, the expression of telomerase is activated in cancer cells, thus resulting in the repair of telomere DNA shortening caused by the continuous division of cancer cells. The G4 stabilizer can inhibit the unfolding of telomere eG4s, thereby inhibiting telomerase-mediated telomere DNA extension in cancer cells and ultimately leading to cancer cell death [118].
Finally, DNA double-strand breaks (DSBs) caused by eG4 dynamics have also been used to treat cancer (Fig. 5). The folding of eG4s in the genome leads to abnormal DNA replication, resulting in DSBs. Normally, DSBs are repaired by homologous recombination (HR) and non-homologous end joining (NHEJ). When DSBs are induced by the G4 stabilizer in cancer cells and cannot be repaired by the HR or NHEJ pathways, the cancer cells are effectively killed. For example, the G4 stabilizer CX-5461 can induce eG4 folding and stabilize eG4s in BRCA1/2-deficient cancer cells, thereby inducing replication-dependent DSBs and ultimately killing cancer cells [119]. BRCA1/2 is an important DNA damage repair protein involved in the HR pathway, and after DSBs, BRCA1/2-deficient cancer cells die because DSBs cannot be repaired by the HR pathway. The G4 stabilizer CX-5461 has been reported to have good anticancer effects, and it can inhibit the growth of the BRAC2-deficient colon cancer cell line HCT-116 and suppress triple-negative breast cancer in a PDX model with mutations in the BRAC1 and BRAC2 genes. Inhibition of the NHEJ pathway can also play a similar role. DNA-PK is an important protein in the NHEJ pathway and is involved in the repair of broken DNA ends. The anticancer effect of the combination of the G4-stabilizing ligand PDS and the DNA-PK inhibitor NU7441 on the human fibrosarcoma cell line HT1080 is increased by about 45% compared to the use of PDS alone [120].
Conclusion
As critical nucleic acid structures, eG4s endow DNA with some additional functions, such as participation in transcription regulation, DNA methylation, and histone modification. The dynamics of eG4s have two opposing effects on the intracellular chromatin DNA. On the one hand, the folding of eG4s is very important for telomere structure to maintain the independence and integrity of linear DNA. On the other hand, the unfolding of eG4s is a factor that contributes to the stability of genomic DNA. Since eG4 dynamics have a variety of biological functions in cells, the use of G4-stabilizing ligands or G4 stabilizers to regulate eG4 dynamics has been confirmed as an effective therapy for cancer. In recent years, eG4 research methods have continuously evolved from various extracellular biophysical methods to the application of cell imaging and high-throughput sequencing for genome-wide detection of eG4s. The above methods have been used to investigate the factors affecting the dynamics of eG4s, the critical roles of eG4 dynamics in cells, and the interacting proteins of eG4s. However, there is still much room for improvement in the resolution and sensitivity of existing methods for studying eG4 dynamics. In the future, as research technology continues to improve, the influence and molecular mechanism of eG4 dynamics in cells will be further explored.
Availability of data and materials
Not applicable.
References
Burge S, Parkinson GN, Hazel P, Todd AK, Neidle S. Quadruplex DNA: sequence, topology and structure. Nucleic Acids Res. 2006;34(19):5402–15. https://doi.org/10.1093/nar/gkl655.
Lombardi EP, Londoño-Vallejo A. A guide to computational methods for G-quadruplex prediction. Nucleic Acids Res. 2020;48(3):1603. https://doi.org/10.1093/nar/gkaa033.
Li Z, Qian SH, Wang F, Mohamed HI, Yang G, Chen Z-X, et al. G-quadruplexes in genomes of viruses infecting eukaryotes or prokaryotes are under different selection pressures from hosts. J Genet Genomics. 2022;49(1):20–9. https://doi.org/10.1016/j.jgg.2021.08.018.
Huppert JL, Balasubramanian S. Prevalence of quadruplexes in the human genome. Nucleic Acids Res. 2005;33(9):2908–16. https://doi.org/10.1093/nar/gki609.
Todd AK, Johnston M, Neidle S. Highly prevalent putative quadruplex sequence motifs in human DNA. Nucleic Acids Res. 2005;33(9):2901–7. https://doi.org/10.1093/nar/gki553.
Kikin O, D’Antonio L, Bagga PS. QGRS Mapper: a web-based server for predicting G-quadruplexes in nucleotide sequences. Nucleic Acids Res. 2006;34:W676–82. https://doi.org/10.1093/nar/gkl253.
Hon J, Martínek T, Zendulka J, Lexa M. pqsfinder: an exhaustive and imperfection-tolerant search tool for potential quadruplex-forming sequences in R. Bioinformatics. 2017;33(21):3373–9. https://doi.org/10.1093/bioinformatics/btx413.
Eddy J, Maizels N. Gene function correlates with potential for G4 DNA formation in the human genome. Nucleic Acids Res. 2006;34(14):3887–96. https://doi.org/10.1093/nar/gkl529.
Bedrat A, Lacroix L, Mergny JL. Re-evaluation of G-quadruplex propensity with G4Hunter. Nucleic Acids Res. 2016;44(4):1746–59. https://doi.org/10.1093/nar/gkw006.
Sahakyan AB, Chambers VS, Marsico G, Santner T, Di Antonio M, Balasubramanian S. Machine learning model for sequence-driven DNA G-quadruplex formation. Sci Rep. 2017;7(1):14535. https://doi.org/10.1038/s41598-017-14017-4.
Paramasivan S, Rujan I, Bolton PH. Circular dichroism of quadruplex DNAs: applications to structure, cation effects and ligand binding. Methods. 2007;43(4):324–31. https://doi.org/10.1016/j.ymeth.2007.02.009.
Webba da Silva M. NMR methods for studying quadruplex nucleic acids. Methods. 2007;43(4):264–77. https://doi.org/10.1016/j.ymeth.2007.05.007.
Lim KW, Lacroix L, Yue DJ, Lim JK, Lim JM, Phan AT. Coexistence of two distinct G-quadruplex conformations in the hTERT promoter. J Am Chem Soc. 2010;132(35):12331–42. https://doi.org/10.1021/ja101252n.
Bessi I, Jonker HR, Richter C, Schwalbe H. Involvement of long-lived intermediate states in the complex folding pathway of the human telomeric G-quadruplex. Angew Chem Int Ed Engl. 2015;54(29):8444–8. https://doi.org/10.1002/anie.201502286.
Han H, Hurley LH, Salazar M. A DNA polymerase stop assay for G-quadruplex-interactive compounds. Nucleic Acids Res. 1999;27(2):537–42. https://doi.org/10.1093/nar/27.2.537.
Williamson JR, Raghuraman MK, Cech TR. Monovalent cation-induced structure of telomeric DNA: the G-quartet model. Cell. 1989;59(5):871–80. https://doi.org/10.1016/0092-8674(89)90610-7.
Chambers VS, Marsico G, Boutell JM, Di Antonio M, Smith GP, Balasubramanian S. High-throughput sequencing of DNA G-quadruplex structures in the human genome. Nat Biotechnol. 2015;33(8):877–81. https://doi.org/10.1038/nbt.3295.
Schaffitzel C, Berger I, Postberg J, Hanes J, Lipps HJ, Plückthun A. In vitro generated antibodies specific for telomeric guanine-quadruplex DNA react with Stylonychia lemnae macronuclei. Proc Natl Acad Sci U S A. 2001;98(15):8572–7. https://doi.org/10.1073/pnas.141229498.
Fernando H, Rodriguez R, Balasubramanian S. Selective recognition of a DNA G-quadruplex by an engineered antibody. Biochemistry. 2008;47(36):9365–71. https://doi.org/10.1021/bi800983u.
Isalan M, Patel SD, Balasubramanian S, Choo Y. Selection of zinc fingers that bind single-stranded telomeric DNA in the G-quadruplex conformation. Biochemistry. 2001;40(3):830–6. https://doi.org/10.1021/bi001728v.
Scholz O, Hansen S, Plückthun A. G-quadruplexes are specifically recognized and distinguished by selected designed ankyrin repeat proteins. Nucleic Acids Res. 2014;42(14):9182–94. https://doi.org/10.1093/nar/gku571.
Shen J, Varshney D, Simeone A, Zhang X, Adhikari S, Tannahill D, et al. Promoter G-quadruplex folding precedes transcription and is controlled by chromatin. Genome Biol. 2021;22(1):143. https://doi.org/10.1186/s13059-021-02346-7.
Biffi G, Tannahill D, McCafferty J, Balasubramanian S. Quantitative visualization of DNA G-quadruplex structures in human cells. Nat Chem. 2013;5(3):182–6. https://doi.org/10.1038/nchem.1548.
Henderson A, Wu Y, Huang YC, Chavez EA, Platt J, Johnson FB, et al. Detection of G-quadruplex DNA in mammalian cells. Nucleic Acids Res. 2014;42(2):860–9. https://doi.org/10.1093/nar/gkt957.
Kazemier HG, Paeschke K, Lansdorp PM. Guanine quadruplex monoclonal antibody 1H6 cross-reacts with restrained thymidine-rich single stranded DNA. Nucleic Acids Res. 2017;45(10):5913–9. https://doi.org/10.1093/nar/gkx245.
Tseng TY, Wang ZF, Chien CH, Chang TC. In-cell optical imaging of exogenous G-quadruplex DNA by fluorogenic ligands. Nucleic Acids Res. 2013;41(22):10605–18. https://doi.org/10.1093/nar/gkt814.
Chen Y, Yan S, Yuan L, Zhou Y, Song Y, Xiao H, et al. Nonlinear optical dye TSQ1 as an efficiently selective fluorescent probe for G-quadruplex DNA. Org Chem Front. 2014;1(3):267–70. https://doi.org/10.1039/C3QO00048F.
Di Antonio M, Ponjavic A, Radzevičius A, Ranasinghe RT, Catalano M, Zhang X, et al. Single-molecule visualization of DNA G-quadruplex formation in live cells. Nat Chem. 2020;12(9):832–7. https://doi.org/10.1038/s41557-020-0506-4.
Cogoi S, Shchekotikhin AE, Membrino A, Sinkevich YB, Xodo LE. Guanidino anthrathiophenediones as G-quadruplex binders: uptake, intracellular localization, and anti-Harvey-Ras gene activity in bladder cancer cells. J Med Chem. 2013;56(7):2764–78. https://doi.org/10.1021/jm3019063.
Laguerre A, Wong JM, Monchaud D. Direct visualization of both DNA and RNA quadruplexes in human cells via an uncommon spectroscopic method. Sci Rep. 2016;6:32141. https://doi.org/10.1038/srep32141.
Shivalingam A, Izquierdo MA, Marois AL, Vyšniauskas A, Suhling K, Kuimova MK, et al. The interactions between a small molecule and G-quadruplexes are visualized by fluorescence lifetime imaging microscopy. Nat Commun. 2015;6:8178. https://doi.org/10.1038/ncomms9178.
Hänsel-Hertsch R, Spiegel J, Marsico G, Tannahill D, Balasubramanian S. Genome-wide mapping of endogenous G-quadruplex DNA structures by chromatin immunoprecipitation and high-throughput sequencing. Nat Protoc. 2018;13(3):551–64. https://doi.org/10.1038/nprot.2017.150.
Hänsel-Hertsch R, Beraldi D, Lensing SV, Marsico G, Zyner K, Parry A, et al. G-quadruplex structures mark human regulatory chromatin. Nat Genet. 2016;48(10):1267–72. https://doi.org/10.1038/ng.3662.
Li C, Wang H, Yin Z, Fang P, Xiao R, Xiang Y, et al. Ligand-induced native G-quadruplex stabilization impairs transcription initiation. Genome Res. 2021;31(9):1546–60. https://doi.org/10.1101/gr.275431.121.
Kaya-Okur HS, Wu SJ, Codomo CA, Pledger ES, Bryson TD, Henikoff JG, et al. CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat Commun. 2019;10(1):1930. https://doi.org/10.1038/s41467-019-09982-5.
Lyu J, Shao R, Kwong Yung PY, Elsässer SJ. Genome-wide mapping of G-quadruplex structures with CUT&Tag. Nucleic Acids Res. 2022;50(3):e13. https://doi.org/10.1093/nar/gkab1073.
Li C, Yin Z, Xiao R, Huang B, Cui Y, Wang H, et al. G-quadruplexes sense natural porphyrin metabolites for regulation of gene transcription and chromatin landscapes. Genome Biol. 2022;23(1):259. https://doi.org/10.1186/s13059-022-02830-8.
Amrane S, Ang RW, Tan ZM, Li C, Lim JK, Lim JM, et al. A novel chair-type G-quadruplex formed by a Bombyx mori telomeric sequence. Nucleic Acids Res. 2009;37(3):931–8. https://doi.org/10.1093/nar/gkn990.
Smargiasso N, Rosu F, Hsia W, Colson P, Baker ES, Bowers MT, et al. G-quadruplex DNA assemblies: loop length, cation identity, and multimer formation. J Am Chem Soc. 2008;130(31):10208–16. https://doi.org/10.1021/ja801535e.
Phan AT, Modi YS, Patel DJ. Propeller-type parallel-stranded G-quadruplexes in the human c-myc promoter. J Am Chem Soc. 2004;126(28):8710–6. https://doi.org/10.1021/ja048805k.
Bugaut A, Balasubramanian S. A sequence-independent study of the influence of short loop lengths on the stability and topology of intramolecular DNA G-quadruplexes. Biochemistry. 2008;47(2):689–97. https://doi.org/10.1021/bi701873c.
Lee MP, Parkinson GN, Hazel P, Neidle S. Observation of the coexistence of sodium and calcium ions in a DNA G-quadruplex ion channel. J Am Chem Soc. 2007;129(33):10106–7. https://doi.org/10.1021/ja0740869.
Liu W, Zhu H, Zheng B, Cheng S, Fu Y, Li W, et al. Kinetics and mechanism of G-quadruplex formation and conformational switch in a G-quadruplex of PS2.M induced by Pb2+. Nucleic Acids Res. 2012;40(9):4229–36. https://doi.org/10.1093/nar/gkr1310.
Pedroso IM, Duarte LF, Yanez G, Baker AM, Fletcher TM. Induction of parallel human telomeric G-quadruplex structures by Sr(2+). Biochem Biophys Res Commun. 2007;358(1):298–303. https://doi.org/10.1016/j.bbrc.2007.04.126.
Deng H, Braunlin WH. Kinetics of sodium ion binding to DNA quadruplexes. J Mol Biol. 1996;255(3):476–83. https://doi.org/10.1006/jmbi.1996.0039.
Haider S, Parkinson GN, Neidle S. Crystal structure of the potassium form of an Oxytricha nova G-quadruplex. J Mol Biol. 2002;320(2):189–200. https://doi.org/10.1016/s0022-2836(02)00428-x.
Zoroddu MA, Aaseth J, Crisponi G, Medici S, Peana M, Nurchi VM. The essential metals for humans: a brief overview. J Inorg Biochem. 2019;195:120–9. https://doi.org/10.1016/j.jinorgbio.2019.03.013.
Campbell NH, Neidle S. G-quadruplexes and metal ions. Met Ions Life Sci. 2012;10:119–34. https://doi.org/10.1007/978-94-007-2172-2_4.
Hardin CC, Perry AG, White K. Thermodynamic and kinetic characterization of the dissociation and assembly of quadruplex nucleic acids. Biopolymers. 2000;56(3):147–94. https://doi.org/10.1002/1097-0282(2000/2001)56:3%3c147::Aid-bip10011%3e3.0.Co;2-n.
Fry M, Loeb LA. Human werner syndrome DNA helicase unwinds tetrahelical structures of the fragile X syndrome repeat sequence d(CGG)n. J Biol Chem. 1999;274(18):12797–802. https://doi.org/10.1074/jbc.274.18.12797.
Mohaghegh P, Karow JK, Brosh RM Jr, Bohr VA, Hickson ID. The Bloom’s and Werner’s syndrome proteins are DNA structure-specific helicases. Nucleic Acids Res. 2001;29(13):2843–9. https://doi.org/10.1093/nar/29.13.2843.
Chatterjee S, Zagelbaum J, Savitsky P, Sturzenegger A, Huttner D, Janscak P, et al. Mechanistic insight into the interaction of BLM helicase with intra-strand G-quadruplex structures. Nat Commun. 2014;5:5556. https://doi.org/10.1038/ncomms6556.
Creacy SD, Routh ED, Iwamoto F, Nagamine Y, Akman SA, Vaughn JP. G4 resolvase 1 binds both DNA and RNA tetramolecular quadruplex with high affinity and is the major source of tetramolecular quadruplex G4-DNA and G4-RNA resolving activity in HeLa cell lysates. J Biol Chem. 2008;283(50):34626–34. https://doi.org/10.1074/jbc.M806277200.
Chakraborty P, Grosse F. Human DHX9 helicase preferentially unwinds RNA-containing displacement loops (R-loops) and G-quadruplexes. DNA Repair (Amst). 2011;10(6):654–65. https://doi.org/10.1016/j.dnarep.2011.04.013.
Chen MC, Tippana R, Demeshkina NA, Murat P, Balasubramanian S, Myong S, et al. Structural basis of G-quadruplex unfolding by the DEAH/RHA helicase DHX36. Nature. 2018;558(7710):465–9. https://doi.org/10.1038/s41586-018-0209-9.
Yangyuoru PM, Bradburn DA, Liu Z, Xiao TS, Russell R. The G-quadruplex (G4) resolvase DHX36 efficiently and specifically disrupts DNA G4s via a translocation-based helicase mechanism. J Biol Chem. 2018;293(6):1924–32. https://doi.org/10.1074/jbc.M117.815076.
Byrd AK, Raney KD. Structure and function of Pif1 helicase. Biochem Soc Trans. 2017;45(5):1159–71. https://doi.org/10.1042/bst20170096.
Wu Y, Shin-ya K, Brosh RM Jr. FANCJ helicase defective in Fanconia anemia and breast cancer unwinds G-quadruplex DNA to defend genomic stability. Mol Cell Biol. 2008;28(12):4116–28. https://doi.org/10.1128/mcb.02210-07.
Lin W, Sampathi S, Dai H, Liu C, Zhou M, Hu J, et al. Mammalian DNA2 helicase/nuclease cleaves G-quadruplex DNA and is required for telomere integrity. Embo j. 2013;32(10):1425–39. https://doi.org/10.1038/emboj.2013.88.
Wu Y, Sommers JA, Khan I, de Winter JP, Brosh RM Jr. Biochemical characterization of Warsaw breakage syndrome helicase. J Biol Chem. 2012;287(2):1007–21. https://doi.org/10.1074/jbc.M111.276022.
Gray LT, Vallur AC, Eddy J, Maizels N. G quadruplexes are genomewide targets of transcriptional helicases XPB and XPD. Nat Chem Biol. 2014;10(4):313–8. https://doi.org/10.1038/nchembio.1475.
David AP, Pipier A, Pascutti F, Binolfi A, Weiner AMJ, Challier E, et al. CNBP controls transcription by unfolding DNA G-quadruplex structures. Nucleic Acids Res. 2019;47(15):7901–13. https://doi.org/10.1093/nar/gkz527.
Cogoi S, Shchekotikhin AE, Xodo LE. HRAS is silenced by two neighboring G-quadruplexes and activated by MAZ, a zinc-finger transcription factor with DNA unfolding property. Nucleic Acids Res. 2014;42(13):8379–88. https://doi.org/10.1093/nar/gku574.
Ray S, Bandaria JN, Qureshi MH, Yildiz A, Balci H. G-quadruplex formation in telomeres enhances POT1/TPP1 protection against RPA binding. Proc Natl Acad Sci U S A. 2014;111(8):2990–5. https://doi.org/10.1073/pnas.1321436111.
González V, Guo K, Hurley L, Sun D. Identification and characterization of nucleolin as a c-myc G-quadruplex-binding protein. J Biol Chem. 2009;284(35):23622–35. https://doi.org/10.1074/jbc.M109.018028.
Niu K, Xiang L, Jin Y, Peng Y, Wu F, Tang W, et al. Identification of LARK as a novel and conserved G-quadruplex binding protein in invertebrates and vertebrates. Nucleic Acids Res. 2019;47(14):7306–20. https://doi.org/10.1093/nar/gkz484.
Pramanik S, Chen Y, Song H, Khutsishvili I, Marky LA, Ray S, et al. The human AP-endonuclease 1 (APE1) is a DNA G-quadruplex structure binding protein and regulates KRAS expression in pancreatic ductal adenocarcinoma cells. Nucleic Acids Res. 2022;50(6):3394–412. https://doi.org/10.1093/nar/gkac172.
Zyner KG, Mulhearn DS, Adhikari S, Martínez Cuesta S, Di Antonio M, Erard N, et al. Genetic interactions of G-quadruplexes in humans. eLife. 2019;8:e46793. https://doi.org/10.7554/eLife.46793.
Zhang X, Spiegel J, Martínez Cuesta S, Adhikari S, Balasubramanian S. Chemical profiling of DNA G-quadruplex-interacting proteins in live cells. Nat Chem. 2021;13(7):626–33. https://doi.org/10.1038/s41557-021-00736-9.
Bayona-Feliu A, Barroso S, Muñoz S, Aguilera A. The SWI/SNF chromatin remodeling complex helps resolve R-loop-mediated transcription–replication conflicts. Nat Genet. 2021;53(7):1050–63. https://doi.org/10.1038/s41588-021-00867-2.
Sarkies P, Reams C, Simpson LJ, Sale JE. Epigenetic instability due to defective replication of structured DNA. Mol Cell. 2010;40(5):703–13. https://doi.org/10.1016/j.molcel.2010.11.009.
Cui Y, Kong D, Ghimire C, Xu C, Mao H. Mutually exclusive formation of G-quadruplex and i-motif is a general phenomenon governed by steric hindrance in duplex DNA. Biochemistry. 2016;55(15):2291–9. https://doi.org/10.1021/acs.biochem.6b00016.
Duquette ML, Handa P, Vincent JA, Taylor AF, Maizels N. Intracellular transcription of G-rich DNAs induces formation of G-loops, novel structures containing G4 DNA. Genes Dev. 2004;18(13):1618–29. https://doi.org/10.1101/gad.1200804.
Kouzine F, Wojtowicz D, Baranello L, Yamane A, Nelson S, Resch W, et al. Permanganate/S1 Nuclease footprinting reveals non-B DNA structures with regulatory potential across a mammalian genome. Cell Syst. 2017;4(3):344-56.e7. https://doi.org/10.1016/j.cels.2017.01.013.
Zyner KG, Simeone A, Flynn SM, Doyle C, Marsico G, Adhikari S, et al. G-quadruplex DNA structures in human stem cells and differentiation. Nat Commun. 2022;13(1):142. https://doi.org/10.1038/s41467-021-27719-1.
Paeschke K, Juranek S, Simonsson T, Hempel A, Rhodes D, Lipps HJ. Telomerase recruitment by the telomere end binding protein-beta facilitates G-quadruplex DNA unfolding in ciliates. Nat Struct Mol Biol. 2008;15(6):598–604. https://doi.org/10.1038/nsmb.1422.
Huppert JL, Balasubramanian S. G-quadruplexes in promoters throughout the human genome. Nucleic Acids Res. 2007;35(2):406–13. https://doi.org/10.1093/nar/gkl1057.
Xu Y, Sugiyama H. Formation of the G-quadruplex and i-motif structures in retinoblastoma susceptibility genes (Rb). Nucleic Acids Res. 2006;34(3):949–54. https://doi.org/10.1093/nar/gkj485.
Guo K, Pourpak A, Beetz-Rogers K, Gokhale V, Sun D, Hurley LH. Formation of pseudosymmetrical G-quadruplex and i-motif structures in the proximal promoter region of the RET oncogene. J Am Chem Soc. 2007;129(33):10220–8. https://doi.org/10.1021/ja072185g.
Guo K, Gokhale V, Hurley LH, Sun D. Intramolecularly folded G-quadruplex and i-motif structures in the proximal promoter of the vascular endothelial growth factor gene. Nucleic Acids Res. 2008;36(14):4598–608. https://doi.org/10.1093/nar/gkn380.
Sun D, Hurley LH. The importance of negative superhelicity in inducing the formation of G-quadruplex and i-motif structures in the c-Myc promoter: implications for drug targeting and control of gene expression. J Med Chem. 2009;52(9):2863–74. https://doi.org/10.1021/jm900055s.
Kendrick S, Akiyama Y, Hecht SM, Hurley LH. The i-motif in the bcl-2 P1 promoter forms an unexpectedly stable structure with a unique 8:5:7 loop folding pattern. J Am Chem Soc. 2009;131(48):17667–76. https://doi.org/10.1021/ja9076292.
Siddiqui-Jain A, Grand CL, Bearss DJ, Hurley LH. Direct evidence for a G-quadruplex in a promoter region and its targeting with a small molecule to repress c-MYC transcription. Proc Natl Acad Sci U S A. 2002;99(18):11593–8. https://doi.org/10.1073/pnas.182256799.
Cogoi S, Xodo LE. G-quadruplex formation within the promoter of the KRAS proto-oncogene and its effect on transcription. Nucleic Acids Res. 2006;34(9):2536–49. https://doi.org/10.1093/nar/gkl286.
Bejugam M, Sewitz S, Shirude PS, Rodriguez R, Shahid R, Balasubramanian S. Trisubstituted isoalloxazines as a new class of G-quadruplex binding ligands: small molecule regulation of c-kit oncogene expression. J Am Chem Soc. 2007;129(43):12926–7. https://doi.org/10.1021/ja075881p.
Kumar P, Yadav VK, Baral A, Kumar P, Saha D, Chowdhury S. Zinc-finger transcription factors are associated with guanine quadruplex motifs in human, chimpanzee, mouse and rat promoters genome-wide. Nucleic Acids Res. 2011;39(18):8005–16. https://doi.org/10.1093/nar/gkr536.
Wu Y, Brosh RM Jr. G-quadruplex nucleic acids and human disease. Febs j. 2010;277(17):3470–88. https://doi.org/10.1111/j.1742-4658.2010.07760.x.
Thakur RK, Kumar P, Halder K, Verma A, Kar A, Parent JL, et al. Metastases suppressor NM23-H2 interaction with G-quadruplex DNA within c-MYC promoter nuclease hypersensitive element induces c-MYC expression. Nucleic Acids Res. 2009;37(1):172–83. https://doi.org/10.1093/nar/gkn919.
Borgognone M, Armas P, Calcaterra NB. Cellular nucleic-acid-binding protein, a transcriptional enhancer of c-Myc, promotes the formation of parallel G-quadruplexes. Biochem J. 2010;428(3):491–8. https://doi.org/10.1042/bj20100038.
Raiber EA, Kranaster R, Lam E, Nikan M, Balasubramanian S. A non-canonical DNA structure is a binding motif for the transcription factor SP1 in vitro. Nucleic Acids Res. 2012;40(4):1499–508. https://doi.org/10.1093/nar/gkr882.
Belotserkovskii BP, Soo Shin JH, Hanawalt PC. Strong transcription blockage mediated by R-loop formation within a G-rich homopurine-homopyrimidine sequence localized in the vicinity of the promoter. Nucleic Acids Res. 2017;45(11):6589–99. https://doi.org/10.1093/nar/gkx403.
Agarwal T, Roy S, Kumar S, Chakraborty TK, Maiti S. In the sense of transcription regulation by G-quadruplexes: asymmetric effects in sense and antisense strands. Biochemistry. 2014;53(23):3711–8. https://doi.org/10.1021/bi401451q.
Holder IT, Hartig JS. A matter of location: influence of G-quadruplexes on Escherichia coli gene expression. Chem Biol. 2014;21(11):1511–21. https://doi.org/10.1016/j.chembiol.2014.09.014.
Du Z, Zhao Y, Li N. Genome-wide analysis reveals regulatory role of G4 DNA in gene transcription. Genome Res. 2008;18(2):233–41. https://doi.org/10.1101/gr.6905408.
Wanrooij PH, Uhler JP, Shi Y, Westerlund F, Falkenberg M, Gustafsson CM. A hybrid G-quadruplex structure formed between RNA and DNA explains the extraordinary stability of the mitochondrial R-loop. Nucleic Acids Res. 2012;40(20):10334–44. https://doi.org/10.1093/nar/gks802.
Zheng KW, Xiao S, Liu JQ, Zhang JY, Hao YH, Tan Z. Co-transcriptional formation of DNA:RNA hybrid G-quadruplex and potential function as constitutional cis element for transcription control. Nucleic Acids Res. 2013;41(10):5533–41. https://doi.org/10.1093/nar/gkt264.
Cree SL, Fredericks R, Miller A, Pearce FG, Filichev V, Fee C, et al. DNA G-quadruplexes show strong interaction with DNA methyltransferases in vitro. FEBS Lett. 2016;590(17):2870–83. https://doi.org/10.1002/1873-3468.12331.
Mao S-Q, Ghanbarian AT, Spiegel J, Martínez Cuesta S, Beraldi D, Di Antonio M, et al. DNA G-quadruplex structures mold the DNA methylome. Nat Struct Mol Biol. 2018;25(10):951–7. https://doi.org/10.1038/s41594-018-0131-8.
Saha D, Singh A, Hussain T, Srivastava V, Sengupta S, Kar A, et al. Epigenetic suppression of human telomerase (hTERT) is mediated by the metastasis suppressor NME2 in a G-quadruplex-dependent fashion. J Biol Chem. 2017;292(37):15205–15. https://doi.org/10.1074/jbc.M117.792077.
Hussain T, Saha D, Purohit G, Kar A, Kishore Mukherjee A, Sharma S, et al. Transcription regulation of CDKN1A (p21/CIP1/WAF1) by TRF2 is epigenetically controlled through the REST repressor complex. Sci Rep. 2017;7(1):11541. https://doi.org/10.1038/s41598-017-11177-1.
Lim KW, Amrane S, Bouaziz S, Xu W, Mu Y, Patel DJ, et al. Structure of the human telomere in K+ solution: a stable basket-type G-quadruplex with only two G-tetrad layers. J Am Chem Soc. 2009;131(12):4301–9. https://doi.org/10.1021/ja807503g.
Takahama K, Takada A, Tada S, Shimizu M, Sayama K, Kurokawa R, et al. Regulation of telomere length by G-quadruplex telomere DNA- and TERRA-binding protein TLS/FUS. Chem Biol. 2013;20(3):341–50. https://doi.org/10.1016/j.chembiol.2013.02.013.
Zhang M, Wang B, Li T, Liu R, Xiao Y, Geng X, et al. Mammalian CST averts replication failure by preventing G-quadruplex accumulation. Nucleic Acids Res. 2019;47(10):5243–59. https://doi.org/10.1093/nar/gkz264.
Vannier JB, Pavicic-Kaltenbrunner V, Petalcorin MI, Ding H, Boulton SJ. RTEL1 dismantles T loops and counteracts telomeric G4-DNA to maintain telomere integrity. Cell. 2012;149(4):795–806. https://doi.org/10.1016/j.cell.2012.03.030.
Sfeir A, Kosiyatrakul ST, Hockemeyer D, MacRae SL, Karlseder J, Schildkraut CL, et al. Mammalian telomeres resemble fragile sites and require TRF1 for efficient replication. Cell. 2009;138(1):90–103. https://doi.org/10.1016/j.cell.2009.06.021.
Zahler AM, Williamson JR, Cech TR, Prescott DM. Inhibition of telomerase by G-quartet DNA structures. Nature. 1991;350(6320):718–20. https://doi.org/10.1038/350718a0.
Moye AL, Porter KC, Cohen SB, Phan T, Zyner KG, Sasaki N, et al. Telomeric G-quadruplexes are a substrate and site of localization for human telomerase. Nat Commun. 2015;6:7643. https://doi.org/10.1038/ncomms8643.
Kruisselbrink E, Guryev V, Brouwer K, Pontier DB, Cuppen E, Tijsterman M. Mutagenic capacity of endogenous G4 DNA underlies genome instability in FANCJ-defective C. elegans. Curr Biol. 2008;18(12):900–5. https://doi.org/10.1016/j.cub.2008.05.013.
Puget N, Miller KM, Legube G. Non-canonical DNA/RNA structures during transcription-coupled double-strand break repair: roadblocks or bona fide repair intermediates. DNA Repair (Amst). 2019;81:102661. https://doi.org/10.1016/j.dnarep.2019.102661.
Técher H, Koundrioukoff S, Nicolas A, Debatisse M. The impact of replication stress on replication dynamics and DNA damage in vertebrate cells. Nat Rev Genet. 2017;18(9):535–50. https://doi.org/10.1038/nrg.2017.46.
Castillo Bosch P, Segura-Bayona S, Koole W, van Heteren JT, Dewar JM, Tijsterman M, et al. FANCJ promotes DNA synthesis through G-quadruplex structures. Embo j. 2014;33(21):2521–33. https://doi.org/10.15252/embj.201488663.
Biernacka A, Zhu Y, Skrzypczak M, Forey R, Pardo B, Grzelak M, et al. i-BLESS is an ultra-sensitive method for detection of DNA double-strand breaks. Commun Biol. 2018;1:181. https://doi.org/10.1038/s42003-018-0165-9.
Paeschke K, Bochman ML, Garcia PD, Cejka P, Friedman KL, Kowalczykowski SC, et al. Pif1 family helicases suppress genome instability at G-quadruplex motifs. Nature. 2013;497(7450):458–62. https://doi.org/10.1038/nature12149.
Lansdorp P, van Wietmarschen N. Helicases FANCJ, RTEL1 and BLM Act on guanine quadruplex DNA in vivo. Genes (Basel). 2019. https://doi.org/10.3390/genes10110870.
Marchetti C, Zyner KG, Ohnmacht SA, Robson M, Haider SM, Morton JP, et al. Targeting multiple effector pathways in pancreatic ductal adenocarcinoma with a G-quadruplex-binding small molecule. J Med Chem. 2018;61(6):2500–17. https://doi.org/10.1021/acs.jmedchem.7b01781.
Ohnmacht SA, Marchetti C, Gunaratnam M, Besser RJ, Haider SM, Di Vita G, et al. A G-quadruplex-binding compound showing anti-tumour activity in an in vivo model for pancreatic cancer. Sci Rep. 2015;5:11385. https://doi.org/10.1038/srep11385.
Rodriguez R, Miller KM, Forment JV, Bradshaw CR, Nikan M, Britton S, et al. Small-molecule-induced DNA damage identifies alternative DNA structures in human genes. Nat Chem Biol. 2012;8(3):301–10. https://doi.org/10.1038/nchembio.780.
Neidle S, Parkinson G. Telomere maintenance as a target for anticancer drug discovery. Nat Rev Drug Discov. 2002;1(5):383–93. https://doi.org/10.1038/nrd793.
Xu H, Di Antonio M, McKinney S, Mathew V, Ho B, O’Neil NJ, et al. CX-5461 is a DNA G-quadruplex stabilizer with selective lethality in BRCA1/2 deficient tumours. Nat Commun. 2017;8:14432. https://doi.org/10.1038/ncomms14432.
McLuckie KI, Di Antonio M, Zecchini H, Xian J, Caldas C, Krippendorff BF, et al. G-quadruplex DNA as a molecular target for induced synthetic lethality in cancer cells. J Am Chem Soc. 2013;135(26):9640–3. https://doi.org/10.1021/ja404868t.
Acknowledgements
We thank all members of Chen lab for helpful discussions. We also thank Dr. Conghui Li from Wuhan University for helpful comments.
Funding
This work was supported by the The science and technology major program of Hubei Province (2021ABA011); the Foundation of Hubei Hongshan Laboratory (2021hszd012, 2022hszd024, 2022hszd028); and HZAU-AGIS Cooperation Fund (SZYJY2021010). Funding for open access charge: Foundation of Hubei Hongshan Laboratory.
Author information
Authors and Affiliations
Contributions
ZC conceived and supervised the study and provided resources; ZZ and SQ wrote the manuscript, made the tables and created the figures; ZC and DW made manuscript revisions.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Zhang, ZH., Qian, S.H., Wei, D. et al. In vivo dynamics and regulation of DNA G-quadruplex structures in mammals. Cell Biosci 13, 117 (2023). https://doi.org/10.1186/s13578-023-01074-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13578-023-01074-8