A subscription to JoVE is required to view this content. Sign in or start your free trial.
Method Article
The identification of physical interactions between genes and regulatory elements is challenging but has been facilitated by chromosome conformation capture methods. This modification to the 4C-seq protocol mitigates PCR bias by minimizing over-amplification of PCR templates and maximizes the mappability of reads by incorporating an addition restriction enzyme digest step.
The identification of regulatory elements for a given target gene poses a significant technical challenge owing to the variability in the positioning and effect sizes of regulatory elements to a target gene. Some progress has been made with the bioinformatic prediction of the existence and function of proximal epigenetic modifications associated with activated gene expression using conserved transcription factor binding sites. Chromatin conformation capture studies have revolutionized our ability to discover physical chromatin contacts between sequences and even within an entire genome. Circular chromatin conformation capture coupled with next-generation sequencing (4C-seq), in particular, is designed to discover all possible physical chromatin interactions for a given sequence of interest (viewpoint), such as a target gene or a regulatory enhancer. Current 4C-seq strategies directly sequence from within the viewpoint but require numerous and diverse viewpoints to be simultaneously sequenced to avoid the technical challenges of uniform base calling (imaging) with next generation sequencing platforms. This volume of experiments may not be practical for many laboratories. Here, we report a modified approach to the 4C-seq protocol that incorporates both an additional restriction enzyme digest and qPCR-based amplification steps that are designed to facilitate a greater capture of diverse sequence reads and mitigate the potential for PCR bias, respectively. Our modified 4C method is amenable to the standard molecular biology lab for assessing chromatin architecture.
The identification of regulatory elements for gene expression has been facilitated by the Encyclopedia of DNA Elements (ENCODE) Project that comprehensively annotated functional activity for 80% of the human genome1,2. The identification of the sites for in vivo transcription factor binding, DNaseI hypersensitivity, and epigenetic histone and DNA methylation modifications in individual cell types paved the way for the functional analyses of candidate regulatory elements for target gene expression. Armed with these findings, we are faced with the challenge of determining the functional interconnectivity between regulatory elements and genes. Specifically, what is the relationship between a given target gene and its enhancer(s)? The chromatin conformation capture (3C) method directly addresses this question by identifying physical, and likely functional, interactions between a region of interest and candidate interacting sequences through captured events in fixed chromatin3. As our understanding of chromatin interactions has increased, however, it is clear that the investigation of preselected candidate loci is insufficient to provide a complete understanding of gene-enhancer interactions. For example, ENCODE used the high-throughput chromosomal conformation capture carbon copy (5C) method to examine a small portion of the human genome (1%, pilot set of 44 loci) and reported complex interconnectivity of the loci. Genes and enhancers with identified interactions averaged 2–4 different interacting partners, many of which were hundreds of kilobases away in linear space4. Further, Li et al. used Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET) to analyze whole-genome promoter interactions and found that 65% of RNA polymerase II binding sites were involved in chromatin interactions. Some of these interactions resulted in large, multi-gene complexes spanning hundreds of kilobases of genomic distance and containing, on average, 8-9 genes each5. Together, these findings highlight the need for unbiased whole-genome methods for interrogating chromatin interactions. Some of these methods are reviewed in Schmitt et al.6.
More recent methods for chromatin conformation capture studies coupled with next-generation sequencing (Hi-C and 4C-seq) enable the discovery of unknown sequences interacting with a region of interest6. Specifically, circular chromosome conformation capture with next-generation sequencing (4C-seq) was developed to identify loci interacting with a sequence of interest in an unbiased manner7 by sequencing DNA from captured chromatin proximal to the region of interest in 3D space. Briefly, chromatin is fixed to preserve its native protein-DNA interactions, cleaved with a restriction enzyme, and subsequently ligated under dilute conditions to capture biologically relevant "tangles" of interacting loci (Figure 1). The cross links are reversed to remove the protein, thus leaving the DNA available for additional cleavage with a second restriction enzyme. A final ligation generates smaller circles of interacting loci. Primers to the sequence of interest are then used to generate an amplified library of unknown sequences from the circularized fragments, followed by downstream next generation sequencing.
The protocol presented here, which focuses on sample preparation, makes two major alterations to existing 4C-seq methods8,9,10,11,12. First, it uses a qPCR-based method to empirically determine the optimal number of amplification cycles for 4C-seq library preparation steps and thus mitigates the potential for PCR bias stemming from over-amplification of libraries. Second, it uses an additional restriction digest step in an effort to reduce the uniformity of known "bait" sequences that hinders accurate base-calling by the sequencing instrument and, hence, maximizes the unique, informative sequence in each read. Other protocols circumvent this issue by pooling many (12-15)8 4C-seq libraries with different bait sequences and/or restriction sites, a volume of experiments which may not be achievable by other laboratories. The modifications presented here allow a small number of experiments, samples, and/or replicates to be indexed and pooled into a single lane.
Access restricted. Please log in or start a trial to view this content.
1. Restriction Enzyme Selection
2. Design and Test Primers for Inverse PCR and Digestion Efficiency qPCR
3. Collection of Cells
4. Formaldehyde Cross-linking of Cells to Preserve Chromatin Interactions
5. Cell Lysis
6. First Restriction Digestion
7. First Ligation
8. Reverse Cross-linking and Isolate Chromatin
9. Second Restriction Digestion: Trimming the Circles
NOTE: This step creates smaller circles to minimize the overrepresentation of smaller captured fragments due to PCR bias in downstream amplification steps.
10. Second Ligation and DNA Purification
11. PCR Amplify Unknown Interacting Sequences by Inverse PCR
12. Third Restriction Digestion: Trim off Bait Sequences
NOTE: This step removes non-informative bait sequences from the inverse PCR products to maximize informative captured sequences in the downstream sequencing steps. To monitor digest efficiency, a “digest monitor”15 is digested in parallel using equivalent DNA and enzyme concentration. If RE1 and RE2 are incompatible for simultaneous digestion, for example, due to different optimal incubation temperatures or reaction buffers, this must be done as a sequential digest (this is not ideal).
13. Preparation of Sequencing Library
14. Sequencing and Analysis of Sequencing Data
15. Analysis of Sequence Data
Access restricted. Please log in or start a trial to view this content.
Primary human keratinocytes were isolated from 2–3 discarded neonatal foreskins, pooled, and cultured in KSFM supplemented with 30 µg/mL bovine pituitary extract, 0.26 ng/mL recombinant human epidermal growth factor, and 0.09 mM calcium chloride (CaCl2) at 37 °C, 5% carbon dioxide. The cells were split into two flasks, and one flask was differentiated by the addition of CaCl2 to a final concentration of 1.2 mM for 72 h. 107 cells each from ...
Access restricted. Please log in or start a trial to view this content.
4C results have the potential to reveal chromatin interactions that can identify previously unknown regulatory elements and/or target genes that are important in a specific biological context24,25,26. However, technical hurdles may limit the data obtained from these experiments. PCR bias stemming from over-amplification of template in 4C protocols is likely. This protocol addresses this issue by utilizing qPCR to determine the o...
Access restricted. Please log in or start a trial to view this content.
The authors declare that they have no competing financial interests.
This work was supported by NIAMS (R01AR065523).
Access restricted. Please log in or start a trial to view this content.
Name | Company | Catalog Number | Comments |
HindIII | NEB | R0104S | |
CviQI | NEB | R0639S | |
DNA oligonucleotide primers | IDT | To be designed by the reader | |
50 mL conical centrifuge tubes | Fisher Scientific | 06-443-19 | |
1.7 mL microcentrifuge tubes | MidSci | AVSS1700 | |
Phosphate buffered saline | Thermo Fisher | 14190-136 | |
Formaldehyde, methanol free | Electron Microscopy Sciences | 15710 | |
Nutator | VWR | 15172-203 | |
Glycine | JT Baker | 4059-00 | |
Benchtop centrifuge | |||
Refrigerated microcentrifuge | |||
Ethylenediaminetetraacetic acid (EDTA) | Sigma-Aldrich | ED2SS | |
20% SDS solution | Sigma-Aldrich | 05030 | |
Trypan Blue | Thermo Fisher | 15250061 | |
Glass slides | Fisher Scientific | 12-550-143 | |
Cover slips | VWR | 16004-094 | |
Light microscope | |||
Triton X-100 | Alfa Aesar | A16046 | |
Shaking heat block | |||
2 M Tris-HCl | Quality Biological | 351-048-101 | |
Proteinase K | NEB | P8107S | |
Phenol:chloroform:isoamyl alcohol (25:24:1) | Sigma-Aldrich | P2069 | |
Sodium acetate | Sigma-Aldrich | M5661 | |
20 mg/mL glycogen | Thermo Fisher | R0561 | |
Ethanol | Fisher Scientific | 04-355-223 | |
Nuclease-free water | Fisher Scientific | MT-46-000-CM | |
qPCR cycler | Thermo Fisher | 4453536 | |
qPCR plates | Thermo Fisher | 4309849 | |
Thermocycler | Thermo Fisher | 4375786 | |
PCR strip tubes | MidSci | AVSST-FL | |
1 M Magnesium chloride | Quality Biological | 351-033-721 | |
Dithiothreotol | Sigma-Aldrich | 43815 | |
Adenosine triphosphate | Sigma-Aldrich | A2383 | |
T4 DNA Ligase | NEB | M0202S | |
Agarose | Sigma-Aldrich | A6013 | |
RNase A | Thermo Fisher | EN0531 | |
Qiaquick PCR purification kit | Qiagen | 28104 | |
MinElute PCR Purification kit | Qiagen | 28004 | |
Spectrophotometer | |||
Expand Long Template PCR System | Sigma-Aldrich | 11681834001 | |
dNTP mix | Thermo Fisher | R0191 | |
SYBR Green I | Sigma-Aldrich | S9430 | |
ROX | BioRad | 172-5858 | |
Sodium chloride | Sigma-Aldrich | S5886 | |
End-It DNA End-Repair Kit | Lucigen | ER0720 | |
LigaFast Rapid DNA Ligation System | Promega | M8221 | |
SYBR Safe | Thermo Fisher | S33102 | |
Taq Polymerase | NEB | M0267S | |
UltraSieve Agarose | IBI Scientific | IB70054 | |
Qiaquick Gel Extraction Kit | Qiagen | 28704 |
Access restricted. Please log in or start a trial to view this content.
Request permission to reuse the text or figures of this JoVE article
Request PermissionThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. All rights reserved