Method Article
The combination of chromatin immunoprecipitation and ultra-high-throughput sequencing (ChIP-seq) can identify and map protein-DNA interactions in a given tissue or cell line. Outlined is how to generate a high quality ChIP template for subsequent sequencing, using experience with the transcription factor TCF7L2 as an example.
ChIP-sequencing (ChIP-seq) methods directly offer whole-genome coverage, where combining chromatin immunoprecipitation (ChIP) and massively parallel sequencing can be utilized to identify the repertoire of mammalian DNA sequences bound by transcription factors in vivo. "Next-generation" genome sequencing technologies provide 1-2 orders of magnitude increase in the amount of sequence that can be cost-effectively generated over older technologies thus allowing for ChIP-seq methods to directly provide whole-genome coverage for effective profiling of mammalian protein-DNA interactions.
For successful ChIP-seq approaches, one must generate high quality ChIP DNA template to obtain the best sequencing outcomes. The description is based around experience with the protein product of the gene most strongly implicated in the pathogenesis of type 2 diabetes, namely the transcription factor transcription factor 7-like 2 (TCF7L2). This factor has also been implicated in various cancers.
Outlined is how to generate high quality ChIP DNA template derived from the colorectal carcinoma cell line, HCT116, in order to build a high-resolution map through sequencing to determine the genes bound by TCF7L2, giving further insight in to its key role in the pathogenesis of complex traits.
For many years there has been an unmet need to identify the set of genes bound and regulated by a given protein genome wide, in particular those in the transcription factor class.
Odom et al.1 used chromatin immunoprecipitation (ChIP) combined with promoter microarrays to systematically identify the genes occupied by pre-specified transcriptional regulators in human liver and pancreatic islets. Subsequently, Johnson et al.2 developed a large-scale chromatin immunoprecipitation assay based on direct ultra high-throughput DNA sequencing (ChIP-seq) in order to comprehensively map protein-DNA interactions across entire mammalian genomes. As a test case, they mapped in vivo the binding of the neuron-restrictive silencer factor (NRSF) to 1946 locations in the human genome. The data displayed sharp resolution of binding position (+50 base pairs), which facilitated both the isolation of motifs and the identification of NRSF-binding motifs. These ChIP-seq data also had high sensitivity and specificity and statistical confidence (P < 10−4), properties that are important for inferring new candidate interactions.
Robertson et al.3 also used ChIP-seq in order to map STAT1 targets in interferon-γ (IFN-γ)-stimulated and unstimulated human HeLa S3 cells in vivo. By ChIP-seq, using 15.1 and 12.9 million uniquely mapped sequence reads, and an estimated false discovery rate of less than 0.001, they identified 41,582 and 11,004 putative STAT1-binding regions in stimulated and unstimulated cells, respectively. Of the 34 loci known to contain STAT1 interferon-responsive binding sites4-8, ChIP-seq found 24 (71%). ChIP-seq targets were enriched in sequences similar to known STAT1 binding motifs. Comparisons with two existing ChIP-PCR data sets suggested that ChIP-seq sensitivity was between 70% and 92% and specificity was at least 95%. Additionally, it was clear that ChIP-seq offers both low analytical complexity and sensitivity that increases with sequencing depth.
As such, "next-generation" genome sequencing technologies provide 1-2 orders of magnitude increase in the amount of sequence that can be cost-effectively generated over older technologies9. ChIP-seq methods therefore directly provide whole-genome coverage for effective profiling of mammalian protein-DNA interactions3.
In 2006, a strong association of variants in the transcription factor 7-like 2 (TCF7L2) gene with type 2 diabetes was discovered10. Other investigators have already independently replicated this finding in different ethnicities and, interestingly, from the first genome wide association studies of type 2 diabetes published in Nature11,12, Science13-15 and elsewhere16,17, the strongest association was indeed with TCF7L2; this is now considered the most significant genetic finding in type 2 diabetes to date18-20. In addition, TCF7L2 has been linked to cancer risk 21,22; indeed, this connection became more obvious when the 8q24 locus revealed by genome wide association studies of a number of cancers, including colorectal carcinomas, was shown to be due to an extreme upstream TCF7L2-binding element driving the transcription of MYC23,24. As such, there is great interest in determining the downstream genes regulated by this key transcription factor.
Based on experience with TCF7L2 as an example of the methodology, this paper outlines how to generate high quality ChIP DNA template. ChIP was carried out in the colorectal carcinoma cell line, HCT116, for subsequent sequencing in order build a high-resolution map of the genes bound by TCF7L225 in an endeavor to yield further insight in to its key role in the pathogenesis of complex traits.
Access restricted. Please log in or start a trial to view this content.
1. Cross-link Chromatin
2. Prepare Nuclei (Proceed to Step 3.5 for Whole Cell Lysate)
3. Sonication*
*For native ChIP, microccocal nuclease digestion may be used alternatively to shear the DNA.
4. Block Agarose Beads*
5. Pre-clear Chromatin
6. Immunoprecipitation
7. Elution
8. Reverse Cross-link
9. DNA Purification
*Alternatively, ChIP-grade magnetic beads can be used in place of agarose for the immunoprecipitation portion.
10. PCR check
Step 1: 94 °C 3 min
Step 2: 94 °C 20 sec
59 °C 30 sec
72 °C 30 sec
(Repeat Step 2 for at least 30 cycles)
Step 3: 72 °C 2 min
Access restricted. Please log in or start a trial to view this content.
Once the chromatin has been sonicated and have been treated with RNase and Proteinase, the samples run on the 2% agarose gel should present a smear with the bulk of the DNA at the desired size. If several different cycles are tested, a gradual decrease in size should be seen as the number of cycles increase (Figure 2).
After completing the immunoprecipitation portion of the protocol the enrichment can either be checked by PCR or real-time PCR. For PCR samples run on an a...
Access restricted. Please log in or start a trial to view this content.
It is now feasible to carry out a genome-wide profile of protein-DNA interactions association using ChIP-seq, as has been very recently demonstrated with other transcription factors2,3. The key to a successful sequencing outcome is the generation of a high quality chromatin immunoprecipitation DNA template.
Once the DNA template has been generated and ascertained to be adequately enriched, one can then take it in library preparation for subsequent sequencing. For example, one...
Access restricted. Please log in or start a trial to view this content.
The authors declare that they have no competing financial interests.
The work is supported by an Institute Development Award from The Children's Hospital of Philadelphia.
Access restricted. Please log in or start a trial to view this content.
| Name | Company | Catalog Number | Comments |
|---|---|---|---|
| QIAquick PCR Purification Kit | Qiagen | 28104 | |
| EZ-ChIP Kit | Millipore | 17-371 | |
| GoTaq Hot Start Polymerase | Promega | M5001 | |
| Misonix Sonicator | Qsonica | XL-2000 | |
| NanoDrop 1000 Spectrophotometer | Thermo-Scientific | ||
| Positive control primer sequences (TCF7L2-1) Forward- 5'-TCGCCCTGTCAATAATCTCC-3' Reverse- 5'-GCTCACCTCCTGTATCTTCG-3' Negative control primer sequences (CTRL-1) Forward-5'-ATGTGGTGTGGCTGTGATGGGAAC-3' Reverse- 5'-CGAGCAATCGGTAAATAGGTCTGG-3' | |||
Access restricted. Please log in or start a trial to view this content.
Request permission to reuse the text or figures of this JoVE article
Request Permission