A subscription to JoVE is required to view this content. Sign in or start your free trial.
Method Article
Crystal structure of protein–DNA complexes can provide insight into protein function, mechanism, as well as, the nature of the specific interaction. Here, we report how to optimize the length, sequence and ends of duplex DNA for co-crystallization with Escherichia coli SeqA, a negative regulator of replication initiation.
Escherichia coli SeqA is a negative regulator of DNA replication that prevents premature reinitiation events by sequestering hemimethylated GATC clusters within the origin of replication1. Beyond the origin, SeqA is found at the replication forks, where it organizes newly replicated DNA into higher ordered structures2. SeqA associates only weakly with single GATC sequences, but it forms high affinity complexes with DNA duplexes containing multiple GATC sites. The minimal functional and structural unit of SeqA is a dimer, thereby explaining the requirement of at least two GATC sequences to form a high-affinity complex with hemimethylated DNA3. Additionally, the SeqA architecture, with the oligomerization and DNA-binding domains separated by a flexible linker, allows binding to GATC repeats separated by up to three helical turns. Therefore, understanding the function of SeqA at a molecular level requires the structural analysis of SeqA bound to multiple GATC sequences. In protein-DNA crystallization, DNA can have none to an exceptional effect on the packing interactions depending on the relative sizes and architecture of the protein and the DNA. If the protein is larger than the DNA or footprints most of the DNA, the crystal packing is primarily mediated by protein-protein interactions. Conversely, when the protein is the same size or smaller than the DNA or it only covers a fraction of the DNA, DNA-DNA and DNA-protein interactions dominate crystal packing. Therefore, crystallization of protein-DNA complexes requires the systematic screening of DNA length4 and DNA ends (blunt or overhang)5-7. In this report, we describe how to design, optimize, purify and crystallize hemimethylated DNA duplexes containing tandem GATC repeats in complex with a dimeric variant of SeqA (SeqAΔ(41-59)-A25R) to obtain crystals suitable for structure determination.
1. Protein Purification
The flexible linker connecting the N- (oligomerization) and C-terminal (DNA binding) domains of SeqA aids the recognition of hemimethylated GATC repeats separated by one to three turns on the DNA. For this study, we used a dimeric variant of SeqA (SeqAΔ(41-59)-A25R) with a point mutation in the N-terminal domain that prevents further oligomerization and a shortened linker that restricts DNA binding to tandem GATC repeats separated exclusively by one turn on the DNA (Figure 1)2,8.
2. DNA Purification
3. Protein-DNA Complex Formation and Analysis
4. Representative Results
To obtain the crystal structure of SeqAΔ(41-59)-A25R bound to hemimethylated DNA, we consecutively optimized three parameters on the DNA: (i) separation between hemimethylated GATC sequences; (ii) length of the duplex; and (iii) the absence/presence of 5' overhangs.
Electro-mobility shift assays indicate that SeqAΔ(41-59)-A25R preferentially binds GATC repeats separated by 9-10 base pairs (Figure 1). Therefore, we initially screened duplexes 23-24 base pairs (bps) long containing two hemimethylated GATC sequences separated by either 9 or 10 bps. Three duplexes yielded nicely shaped crystals (Figure 2). Although none of the crystals diffracted to high resolution, the 23 bps long duplex with the two GATC sites separated by 9 bps diffracted X-rays better than the rest, indicating that a GATC separation of 9 bps was preferred for crystallization. Therefore, we fixed the inter-GATC spacing to 9 bps for all subsequent screens.
Generally, DNA crystallization is favored for duplex lengths corresponding to exact helical turns because multiple DNA molecules can stack head-to-tail to form a continuous B-DNA within the crystal9. Therefore, we shortened the overall length of the duplexes to 21 bps (i.e. two helical turns). While a 21 bps duplex with blunt ends did not yield diffraction quality crystals (data not shown), a 21 bps duplex with a single 5' overhang nucleotide on each end did yield crystals that diffracted to 5 Å in our home source. The improvement on the diffraction limit suggested that end-to-end duplex association was indeed favoring crystal packing.
Since SeqA interacts with GATC sequences that are on the same face of the DNA, the opposite face of the DNA duplex should be exposed to the solvent. To further improve the crystals of the complex, we then modified the sequence of the duplex to include a CG dinucleotide between the two GATC sites to promote groove-backbone interactions with adjacent DNA molecules in the crystal through the opposite face of the duplex, a method that has been used to enhance DNA crystallization in the past10. However, the diffraction limit of the crystals grown with the CG-containing duplex was identical to those grown with a similar DNA duplex that did not contain the CG dinucleotide (Figure 3). This result indicated that groove-backbone interactions were not important in this case. We subsequently optimized the length of the overhangs by comparing the crystals grown with a DNA duplex that had two additional nucleotides at each 5' end. This change had a drastic effect on crystal morphology, as well as, diffraction limit indicating that the additional nucleotide dramatically changed the molecular contacts and enhanced crystal organization (Figure 3).
The crystal structure of SeqAΔ(41-59)-A25R bound to this last DNA duplex confirmed that the free face of the DNA duplex does not engage in crystal contacts, as expected from the limited effect of introducing a CG-dinucleotide. Despite the relative size of protein and DNA, most interactions between symmetry mates are mediated by protein-protein and protein-DNA interactions (Figure 4). Interestingly, in this particular case, the beneficial effect of a 5' dinucleotide overhang is not due to the formation of a pseudo-continuous DNA. Instead, the 5' end of the methylated strand projects away from the DNA axes and interacts with the proximal SeqA molecule of the complex, explaining why two nucleotides were strictly required to change the crystal packing8,11.
Figure 1. Binding of SeqAΔ(41-59)-A25R to hemimethylated DNA. (A) Schematic diagram of the domains of SeqA that mediate protein oligomerization and DNA binding. (B) Electrophoretic mobility shift assay of SeqAΔ(41-59)-A25R with DNAs containing two hemimethylated GATC sequences separated by an increasing number of base pairs (X). The left-most lane (labeled Blank) contains an equimolar mixture of DNAs with 5, 7, 12, 21, 25 and 34 base pairs between the two GATC sequences in the absence of SeqAΔ(41-59)-A25R. (C) Diagram depicting the three variables optimized on the DNA duplexes to obtain diffraction quality crystals.
Figure 2. Effect of varying the inter-GATC distance. Summary of the oligonucleotides used and crystals obtained with 23-24 bp long duplexes containing two hemimethylated GATC sites separated by 9 or 10 base pairs. All pictures of crystals were taken at the same magnification and the scale bar indicates 100 μm. The resolution limit of each SeqAΔ(41-59)-A25R-DNA crystal is based on diffraction images collected on a Rigaku RU-300 X-ray generator system. Click here to view larger figure.
Figure 3. Effect of varying the DNA ends. Summary of the oligonucleotides used and crystals obtained with 21 bps long duplexes containing two hemimethylated GATC sites separated by 9 base pairs and including 0, 1 or 2 nucleotide 5' end overhangs. All pictures of crystals were taken at the same magnification and the scale bar indicates 100 μm. The resolution limit of each SeqAΔ(41-59)-A25R-DNA crystal is based on diffraction images collected at beamlines X12C and X29 (NSLS, BNL). Click here to view larger figure.
Figure 4. Crystal packing of SeqAΔ(41-59)-A25R bound to DNA with dinucleotide overhang: Viewed from the (A) top and the side (B). The asymmetric unit contains two SeqAΔ(41-59)-A25R-DNA complexes where the protein is shown in grey while the DNA is shown in orange. Symmetry related SeqAΔ(41-59)-A25R-DNA molecules are shown in white for the protein and yellow for the DNA. This figure was prepared using PyMOL12. This figure is related to Movie 1. Click here to view larger figure.
Movie 1. Click here to view movie.
One of the biggest challenges in macromolecular X-ray crystallography is obtaining diffraction quality crystals. In the case of protein or protein-DNA complexes, this challenge is exacerbated due to the additional variables that must be optimized. It is widely believed that the length of the DNA and the presence of sticky overhangs to enhance association of neighboring DNA molecules in a longer pseudo-duplex are the main parameters to optimize. However, we have shown that the nature and length of these overhangs can have...
No conflicts of interest declared.
The authors wish to thank the PXRR staff at the NSLS (Brookhaven National Laboratory) for assistance during data collection and Monica Pillon for help with DNA purification. This work was supported by the Canadian Institutes of Health Research (MOP 67189).
Name | Company | Catalog Number | Comments |
TRIS | Bioshop | TRS003.5 | |
Ethylenediaminetetraacetic acid (EDTA) | Fisher Scientific | E478-500 | |
Dithiotheitrol (DTT) | Bio Basic Inc. | DB0058 | |
NaCl | Bioshop | SOD002.10 | |
Glycerol | Caledon | 5350-1 | |
Sucrose | Sigma-Aldrich | S5016-500G | |
Sodium dodecyl sulfate (SDS) | Bioshop | SDS001.500 | |
Urea | Bioshop | URE001.5 | |
40% 29:1 Bis/acrylamide | Bio Basic Inc. | A0007-500ml | Store at 4 °C |
Boric acid | EMD | BX0865-1 | |
Xylene cyanol FF | Bio-Rad | 161-0423 | |
Bromophenol Blue | Bioshop | BR0222 | |
Dual Adjustable Vertical Gel System | C.B.C. Scientific Company Inc. | DASG-250 | |
Index crystallization screen | Hampton Research | HR2-144 | Store at 4 °C |
Wizard I crystallization screen | Emerald BioSystems | EBS-WIZ-1 | Store at 4 °C |
Wizard II crystallization screen | Emerald BioSystems | EBS-WIZ-2 | Store at 4 °C |
Classics crystallization screen | Qiagen | 130701 | Store at 4 °C |
Intelliplate trays | Art Robbins Instruments | 102-0001-00 | |
Solutions Protein purification buffer: 100 mM TRIS pH 8, 2 mM EDTA, 2 mM DTT and 5% glycerol. Protein storage buffer: 20 mM TRIS pH 8, 150 mM NaCl, 5 mM DTT, 0.5 mM EDTA and 5% glycerol. Gel loading mix: Add 20 g of sucrose, 25 mg of bromophenol blue, 25 mg of xylene cyanol FF, 1 ml of 10% w/v SDS and 10 ml of 10X TBE to 70 ml of autoclaved ddH2O. Stir with mild heating until sucrose is dissolved and adjust the final volume to 100 ml with autoclaved ddH2O. Store at 4 °C. 2X loading buffer: Add 11 g of urea to 10 ml of gel loading mix. Stir on a hot plate until urea dissolves. Aliquot in 2 ml tubes and store at 4 °C. 10X PAGE mix: Mix 420.4 g of urea, 100 ml of 10X TBE (autoclaved), 250 ml of 40% 29:1 Bis/Acrylamide in ddH2O. Stir until totally dissolved and adjust volume to 1 liter. Store in dark bottles at 4 °C. 10X TBE: Dissolve 108 g of TRIS, 55 g of boric acid and 9.3 g of EDTA in 1 liter of ddH2O. Autoclave and store at room temperature. Elution buffer: Dilute 8 ml of 5 M NaCl, 2 ml of 1 M TRIS pH 7.5, 0.4 ml of 0.5 M EDTA pH 8 on 200 ml of ddH2O. Autoclave and store at room temperature. |
Request permission to reuse the text or figures of this JoVE article
Request PermissionThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. All rights reserved