A subscription to JoVE is required to view this content. Sign in or start your free trial.
Method Article
We describe the engineering of a novel DNA-tethered T7 RNA polymerase to regulate in vitro transcription reactions. We discuss the steps for protein synthesis and characterization, validate proof-of-concept transcriptional regulation, and discuss its applications in molecular computing, diagnostics, and molecular information processing.
DNA nanotechnology enables programmable self-assembly of nucleic acids into user-prescribed shapes and dynamics for diverse applications. This work demonstrates that concepts from DNA nanotechnology can be used to program the enzymatic activity of the phage-derived T7 RNA polymerase (RNAP) and build scalable synthetic gene regulatory networks. First, an oligonucleotide-tethered T7 RNAP is engineered via expression of an N-terminally SNAP-tagged RNAP and subsequent chemical coupling of the SNAP-tag with a benzylguanine (BG)-modified oligonucleotide. Next, nucleic-acid strand displacement is used to program polymerase transcription on-demand. In addition, auxiliary nucleic acid assemblies can be used as "artificial transcription factors" to regulate the interactions between the DNA-programmed T7 RNAP with its DNA templates. This in vitro transcription regulatory mechanism can implement a variety of circuit behaviors such as digital logic, feedback, cascading, and multiplexing. The composability of this gene regulatory architecture facilitates design abstraction, standardization, and scaling. These features will enable the rapid prototyping of in vitro genetic devices for applications such as bio-sensing, disease detection, and data storage.
DNA computing uses a set of designed oligonucleotides as the medium for computation. These oligonucleotides are programmed with sequences to dynamically assemble according to user-specified logic and respond to specific nucleic-acid inputs. In proof-of-concept studies, the output of the computation typically consists of a set of fluorescently labelled oligonucleotides that can be detected via gel electrophoresis or fluorescence plate readers. Over the past 30 years, increasingly complex DNA computational circuitries have been demonstrated, such as various digital logic cascades, chemical reaction networks, and neural networks1,2,3. To assist with the preparation of these DNA circuits, mathematical models have been used to predict the functionality of synthetic gene circuits4,5, and computational tools have been developed for orthogonal DNA sequence design6,7,8,9,10. Compared to silicon-based computers, the advantages of DNA computers include their ability to interface directly with biomolecules, operate in solution in the absence of a power supply, as well as their overall compactness and stability. With the advent of next-generation sequencing, the cost of synthesizing DNA computers has been decreasing for the past two decades at a rate faster than Moore's Law11. Applications of such DNA-based computers are now beginning to emerge, such as for disease diagnosis12,13, for powering molecular biophysics14, and as data storage platforms15.
Figure 1: Mechanism of toehold-mediated DNA strand displacement. The toehold, δ, is a free, unbound sequence on a partial duplex. When a complementary domain (δ*) is introduced on a second strand, the free δ domain serves as a toehold for hybridization, allowing for the rest of the strand (ɑ*) to slowly displace its competitor through a zipping/unzipping reversible reaction known as strand migration. As the length of δ increases, the ΔG for the forward reaction decreases, and displacement happens more readily. Please click here to view a larger version of this figure.
To-date, the majority of DNA computers utilize a well-established motif in the field of dynamic DNA nanotechnology known as toehold-mediated DNA strand displacement (TMDSD, Figure 1)16. This motif consists of a partially double-stranded DNA (dsDNA) duplex displaying short "toehold" overhangs (i.e., 7- to 10 nucleotides (nt)). Nucleic acid "input" strands can interact with the partial duplexes through the toehold. This leads to the displacement of one of the strands from the partial duplex, and this liberated strand can then serve as input for downstream partial duplexes. Thus, TMDSD enables signal cascading and information processing. In principle, orthogonal TMDSD motifs can operate independently in solution, enabling parallel information processing. There have been a number of variations on the TMDSD reaction, such as toehold-mediated DNA strand exchange (TMDSE)17, "leakless" toeholds with double-long domains18, sequence-mismatched toeholds19, and "handhold"-mediated strand displacement20. These innovative design principles allow more finely tuned TMDSD energetics and dynamics for improving DNA computing performance.
Synthetic gene circuits, such as transcriptional gene circuits, are also capable of computation21,22,23. These circuits are regulated by protein transcription factors, which activate or repress transcription of a gene by binding to specific regulatory DNA elements. Compared to DNA-based circuits, transcriptional circuits have several advantages. First, enzymatic transcription has a much higher turnover rate than existing catalytic DNA circuits, thus generating more copies of output per single copy of input and providing a more efficient means of signal amplification. In addition, transcriptional circuits can produce different functional molecules, such as aptamers or messenger RNA (mRNA) encoding for therapeutic proteins, as computation outputs, which can be exploited for different applications. However, a major limitation of current transcriptional circuits is their lack of scalability. This is because there is a very limited set of orthogonal protein-based transcription factors, and de novo design of new protein transcription factors remains technically challenging and time-consuming.
Figure 2: Abstraction and mechanism of "tether" and "cage" polymerase complex. (A and B) An oligonucleotide tether is enzymatically labelled to a T7 polymerase through the SNAP-tag reaction. A cage consisting of a "faux" T7 promoter with a tether-complement overhang allows it to hybridize to the tether and block transcriptional activity. (C) When the operator (a*b*) is present, it binds to the toehold on the oligonucleotide tether (ab) and displaces the b* region of the cage, allowing transcription to occur. This figure has been modified from Chou and Shih27. Abbreviations: RNAP = RNA polymerase. Please click here to view a larger version of this figure.
This paper introduces a novel building block for molecular computing that combines the functionalities of transcriptional circuits with the scalability of DNA-based circuits. This building block is a T7 RNAP covalently attached with a single-stranded DNA tether (Figure 2A). To synthesize this DNA-tethered T7 RNAP, the polymerase was fused to an N-terminal SNAP-tag24 and recombinantly expressed in Escherichia coli. The SNAP-tag was then reacted with an oligonucleotide functionalized with the BG substrate. The oligonucleotide tether allows the positioning of molecular guests in close proximity to the polymerase via DNA hybridization. One such guest was a competitive transcriptional blocker referred to as a "cage", which consists of a "faux" T7 promoter DNA duplex with no gene downstream (Figure 2B). When bound to the RNAP via its oligonucleotide tether, the cage stalls polymerase activity by outcompeting other DNA templates for RNAP binding, rendering the RNAP in an "OFF" state (Figure 2C).
To activate the polymerase to an "ON" state, T7 DNA templates with single-stranded "operator" domains upstream of the T7 promoter of the gene were designed. The operator domain (i.e., domain a*b* Figure 2C) can be designed to displace the cage from the RNAP via TMDSD and position the RNAP proximal to the T7 promoter of the gene, thus initiating transcription. Alternatively, DNA templates were also designed where the operator sequence was complementary to auxiliary nucleic-acid strands that are referred to as "artificial transcription factors" (i.e., TFA and TFB strands in Figure 3A). When both strands are introduced into the reaction, they will assemble at the operator site, creating a new pseudo-contiguous domain a*b*. This domain can then displace the cage via TMDSD to initiate transcription (Figure 3B). These strands can be supplied either exogenously or produced.
Figure 3: Selective programming of polymerase activity through a three-component switch activator. (A) When the transcription factors (TFA and TFB) are present, they bind to the operator domain upstream of the promoter, forming a pseudo single-stranded sequence (a*b*) capable of displacing the cage through toehold mediated DNA displacement. (B) This a*b* domain can displace the cage via TMDSD to initiate transcription. This figure has been modified from Chou and Shih27. Abbreviations: TF = transcription factor; RNAP = RNA polymerase; TMDSD = toehold-mediated DNA strand displacement. Please click here to view a larger version of this figure.
The use of nucleic acid-based transcription factors for in vitro transcriptional regulation allows the scalable implementation of sophisticated circuit behaviors such as digital logic, feedback, and signal cascading. For example, one can build logic gate cascades by designing nucleic acid sequences such that the transcripts from an upstream gene activate a downstream gene. One application that exploits the cascading and multiplexing made capable by this proposed technology is the development of more sophisticated molecular computing circuitries for portable diagnostics and molecular data processing. In addition, integrating the molecular computing and de novo RNA synthesis capabilities can enable new applications. For example, a molecular circuit can be designed to detect one or a combination of user-defined RNAs as inputs and output therapeutic RNAs or mRNAs encoding functional peptides or proteins for point-of-care medical applications.
1. Buffer preparation
NOTE: Protein purification buffer preparation can occur on any day; here, it was done prior to beginning the experiments.
2. Overnight culture growth: Day 1
Figure 4: Plasmid map for SNAP T7 RNAP. The plasmid encodes a T7 RNAP containing an N-terminal histidine tag (6x His) and SNAP-tag domain (SNAP T7 RNAP) under a lac repressor (lacI) on a pQE-80L backbone. Other features include kanamycin resistance (KanR) and chloramphenicol resistance (CmR) genes. Abbreviation: RNAP = RNA polymerase. Please click here to view a larger version of this figure.
NOTE: The plasmid encodes a T7 RNAP containing an N-terminal histidine tag and a SNAP-tag domain (SNAP T7 RNAP), as well as a kanamycin resistance gene under a pQE-80L backbone (Figure 4)25.
3. Cell growth and induction: Day 2
4. Cell lysis, protein purification: Day 3
5. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) analysis of protein product: Day 3
6. Functional verification of SNAP T7 RNAP via in vitro transcription
NOTE: This protocol uses DNA template, which encodes for the fluorescent Broccoli RNA aptamer and allows the use of fluorescence to monitor the kinetics of transcription on a fluorescence plate reader.
7. Preparation of BG-modified oligonucleotides: Day 1
8. Ethanol/acetone precipitation of BG-oligonucleotide conjugate: Day 2
9. BG-oligonucleotide cleanup via gel filtration chromatography
10. Denaturing PAGE analysis of BG-oligonucleotide conjugate
11. Conjugation of oligonucleotide to SNAP T7 RNAP and PAGE analysis
12. Purification of oligonucleotide-tethered SNAP-T7 using ion exchange columns
13. Demonstration of on-demand control of tethered RNA polymerase activity
Figure 5: SDS-PAGE analysis of SNAP T7 RNAP expression and in vitro transcription assay. (A) SNAP T7 RNAP protein purification analysis, SNAP T7 RNAP molecular weight: 119.4kDa. FT = flow-through from the column, W1 = elution fractions of wash buffer containing impurities, E1-3 = elution fractions containing purified product, and DE = 10...
This study demonstrates a DNA nanotechnology-inspired approach to control the activity of T7 RNA polymerase by covalently coupling an N-terminally SNAP-tagged recombinant T7 RNAP with a BG-functionalized oligonucleotide, which was subsequently used to program TMDSD reactions. By design, the SNAP-tag was positioned at the N-terminus of the polymerase, as the C-terminus of wild-type T7 RNAP is buried within the protein structure core and makes important contacts with the DNA template28. Prior attemp...
There are no competing financial interests to declare by any of the authors.
L.Y.T.C acknowledges generous support from the New Frontiers in Research Fund-Exploration (NFRF-E), the Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grant, and the University of Toronto's Medicine by Design Initiative, which receives funding from the Canada First Research Excellence Fund (CFREF).
Name | Company | Catalog Number | Comments |
0.5% polysorbate 20 (TWEEN 20) | BioShop | TWN510.5 | |
0.5M ethylenediaminetetraacetic acid (EDTA) | Bio Basic | SD8135 | |
10 mM sodium phosphate buffer (pH 7) | Bio Basic | PD0435 | Tablets used to make 10 mM buffer |
10% ammonium persulfate (APS) | Sigma Aldrich | A3678-100G | |
100 kDa Amicon Ultra-15 Centrifugal Filter Unit | Fisher Scientific | UFC910008 | |
100% acetone | Fisher Chemical | A18P4 | |
100% ethanol (EtOH) | House Brand | 39752-P016-EAAN | |
10x in vitro transcription (IVT) buffer | New England Biolabs | B9012 | |
10x Tris-Borate-EDTA (TBE) buffer | Bio Basic | A0026 | |
1M Isopropyl β- d-1-thiogalactopyranoside (IPTG) | Sigma Aldrich | I5502-1G | |
1M sodium bicarbonate buffer | Sigma Aldrich | S6014-500G | |
1M Tris(hydroxymethyl)aminomethane (Tris) | Sigma Aldrich | 648311-1KG | |
1X Tris-EDTA (TE) buffer | ThermoFisher | 12090015 | |
2M imidazole | Sigma Aldrich | 56750-100G | |
2-mercaptoethanol (BME) | Sigma Aldrich | M3148 | |
3M sodium acetate | Bio Basic | SRB1611 | |
40% acrylamide (19:1) | Bio Basic | A00062 | |
4x LDS protein sample loading buffer | Fisher Scientific | NP0007 | |
5M sodium chloride (NaCl) | Bio Basic | DB0483 | |
5mM dithiothreitol (DTT) | Sigma Aldrich | 43815-1G | |
6x gel loading dye | New England Biolabs | B7024S | |
agarose B powder | Bio Basic | AB0014 | |
BG-GLA-NHS | New England Biolabs | S9151S | |
BL21 competent E. coli | Addgene | C2530H | |
BLUeye prestained protein ladder | FroggaBio | PM007-0500 | |
bromophenol blue | Bio Basic | BDB0001 | |
coomassie blue (SimplyBlue SafeStain) | ThermoFisher | LC6060 | |
cyanine dye (SYBR Gold nucleic acid gel stain) | Fisher Scientific | S11494 | |
cyanine dye (SYBR Safe nucleic acid gel stain) | Fisher Scientific | S33102 | |
dry dimethyl sulfoxide (DMSO) | Fisher Scientific | D12345 | |
formamide | Sigma Aldrich | F9037-100ML | |
glycerol | Bio Basic | GB0232 | |
kanamycin sulfate | BioShop | KAN201.5 | |
lysogeny broth | Sigma Aldrich | L2542-500ML | |
malachite green oxalate | Sigma Aldrich | 2437-29-8 | |
N,N,N'N'-Tetramethylethane-1,2-diamine (TEMED) | Sigma Aldrich | T9281-25ML | |
NuPAGE MES SDS running buffer (20x) | Fisher Scientific | LSNP0002 | |
NuPAGE Novex 4-12% Bis-Tris gel 1.0 mm 12-well | Life Technologies | NP0322BOX | |
oligonucleotide (cage antisense) | IDT | N/A | TATAGTGAGTCGTATTAATTTG |
oligonucleotide (cage sense) | IDT | N/A | TCAGTCACCTATCTGTTTCAAA TTAATACGACTCACTATA |
oligonucleotide (malachite green aptamer antisense) | IDT | N/A | GGATCCATTCGTTACCTGGCT CTCGCCAGTCGGGATCCTATA GTGAGTCGTATTACAGTTCCAT TATCGCCGTAGTTGGTGTACT |
oligonucleotide (malachite green aptamer sense) | IDT | N/A | TAATACGACTCACTATAGGATC CCGACTGGCGAGAGCCAGGT AACGAATGGATCC |
oligonucleotide (Transcription Factor A) | IDT | N/A | AGTACACCAACTACGAGTGAG |
oligonucleotide (Transcription Factor B) | IDT | N/A | TCAGTCACCTATCTGGCGATAA TGGAACTG |
oligonucleotide with 3’ Amine modification (tether) | IDT | N/A | GCTACTCACTCAGATAGGTGAC TGA/3AmMO/ |
Pierce strong ion exchange spin columns | Fisher Scientific | 90008 | |
plasmid encoding SNAP T7 RNAP and kanamycin resistance genes | Genscript | N/A | custom gene insert |
protein purification column (HisPur Ni-NTA spin column) | Fisher Scientific | 88226 | |
rNTP mix | New England Biolabs | N0466S | |
Roche mini quick DNA spin column | Sigma Aldrich | 11814419001 | |
Triton X-100 | Sigma Aldrich | T8787-100ML | |
Ultra Low Range DNA ladder | Fisher Scientific | 10597012 | |
urea | BioShop | URE001.1 |
Request permission to reuse the text or figures of this JoVE article
Request PermissionThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. All rights reserved