Method Article
SILAC immunoprecipitation experiments represent a powerful means for discovering novel protein:protein interactions. By allowing the accurate relative quantification of protein abundance in both control and test samples, true interactions may be easily distinguished from experimental contaminants, and low affinity interactions preserved through use of less-stringent buffer conditions.
Quantitative proteomics combined with immuno-affinity purification, SILAC immunoprecipitation, represent a powerful means for the discovery of novel protein:protein interactions. By allowing the accurate relative quantification of protein abundance in both control and test samples, true interactions may be easily distinguished from experimental contaminants. Low affinity interactions can be preserved through the use of less-stringent buffer conditions and remain readily identifiable. This protocol discusses the labeling of tissue culture cells with stable isotope labeled amino acids, transfection and immunoprecipitation of an affinity tagged protein of interest, followed by the preparation for submission to a mass spectrometry facility. This protocol then discusses how to analyze and interpret the data returned from the mass spectrometer in order to identify cellular partners interacting with a protein of interest. As an example this technique is applied to identify proteins binding to the eukaryotic translation initiation factors: eIF4AI and eIF4AII.
An essential step in understanding protein function is identification of relevant interacting proteins. Where such proteins are unknown there are a number of techniques available, each with their own merits and drawbacks. These include the yeast two-hybrid system, pulldown assays using recombinant protein, as well as tandem affinity purification or TAP-tagging1,2.
A more recent addition to these techniques is the combination of affinity purification of a protein of interest from a relevant mammalian cell line, followed by quantitative mass spectrometry using stable isotope labeling of amino acids in cell culture (SILAC)3. This has advantages over the yeast two-hybrid approach in that cell localization and post-translational modifications are not perturbed, as well as advantages over traditional TAP-tagging in that it is a quantitative rather than qualitative approach allowing the user to readily distinguish non-specifically interacting proteins and contaminants, from host factors that bind specifically. Further, as a sample is typically analyzed whole, rather than as individual protein bands, proteins of interest are not masked by similarly migrating proteins on a gel, nor do they typically need to be present at sufficient levels to be visible after staining, leading to increased numbers of confidently identified proteins 4.
To demonstrate this technique, GFP fusions of the closely related eukaryotic translation initiation factor eIF4AI and eIF4AII that share over 90% amino acid identity were investigated by SILAC-immunoprecipitation quantitative proteomics. Human eIF4AI and II were cloned into pEGFP-C1 to form a fusion protein where GFP is fused to the N-terminus of eIF4A. To avoid the need for generating stable cell lines transient transfection was used to deliver these constructs to stable isotope labeled 293T cells.
Cells were first labeled for two weeks in SILAC cell culture media, followed by transfection of plasmid DNA encoding a protein of interest. Cells were then lysed, the protein concentration normalized, and equal amounts of lysate affinity purified on anti-GFP agarose. Equal amounts of eluate were then combined and submitted for LC-MS/MS analysis. The results of this analysis are then processed to identify high confidence protein:protein interactions (Figure 1).
SILAC immunoprecipitation enables the identification of not only direct interactions but also low affinity or indirect interactions with protein complexes4. Using this system, eIF4AI and II immunoprecipitations allowed reproducible and confident identification of the primary binding partner eIF4G (isoforms I/II and III)5, as well as indirect interactions with eIF4E, and numerous components of the eIF3 complex.
1. Generation and Passaging of SILAC-labeled Cell Lines
Note: the use of Trypsin-EDTA must be avoided at all stages of passaging and preparation of experimental samples for analysis as the trypsin may contain unlabeled amino acids, which would lead to incomplete labeling of samples.
2. Transfection of Labeled Cells with pEGFP-fusion Constructs
3. Harvesting Cell Lysates
4. Binding to Anti-GFP Beads
5. Washing, Elution, and Preparation of Samples for MS Analysis
6. Data Analysis I: Understanding the Results, and Removal of Low-confidence Identifications
Note: A list of the column headings returned by the Proteome Discoverer software is given in Table 1. Different software (e.g., MSQuant, MaxQuant) will return different headings, however, only a subset of these are required for analysis and these are common to the various software packages. Data should always include an accession number for each protein identified, ratio’s comparing each sample ratio (light vs medium, light vs heavy, medium vs heavy etc), the number of unique peptides identified, and some form of false positive rate or confidence indication.
7. Data Analysis II: Selecting High Confidence Interactions for Further Study
8. Data Analysis III: Merging Replica Datasets
Note: To be confident in identifying interaction partners a typical SILAC pulldown experiment should ideally be performed three times, with the medium and heavy labeled samples switched for one of the repeats to control for any effect of the media on the results. A protein that shows up as interacting in at least two of the three experiments, with the ‘switched’-media sample ideally representing one of these, is a high confidence interaction.
In a typical SILAC pulldown experiment, the vast majority of identified proteins (>90%) represent contaminants as well as proteins binding non-specifically to the affinity matrix and this is illustrated in Figure 2B, even when washing protocols remove a majority of cytoplasmic contaminants such as GAPDH (Figure 2A). However, the clustering of non-specifically binding proteins in a normal distribution allows proteins that specifically bind to a protein of interest to be distinguished as these have higher sample/mock ratios than the background. Whilst contaminants should theoretically cluster around a log2 SILAC ratio of 0, this is not necessarily the case, an example of this is given in Figure 3. Possible reasons for this include imperfect SILAC labeling of cells, loading unequal volumes or concentrations of lysate onto the anti-GFP beads, accidental loss of beads during purification or unequal mixing of samples at the end of the purification procedure3. However, assuming data are analyzed based on a threshold standard deviation from the mean of the normally distributed contaminants, minor shifts in the centering of the data should not affect the quality of the results.
When comparing differences in protein interactions between two related proteins, a similar situation may occur where one protein of interest is produced to higher levels within cells than a second (either due to variation in transfection efficiency, or an intrinsic property of the protein or mRNA). Some variation in expression (e.g., Figure 2A) can be corrected for by analyzing the SILAC ratio for these two samples. In this example these would be the GFP-eIF4AI and GFP-eIF4AII samples. By analyzing the 4AI/4AII SILAC ratio as discussed in section 7, it is possible to identify proteins whose binding varies significantly between isoforms.
In Figure 4, a representation of the eIF4A-binding proteins identified in one of the replica experiments conducted is shown illustrating the coverage of the initiation factor complex this protocol achieved. The highest ratios were typically observed with eIF4G, which binds directly to eIF4A, with lower ratios for eIF4E, which binds to eIF4G at a site away from the eIF4A-binding site. Lower ratios were observed for members of the eIF3 complex. However, this clearly illustrates that the experiment satisfactorily identified both direct and indirect binding partners of eIF4AI and II. As might be expected from their high sequence identity6, protein:protein interactions appeared largely conserved between the two isoforms in this experimental system7,8. A selection of some of the interacting proteins are given in Table 2, illustrating the data format.
Column Heading | Description |
Accession | Displays the accession number for the sequence |
Coverage | Proportion of the protein sequence covered by the identified peptides |
♯PSM | Peptide spectral match |
♯peptides | Total number of unique peptides identified for a protein |
♯AAs | The length of a protein in amino acids |
MW (Da) | The molecular weight of a protein in Daltons. Excludes modifications |
calc. PI | The theoretical isoelectric point of a protein |
Score | The total score of a protein (which represents the sum of the individual peptide scores). The exact score required for significance will vary between experiments. A MS facility will usually apply a 5% false discovery rate cutoff. |
Sequence | The sequence of amino acids constituting the protein |
Ratio | The relative intensity of peptides in a named labeled sample, compared to a second labeled sample |
Ratio Count | The number of peptide ratios that were used to calculate the a given protein ratio |
Ratio variability (%) | The variability of the individual peptide ratios used to calculate a given protein ratio |
Description | The name of the protein |
Table 1. Standard column headings from a Proteome Discoverer report. Whilst useful information can be gained from all of these columns, those critical for this analysis are shown in bold.
Accession | Peptides | 4AI/Mock | 4AII/Mock | Name | SILAC analysis |
A8K7F6 | 21 | 100 | 1 | eIF4AI | ‘Bait’ protein |
Q14240 | 22 | 0.01 | 90.855 | eIF4AII | |
G5E9S1 | 25 | 47.575 | 30.53 | eIF4GI | Interacting proteins |
Q59GJ0 | 5 | 11.778 | 10.619 | eIF4GII | |
P06730 | 3 | 7.22 | 7.57 | eIF4E | |
Q5T6W5 | 4 | 0.685 | 0.646 | hnRNPK | Non-specifically binding contaminants |
P62805 | 1 | 0.531 | 0.498 | Histone H4 | |
H6VRG2 | 18 | - | - | Keratin-1 | Environmental contaminants |
P35527 | 11 | 0.01 | 0.01 | Keratin-1 cytoskeletal 9 |
Table 2. Typical Data from a SILAC immunoprecipitation experiment. Giving example data for a protein of interest/bait (high peptides, high ratio), proteins interacting with a protein of interest (high/low peptides, high ratio), non-specifically binding proteins (high/low peptides, ratio falls below cutoff – in this experiment 0.96), and environmental contaminants (often high peptides, negative ratio/below threshold).
Figure 1. Experimental Plan. Firstly cells are grown in media lacking Arginine and Lysine and substituted with stable isotope labeled Arginine and Lysine for 2 weeks (1). (2) Cells are seeded into 10 cm2 dishes and transiently transfected with plasmids encoding GFP (Mock) or GFP fusion proteins (Samples). (3) Cells are lysed and GFP or GFP fusion proteins are immunoprecipitated from cell lysates. (4) Samples are combined in a 1:1 ratio and submitted for LC-MS/MS analysis. Data is then analyzed to remove low confidence protein identifications and to select a level of protein enrichment corresponding to genuine interacting proteins.
Figure 2. Confirming suitable immunoprecipitation conditions. A) Western blot analysis of cell lysates, as well as the unbound and bound fractions from the immunoprecipitation confirms expression and immunoprecipitation of the protein of interest. A western blot against GAPDH confirms depletion of non-interacting proteins and a further western blot a known interacting partner of eIF4A confirms the successful immunoprecipitation of proteins binding to the protein of interest. B) Where interacting partners of a protein of interest are not known, a silver-stained gel may confirm immunoprecipitation of interacting proteins. On this silver-stained gel bands for GFP and GFP-eIF4AI/II are clear, and a band migrating at the correct size for eIF4G is present only in the GFP-4AI/II-bound lanes and not in the GFP control lane.
Figure 3. Representative results. Histogram showing the distribution of protein ratios from one repeat of (A) GFP-eIF4AI or (B) GFP-eIF4AII pulldown. The 1.96 standard deviation cutoff is marked with a dashed line. Interacting proteins falling outside the normally-distributed contaminants are evident from ~0.25 and 1 in (A), and ~0.3-1.5 in (B).
Figure 4. Identification of eIF complex members from a single replica of a SILAC IP experiment. Proteins in green were the protein of interest used for the pulldown Interacting proteins are shaded from red to white according to log2 SILAC ratio in the SILAC IP with red being the most abundant protein in the analysis, and white being the 1.96 SD cutoff. Proteins shaded in grey were not identified in this analysis.
The SILAC pulldown strategy described here represents a very sensitive and powerful means of detecting novel protein:protein interactions, and furthermore allows the rapid and simple discrimination of altered binding patterns between closely related samples of interest. In this example this technique was used to investigate the protein:protein interactions of the eIF4AI and eIF4AII proteins6. To the author’s knowledge, this is the first study in the literature exploiting the utility of SILAC proteomics to investigate the cellular interactome of these two isoforms of eIF4A.
The approach as described above uses a GFP-tag and anti-GFP beads9,10 and therefore modifications may be required to enable this approach to be used for a specific protein of interest, for example whether the tag is placed at the N- or C-terminus of a protein. Where possible, western blots or functional assays should be performed to detect binding of a known protein interaction partner. Should a protein not tolerate fusion with a GFP tag, other tagging or pulldown strategies have been applied to SILAC pulldowns using both other tags (FLAG11, Biotin12, STREP (own data, unpublished)), or by using primary antibodies against a protein of interest where siRNA knockdown of the target protein provides a control sample13. Such experiments have been described elsewhere in the literature, but in brief, step 1 and steps 5.4-8 would be applied as above, with steps 2-5.3 modified as appropriate for the expression/pulldown system of choice using equal protein inputs as described in steps 2.4-2.5. As the quantification stages allow non-specific binding proteins to be removed at the analysis level, it is recommended to omit pre-incubation with control beads, or high salt washes in order to preserve low-affinity protein:protein interactions with a protein of interest. A nuclease may be included or omitted from this protocol according to the specifics of a particular experiment. For example: as the proteins used in this method are RNA helicases, RNase cocktail was included in the protocol to remove indirect interactions mediated via the RNA (step 3.2). In some cases however, there could be benefit from running parallel experiments with and without nuclease to identify nucleic acid dependent interactions.
Within this protocol, the switching of ‘medium’ and ‘heavy’ samples in replicate experiments is recommended to control for variation introduced by differences in the SILAC media or cell growth. An alternative control involves the sequential switching of all three (‘light’, ‘medium’, and ‘heavy’) media in replica experiments. Whilst this approach is potentially more stringent, it does increase the complexity of the analysis, as in at least one replicate, a protein of interest will be produced in ‘light’ labeled cells and so it is necessary to distinguish between proteins consistently identified in ‘light’ samples (environmental contaminants such as keratins), and those that are only enriched in ‘light’ samples when bound to a protein of interest.
Whilst the use of quantification data allows discrimination of specific- from non-specific interactions through use of a threshold, inevitably some genuine interactions may be discarded. The approach above is a simple and rapid approach to identifying protein:protein interactions that can be easily attempted by researchers with no previous experience with mass spectrometry, or analysis of large protein:protein interaction datasets. For most uses this is more than sufficient for identifying novel proteins of interest. Further modifications to this approach to help reduce this data loss are described elsewhere in the literature and include the use of a protein frequency library where known contaminant proteins for a specific set of experimental parameters (cell line, bead matrix, buffer conditions) may be excluded10,14. However, depending on the particular experimental parameters it may be necessary to run a number of control experiments to generate a bead proteome and this can therefore increase both the expense and complexity of the experiment. Further information on this technique is available from the www.peptracker.co.uk website14.
It should also be noted that the protocol described above involves mixing differently labeled samples at the end of the immunoprecipitation process (termed a Mixing After Purification – MAP SILAC experiment). This is done as protein:protein interactions occur at a given equilibrium15. It should be noted that other groups have combined this MAP SILAC approach described in this protocol with incubation of the samples prior to pulldown (Purification After Mixing – PAM SILAC) for different lengths of time (20 min to 2 hr have been used in the literature)15,16. Based on how rapidly a protein ratio drops towards 1:1, it is possible to qualitatively investigate binding affinities and to define proteins as stable or dynamic interacting proteins15.
In summary, SILAC pulldowns represent a very powerful means of identifying proteins interacting with a given protein of interest, in a physiologically relevant setting. The technique can be very easily adapted to a number of different purification strategies, allowing its application to any given protein of interest. Quantification of results vastly simplifies identification of genuine interactions, and permits relaxation of stringent buffer conditions used to remove non-specific binders, and thus preserves low affinity interactions. As up to three samples can be compared in the above strategy, the technique has clear strengths in comparing differences in protein binding between different protein isoforms, mutant proteins, or the effect of pharmacological inhibitors. As whole gel slices are analyzed rather than individual bands that stain by Coomassie, the numbers of proteins identified at high confidence are typically higher than those identified in a standard GST/TAP-pulldown, and experimenter bias in selecting proteins of interest is removed. The technique therefore compares very favorably with other commonly used techniques used in identification of novel protein-interactions (yeast 2-hybrid, GST/His or TAP Pulldowns).
The authors declare that they have no competing financial interests.
This work was supported by grants from the Wellcome Trust and BBSRC to IG. IG is a Wellcome Senior Fellow.
Name | Company | Catalog Number | Comments |
Dulbecco's Modified Eagle Medium (DMEM) | Dundee Cell Products | LM010 | DMEM lacking Arginine and Lysine. (and containing L-glutamine) |
Dialysed FBS (10 kDa cutoff) 500 ml | Dundee Cell Products | DS1003 | |
Arginine (R0) 25 g | Sigma-Aldrich | A8094 | Resuspend 0.5 g in 6 ml PBS, filter sterilize, and freeze in 0.5 ml aliquots (gives a 84 mg/ml stock) |
Arginine (R6) 0.5 g | Cambridge Isotope Labs | CLM-2265 | Resuspend 0.5 g in 6 ml PBS, filter sterilize, and freeze in 0.5 ml aliquots (gives a 84 mg/ml stock) |
Arginine (R10) 0.5 g | Cambridge Isotope Labs | CNLM-539 | Resuspend 0.5 g in 6 ml PBS, filter sterilize, and freeze in 0.5 ml aliquots (gives a 84 mg/ml stock) |
Lysine (K0) 25 g | Sigma-Aldrich | L8662 | Resuspend 0.5 g in 3.5 ml PBS, filter sterilize, and freeze in 0.5 ml aliquots (gives a 146 mg/ml stock) |
Lysine (K4) 0.5 g | Cambridge Isotope Labs | DLM-2640 | Resuspend 0.5 g in 3.5 ml PBS, filter sterilize, and freeze in 0.5 ml aliquots (gives a 146 mg/ml stock) |
Lysine (K8) 0.5 g | Cambridge Isotope Labs | CNLM-291 | Resuspend 0.5 g in 3.5 ml PBS, filter sterilize, and freeze in 0.5 ml aliquots (gives a 146 mg/ml stock) |
Penicillin/streptomycin, Liquid. 100 ml | Gibco | 15140-122 | |
Lipofectamine 2000 transfection reagent | Invitrogen | 11668-027 | |
GFP-trap Agarose (500 μl resin) | Chromotek | gta-20 | |
5x SDS Sample loading buffer | Fisher Scientific | PN39000 | The use of purchased rather than homemade sample buffer is recommended to minimize keratin contamination. |
Protease inhibitor cocktail set III | Calbiochem | 539134 | |
1.7 ml prelubricated tubes | Costar | 3207 | |
BCA protein assay kit (1 L) | Pierce | 23225 | |
Tris (Trizma) | Sigma-Aldrich | T1503 | |
Ethylenediaminetetraacetic acid (EDTA) | Sigma-Aldrich | E6758 | |
Sodium chloride | Sigma-Aldrich | S3014 | |
Rnase cocktail | Ambion | AM2286 | |
NP-40 alternative | Millipore | 492016 |
Request permission to reuse the text or figures of this JoVE article
Request PermissionThis article has been published
Video Coming Soon
Copyright © 2025 MyJoVE Corporation. All rights reserved