Biochemical characterization of type I-E anti-CRISPR proteins, AcrIE2 and AcrIE4

In bacteria and archaea, CRISPRs and Cas proteins constitute an adaptive immune system against invading foreign genetic materials, such as bacteriophages and plasmids. To counteract CRISPR-mediated immunity, bacteriophages encode anti-CRISPR (Acr) proteins that neutralize the host CRISPR–Cas systems. Several Acr proteins that act against type I-E CRISPR–Cas systems have been identified. Here, we describe the biochemical characterization of two type I-E Acr proteins, AcrIE2 and AcrIE4. We determined the crystal structure of AcrIE2 using single-wavelength anomalous diffraction and performed a structural comparison with the previously reported AcrIE2 structures solved by different techniques. Binding assays with type I-E Cas proteins were carried out for the target identification of AcrIE2. We also analyzed the interaction between AcrIE4 and its target Cas component using biochemical methods. Our findings corroborate and expand the knowledge on type I-E Acr proteins, illuminating diverse molecular mechanisms of inhibiting CRISPR-mediated prokaryotic anti-phage defense.

CRISPR-mediated immunity functions through three distinct stages of anti-phage defense [14,15].First, Cas proteins form an integrase complex that cleaves and inserts the invading DNA fragments into CRISPR loci as new spacers.Then, the acquired DNA sequences are transcribed as a long precursor CRISPR RNA, which is further processed into mature CRISPR RNAs (crRNAs) containing a single spacer unit.Finally, the crRNAs assemble with Cas protein(s) to form RNA-guided interference complexes for degrading target sequences in reinvading foreign nucleic acids.
CRISPR-Cas systems have been identified in ~ 40% of bacterial genomes and ~ 90% of archaeal genomes and can be classified into two major groups and six types [11][12][13].In class 1 (types I, III and IV) systems, multiple Cas proteins participate in the formation of the interference complex, whereas class 2 (types II, V and VI) systems use a single multi-domain Cas protein such as Cas9 or Cas12 [11][12][13]16].The type I CRISPR-Cas systems are broadly distributed in various prokaryotic genomes and can be divided into several subtypes depending on their signature Cas components [11,13].In type I-E CRISPR-Cas systems, which is one of the most extensively studied subtypes, five Cas proteins (Cas5e, Cas6e, Cas7e, Cas8e, and Cas11) associate with crRNAs to form a CRISPR-associated complex for antiviral defense (Cascade) that recognizes target DNA sequences and directs their degradation by Cas3 nucleases [7,17].
To counteract CRISPR-mediated immunity, bacteriophages encode anti-CRISPR (Acr) proteins that neutralize the host CRISPR-Cas systems [18].Several Acr proteins that act against type I-E CRISPR-Cas systems have been found [19][20][21][22][23]. AcrIE2 and AcrIE4 were among those discovered in Pseudomonas phages [19].The structures of AcrIE2 have been determined previously using NMR spectroscopy [24] and X-ray crystallography with ab initio phasing [25].Co-purification experiments by Mejdani et al. demonstrated that AcrIE2 interacts with Cascade and not with Cas3 [24].Nevertheless, in their transcriptional reporter assays, the presence of AcrIE2 did not prevent DNA binding of Cascade [24], suggesting a unique inhibition strategy of AcrIE2.To our knowledge, AcrIE4 has not been characterized biochemically.However, it is homologous to the N-terminal domain (NTD) of a fusion Acr protein, AcrIE4-F7, of which we have previously investigated the structure and function [26].AcrIE4-F7 is encoded by a mobile genetic element in Pseudomonas citronellolis [23].
In this study, we describe the biochemical characterization of two type I-E Acr proteins, AcrIE2 and AcrIE4.We determined the crystal structure of AcrIE2 using an experimental phasing technique, and performed in vitro assays to test its binding to individual Cas components comprising the type I-E Cascade.We also analyzed the interaction between AcrIE4 and its target Cas protein using multiple biochemical methods.These results corroborate and expand the knowledge on Acr inhibitors of type I-E CRISPR-Cas systems, highlighting diverse mechanisms for inactivating CRISPR-mediated bacterial anti-phage defense.

Crystallization and structure determination of AcrIE2
AcrIE2 was crystallized at 20 °C by the sitting-drop vapor diffusion method from 2.3 mM protein solution in buffer [150 mM NaCl, 5% (w/v) glycerol, 2 mM DTT, 20 mM HEPES pH 7.0] mixed with an equal amount of reservoir solution [17% (w/v) polyethylene glycol (PEG) 8000, 200 mM NaCl, 100 mM Na 2 HPO 4 -citric acid pH 3.8].To solve the phase problem using single-wavelength anomalous diffraction, selenomethionyl AcrIE2 was expressed in E. coli BL21(DE3) cells grown in M9 medium supplemented with selenomethionine, as described previously [27].The selenomethionyl AcrIE2 protein was purified as described above for native AcrIE2.The selenomethionyl crystals were obtained at 20 °C by the hanging-drop vapor diffusion method from 2.3 mM protein solution in buffer [150 mM NaCl, 5% (w/v) glycerol, 2 mM DTT, 20 mM HEPES pH 7.0] mixed with an equal amount of reservoir solution [17% (w/v) PEG 8000, 200 mM NaCl, 100 mM Na 2 HPO 4 -citric acid pH 4.2].The native and selenomethionyl crystals were flash-frozen in liquid nitrogen with additional 12% (w/v) PEG 8000 and 6% (w/v) glycerol as cryoprotecting reagents in the reservoir solution.Diffraction data were collected at beamline 7 A of the Pohang Accelerator Laboratory at 100 K. Diffraction images were processed using HKL2000 [28].The determinations of selenium positions, density modification, and initial model building for the selenomethionyl AcrIE2 structure were performed using PHENIX [29].The initial model of the selenomethionyl AcrIE2 was used for phasing the native AcrIE2 structure in PHASER [30].The final structure was completed using alternate cycles of manual fitting in COOT [31] and refinement in PHENIX [29].The stereochemical quality of the final model was assessed using MolProbity [32].

Cloning, expression, and purification of Cas proteins
The genes of type I-E Cas proteins were amplified by polymerase chain reaction from the genomic DNAs of Pseudomonas aeruginosa PRD-10 and E. coli DH5α.They were cloned into pET28b with an N-terminal (His) 6 -maltose binding protein (MBP) tag.E. coli BL21(DE3) cells containing these constructs were grown in LB medium at 37 °C until the optical density at 600 nm reached 0.6.Expression and purification of the type I-E Cas proteins were performed as described above for the Acr proteins except that a HiLoad 16/60 Superdex 200 column (GE Healthcare) was used during SEC.

Analytical size-exclusion chromatography
Analytical SEC experiments for testing AcrIE2 binding to Cascade subunits were performed using a Superdex 200 10/300 GL column (GE Healthcare) equilibrated with buffer [150 mM NaCl, 2 mM DTT, 20 mM tris(hydroxymethyl)aminomethane-HCl pH 7.5].Proteins (20 µM each) were mixed in the buffer and incubated at 4 °C for 1 h.The samples (700 µL) were then loaded onto the column at a flow rate of 0.5 mL/min.SEC runs for individual proteins were also performed as control experiments in the same manner.Elution fractions were analyzed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and visualized by Coomassie staining.Uncropped gel images are shown in Additional file 1: Figure S1.Analytical SEC runs for testing the interactions between AcrIE4 and Cas8e subunits were performed as described above for AcrIE2 except for using different buffer [150 mM NaCl, 5% (w/v) glycerol, 2 mM DTT, 20 mM HEPES pH 7.0].

Isothermal titration calorimetry (ITC)
ITC was performed at 25 °C using the MicroCal iTC200 calorimeter (Malvern).The (His) 6 -MBP-tagged Cas8e of , where I i (h) is the intensity of an individual measurement of the reflection and < I(h) > is the mean intensity of the reflection.
where F obs and F calc are the observed and calculated structure factor amplitudes, respectively d R free was calculated as R cryst using ~ 5% of the randomly selected unique reflections that were omitted from structure refinement

Native Selenomethionyl
Space group P2

Crystal structure of AcrIE2
The crystal structure of AcrIE2 was determined to a resolution of 1.35 Å using single-wavelength anomalous diffraction of the selenomethionyl protein.tag were not modeled in the final structure due to insufficient electron density.
Our crystal structure of AcrIE2 contains two α helices (α1 and α2) and five β strands (β1 to β5) comprising a single antiparallel β sheet (Fig. 1).The shorter α1 helix (residues 8-13) is located in the segment connecting the β1 and β2 strands.The longer α2 helix (residues 25-39) is positioned on the concave side of the fivestranded antiparallel β sheet, contacting residues in all five β strands.Topology and fold of our structure are essentially identical to those of the two previously determined structures (Fig. 2A) [24,25].The root mean square deviation (RMSD) values of the 80 Cα atomic positions among the three AcrIE2 structures ranged from 0.4 to 3.0 Å.Our structure is superposed better with the previous crystal structure solved by ab initio phasing than the NMR structure, despite the differences in crystallization condition and phasing method.The RMSD between the two crystal structures was only 0.4 Å for 80 Cα atoms.
Despite the high overall structural similarity, local conformational differences were noted between the structures.The most significant discrepancy was observed in the loop connecting the β4 and β5 strands (residues 65-71) (Fig. 2A).The two crystal structures had relatively high crystallographic B-factor values for this segment (Fig. 2B).Residues in the loop region also had relatively high average RMSD values of NMR structure ensembles (Fig. 2B).These observations indicate the highly dynamic nature of this region in the AcrIE2 structures.Thus, the detected structural heterogeneity most likely results from local intrinsic flexibility, which is revealed as different conformations in distinct environments, such as in crystal lattice and bulk solution.The conformational heterogeneity between the two crystal structures is also likely caused by different intermolecular contacts in the crystal lattice.They belong to the same space group (P2 1 2 1 2 1 ), and the unit cell parameters (a = 26.96Å, b = 47.44 Å, c = 56.11Å, α = β = γ = 90°) of the crystal structure solved by ab initio phasing are very similar to those of our structure (Table 1).However, surface residues involved in crystal contacts are not identical between the two crystal structures (Additional file 1: Table S1).Protein interfaces between AcrIE2 molecules in crystal lattice are also different (Additional file 1: Table S2).Overall, our crystal structure of AcrIE2 contains features similar to those of the two previously reported structures, with minor local conformational heterogeneity despite the technical differences in structure determination.

Test of AcrIE2 binding to Cas components in the type I-E Cascade
Contrary to the type I-F system, the recombinant type I-E Cascade of P. aeruginosa is difficult to prepare due to its poor expression and solubility [24,33].To identify Cas protein(s) that can bind to AcrIE2 in vivo, Mejdani et al. expressed (His) 6 -tagged AcrIE2 in P. aeruginosa and used Ni-affinity chromatography to detect co-purifying proteins [24].In this experiment, Cas7e was among the most confidently detected proteins, and other type I-E Cascade subunits were also observed with lower confidence, indicating that AcrIE2 interacts with the Cascade mainly via its Cas7e subunit [24].To confirm this finding in vitro, we tested whether AcrIE2 interacts with separately purified P. aeruginosa Cascade components in analytical SEC.Due to the poor solubility of the individual P. aeruginosa Cascade subunits [24,33], we used N-terminal (His) 6 -MBP tags to stabilize them in our experimental setting.
In type I-E CRISPR-Cas systems, five Cas species assemble to form the Cascade with a stoichiometry of Cas8e 1 :Cas11 2 :Cas7e 6 :Cas5e 1 :Cas6e 1 (Fig. 3A and B) [15,16].Thus, we performed multiple SEC runs, in which AcrIE2 and each of the five (His) 6 -MBP-tagged Cascade subunits (Cas8e, Cas11, Cas7e, Cas5e, and Cas6e) were incubated together and injected into the column.Unexpectedly, AcrIE2 did not interact with any of the P. aeruginosa Cascade subunits in our SEC experiments, as the elution volumes of AcrIE2 were identical regardless of whether the Cascade components were incubated together or not (Fig. 3C).This was surprising since AcrIE2 interacted with the Cascade in the previous study by Mejdani et al. [24].Moreover, our approach using (His) 6 -MBP-tagged Cascade subunits was successful for target identification of other type I-E Acr inhibitors, such as AcrIE4-F7 [26] and AcrIE4 (see below).
Nonetheless, we suspect that our inability to identify a target Cascade component of AcrIE2 may be due to several potential limitations of our experimental design.
First, the N-terminal solubility-enhancing tag may hinder the interaction with AcrIE2.The (His) 6 -MBP-tag, which is ~ 42 kDa, could be large enough to occlude a potential binding interface if it is adjacent to the N-terminus.Second, Acr binding to Cascade may require multiple subunits.Mejdani et al. suggested that AcrIE2 may have more than one functionally important surface [24].This suggests that AcrIE2 simultaneously interacts with multiple Cascade subunits located in proximity.Last, the solubility-enhancing effect of the (His) 6 -MBP tag may not be strong enough to fold all of the tested individual Cascade components correctly.We previously identified Cas8e as a single target Cascade subunit for AcrIE4-F7 using the same experimental strategy [26], implying the proper folding of the (His) 6 -MBP-tagged Cas8e.However, the solubility-enhancing effect of the (His) 6 -MBP tag may not be guaranteed for other Cascade subunits, such as Cas7e.In the co-purification analyses by Mejdani et al., AcrIE2 interacted mainly with Cas7e [24].Thus, the more precise Cascade binding mechanism and key interacting Cas residues for AcrIE2 inhibition remain to be determined.

AcrIE4 interacts withP. aeruginosa Cas8e
AcrIE4 was discovered in Pseudomonas phage D3112 [19].It contains only 52 amino acid residues and is the smallest (6.0 kDa) among the known type I Acr inhibitors [34].The fusion Acr inhibitor, AcrIE4-F7, was identified in the mobile genetic element in P. citronellolis [23], and the NTD of AcrIE4-F7 (AcrIE4-F7 NTD ) shares high sequence similarity (~ 69%) with AcrIE4 (Fig. 4A).This fusion Acr protein has been investigated structurally and functionally [26].However, to our knowledge, biochemical characterization of the original AcrIE4 has not been reported.To this end, we purified recombinant AcrIE4 protein and performed binding assays with a potential Cas target component.Since we previously identified Cas8e as the binding partner for AcrIE4-F7 NTD [26], we tested the interaction between AcrIE4 and Cas8e.Due to the poor solubility of P. aeruginosa Cas8e [24], we used the N-terminal (His) 6 -MBP-tagged version for our experiments.
In our analytical SEC experiments (Fig. 4B), AcrIE4 coeluted with the (His) 6 -MBP-tagged Cas8e of P. aeruginosa when incubated together before injection.The elution volume of the complex was significantly smaller than that of AcrIE4 alone, indicating that AcrIE4 interacted with P. aeruginosa Cas8e.In the quantitative analysis using ITC (Fig. 4D), AcrIE4 bound tightly to (His) 6 -MBP-tagged Cas8e of P. aeruginosa with submicromolar affinity.The equilibrium dissociation constant (K D ) was calculated to be ~ 178 nM, which is comparable with the K D of ~ 140 nM determined for AcrIE4-F7 NTD in our previous study [26].Together, these results demonstrate that AcrIE4 binds to the Cas8e subunit of the P. aeruginosa type I-E Cascade.
We expect that the binding mode of AcrIE4 to the Cas8e subunit is similar to that of AcrIE4-F7 NTD .In our additional SEC analyses (Fig. 4C), AcrIE4 did not interact with E. coli Cas8e, as observed previously for AcrIE4-F7 NTD [26].The divergence between P. aeruginosa and E. coli Cas8e proteins has been discussed previously [19,26], and the preferential Acr binding to the P. aeruginosa homologue was attributed to the difference in the protospacer adjacent motif (PAM) recognition site of Cas8e [26].The PAM interaction surface in P. aeruginosa Cas8e includes several non-conserved positively charged residues [26].The negatively charged (pI ~ 4.2) AcrIE4-F7 NTD supposedly acts as a target DNA mimic, interacting with the positively charged PAM-binding residues in P. aeruginosa Cas8e [26].AcrIE4 also has a low pI value (~ 3.7), and the negatively charged key interacting residues of AcrIE4-F7 NTD , including Glu19 and Asp22 [26], are also conserved in AcrIE4 (Fig. 4A).Thus, AcrIE4 also likely targets the PAM recognition site in Cas8e in a similar manner to AcrIE4-F7 NTD .

P
. aeruginosa (20 µM) in a 200-µL sample cell was titrated with 19 consecutive 2-µL injections of AcrIE4 (150 µM) in buffer [150 mM NaCl, 5% (w/v) glycerol, 1 mM tris(2carboxyethyl) phosphine, 20 mM HEPES pH 7.0].Origin software (OriginLab) was used to process and analyze the ITC titration data.The integrated heats were leastsquared best-fit using a simple one-site binding model, and the errors were obtained from the fitting.

Fig. 1
Fig. 1 Structure of AcrIE2.A Schematic representation of secondary structures in AcrIE2.Its amino acid sequence is shown and numbered below.B Crystal structure of AcrIE2 solved by single-wavelength anomalous diffraction.The structure is shown in rainbow format from the N-terminus (blue) to the C-terminus (red).N-and C-termini and secondary structures are also indicated.C Electrostatic potential surface (red = − 5.0 kT, blue = + 5.0 kT) of AcrIE2 in the same orientation as shown in B. The structures were displayed using the PyMOL software (The PyMOL Molecular Graphics System, Version 2.0 Schrödinger, LLC)

Fig. 2 Fig. 3
Fig. 2 Comparison of AcrIE2 structures determined by different techniques.A Cα-trace superposition of three AcrIE2 structures.Our crystal structure (PDB ID: 8HEK) is superimposed with one of the low-energy structures determined by NMR (PDB ID: 7KIX) and the previous crystal structure solved by ab initio phasing (PDB ID: 7CHQ).The three structures align well except for the loop connecting β4 and β5.Orientation of the superimposed structures is approximately identical to that shown in Fig. 1B.B Residual flexibility analyses of AcrIE2 structures.Average crystallographic B-factors of main chain atoms and average RMSD values of Cα atoms from NMR structure ensembles are shown as a function of residue number.Secondary structures of our AcrIE2 structure are also indicated

Fig. 4
Fig. 4 AcrIE4 targets Cas8e in the type I-E CRISPR-Cas system.A Sequence alignment of AcrIE4 and AcrIE4-F7 NTD .Conserved negatively charged residues, which were important for the interaction between AcrIE4-F7 NTD and Cas8e in our previous study [26], are indicated with blue asterisks.B, C Analytical SEC experiments for interactions between AcrIE4 and Cas8e homologues.AcrIE4 co-eluted with P. aeruginosa Cas8e (B), but not with E. coli Cas8e (C).The elution fractions were analyzed by SDS-PAGE.Uncropped gel images are shown in Additional file 1: Figure S1.D ITC analysis of AcrIE4 binding to (His) 6 -MBP-tagged Cas8e of P. aeruginosa.The experimentally determined K D value is indicated with the fitting errors.Pa and Ec indicate P. aeruginosa and E. coli, respectively

Table 1
Data collection, phasing, and refinement statistics of AcrIE2 a Values in parentheses are for the highest-resolution shell

Table 1
6ummarizes the data collection, phasing, and refinement statistics.AcrIE2 was crystallized in space group P2 1 2 1 2 1 with a single polypeptide chain and 118 water molecules per asymmetric unit.Residues in the C-terminal (His)6