Hello,
I want to calculate the relative binding free energy difference of a complex, with alchemical transformation using the double system/single box approach. I followed the tutorial on the pmx page (Mutation free energy calculations) but this covers a mutation in a single protein complex.
I am interested to see the free energy difference of a complex where the ligand has an arginine to alanine (R2A) mutation. Based on the previous tutorial and the Patel et al. (2021) paper (https://pubs.acs.org/doi/10.1021/acs.jctc.0c01045) I did the following:
- Hybrid topology generation with the pmx server. Then, I build the boxes manually containing the receptor/wild-type ligand complex and the unbound mutated ligand, and vice-versa.
- 100 ns equilibrium run. Then I discarded the first 20 ns and selected 100 snapshots from the rest of the run for the transition simulations.
- For each snapshot, 500 ps forward and backward runs were performed changing the lambda from 0 → 1 and from 1 → 0 respectively.
- Finally, I calculated the DeltaG with
pmx analyse
using thedhdl.xvg
files for each snapshot run.
The analysis output is the following:
========================================================
ANALYSIS
========================================================
Number of forward (0->1) trajectories: 100
Number of reverse (1->0) trajectories: 100
Temperature : 300.00 K
--------------------------------------------------------
Crooks Gaussian Intersection
--------------------------------------------------------
CGI: Forward Gauss mean = 1396.12 kJ/mol std = 11.06 kJ/mol
CGI: Reverse Gauss mean = 913.69 kJ/mol std = 30.94 kJ/mol
CGI: dG = 1268.34 kJ/mol
CGI: Std Err (bootstrap:parametric) = 8.11 kJ/mol
CGI: Std Err (bootstrap) = 12.14 kJ/mol
Forward: gaussian quality = 0.42
---> KS-Test Ok
Reverse: gaussian quality = 0.81
---> KS-Test Ok
--------------------------------------------------------
Bennett Acceptance Ratio
--------------------------------------------------------
BAR: dG = 1176.18 kJ/mol
BAR: Std Err (analytical) = 17594679082837450.00 kJ/mol
BAR: Std Err (bootstrap) = 6.18 kJ/mol
BAR: Conv = 1.00
BAR: Conv Std Err (bootstrap) = 0.00
--------------------------------------------------------
Jarzynski estimator
--------------------------------------------------------
JARZ: dG Forward = 1371.44 kJ/mol
JARZ: dG Reverse = 980.92 kJ/mol
JARZ: dG Mean = 1176.18 kJ/mol
JARZ: Std Err Forward (bootstrap) = 3.28 kJ/mol
JARZ: Std Err Reverse (bootstrap) = 11.18 kJ/mol
JARZ_Gauss: dG Forward = 1371.34 kJ/mol
JARZ_Gauss: dG Reverse = 1107.57 kJ/mol
JARZ_Gauss: dG Mean = 1239.45 kJ/mol
JARZ_Gauss: Std Err (analytical) Forward = 3.69 kJ/mol
JARZ_Gauss: Std Err (analytical) Reverse = 27.73 kJ/mol
JARZ_Gauss: Std Err Forward (bootstrap) = 4.61 kJ/mol
JARZ_Gauss: Std Err Reverse (bootstrap) = 31.12 kJ/mol
========================================================
The value I got is 1176.18 ± 6.12 kJ/mol (281.11 ± 1.46 kcal/mol) which is much greater than expected. I am repeating the transition runs using 10 ns.
Attached are the plots and histograms of the W per snapshot (sorry, the legend covers the forward data).
The W histograms are very separated and do not intersect. Is this a sign of poor convergence or rather an issue with the system setup?
Should I take something else into account? Any suggestion will be very welcomed.
Best,
Yasser