I am trying to specify an area for HADDOCK, however, I’m not sure the shape restrained model is suitable as currently, there is no inform of how the ligand would bind. I would like to specify the area on the protein that the ligand can potential bind to. How should I go about doing that?
Thank you for getting back to me. Stating all the residues in the area as active residues would affect the scoring of the binding site, no? In my case, I would like to specify an area (which contains 285 resides) to be the ‘searched area’. Would setting all 285 residues not affecting the scoring of the binding site?
That’s a rather large area - make sure to only specify the solvent accessible ones.
And define those as passive and the ligand as active. In that way the ligand has the freedom to explore the entire region.
You will also have to increase the sampling, e.g. to 10000 for it0 and 400 for it1 and water
I have submitted and gotten results for my calculations. For some reason, the ligand is interacting with residues that are not specified as active. May I know why that’s the case?
HADDOCK will define restraints from those to the ligand (and by default 50% will be randomly removed). But again this does not prevent the ligand to interact with other residues. The question is does it contact some of the active residues you defined?
In this run you did not define any active residues as I can see from the json parameter file.
You used instead the random AIR option ( “ranair”: true, in the json file).
This can not be combined with defining active residues.
You must have followed the wrong section of the tutorial when setting up your run. In case you have a binding site start at the following section:
And you will need to generate the restraints file to give to HADDOCK as explained in the tutorial.
Also another comment: You seem to be using an AlphaFold2 model which contains a lot of “spaghetti” regions, i.e. regions for no structural predictions could be made. Best to remove those for docking - it is waste of computational resources to include those.
Indeed a good approach, but I would try to keep at least a domain structures. i.e. not start cutting halfway in a domain.
To give one example, if you are interested in looking at Sars-Cov2 spike protein - ACE2 interaction, there is in principle no need to use the full spike trimer. The RBD is enough (provided you have some knowledge of the binding site). The situation would be different if you were to do ab-initio docking.