I performed protein-protein docking with three sets of active residues. While analyzing the results of protein-protein docking, I am confused in choosing the best structure. Should I choose the structure from the cluster with low Z-score or should I choose a structure that gives the least RMSD which has a poor Z-score?
My aim is to identify the optimal set of active residues that can help reproduce the crystal structure.
RMSDs have no predictive values - they only tell where a cluster is with respect to the best model generated by HADDOCK
As we always indicate in all papers, it is the HADDOCK score which should be leading.
The z-score is only a statistical value related to a particular run.
And if you want to compare three runs with different definitions of active residues, it might be best to then remove the restraint energy term from the scores (subtract 10% of the restraint energy from the HADDOCK score).
thank you for your reply. But if I have to compare the resulted structures with an existing crystal structure, can I still use Haddock score to pick the best model?
In a case where you would not know the answer this would be the strategy.
Although we always advise to look at more than one solution and try to validate those possibly with e.g. known mutations or any other info you might have (if not used to guide the docking).
But if you have a crystal structure you can evaluate all clusters