Interpreting a large, highly converged cluster in AIR-guided docking

henryhardigan · November 13, 2025, 1:19pm

I am docking two protein fragments that I suspect interact via short linear motifs. I used default docking parameters and two models taken from reviewed UniProtKB entries. For this run I defined AIRs on a 12-residue stretch on each protein that I believe represents the interacting motifs.

Runs with smaller and larger AIR definitions converge to the same surface, but only this AIR set produces such a large cluster.

Is this result capable of arising from incorrectly placed AIRs, or is this generally taken as evidence for a well-defined docking solution in HADDOCK?

amjjbonvin · November 13, 2025, 4:12pm

It is nice in general if the top cluster is highly populated, but does not have to be case.

When you increase of decrease your number of restraints, considering 50% is randomly deleted for each model generate, you can expect differences in the number and size of the clusters generated.

But if all top ranked clusters are rather consistent in their binding mode, it is a good indication that your runs do converge, irrespective of the exact definition of AIRs.

Topic		Replies	Views
Haddock generatong only one cluster HADDOCK	6	63	January 15, 2026
Only 24% of models clustered HADDOCK	4	411	January 18, 2021
Clustering only generates a single cluster? HADDOCK	0	189	May 5, 2023
Random AIR exclusions HADDOCK	6	703	June 21, 2020
100% of generated clusters clustered	1	328	May 6, 2022

Interpreting a large, highly converged cluster in AIR-guided docking

Related topics