About SMILES database

alevil · September 6, 2023, 7:06am

Hi,
How can you make sure that the SMILES database used is complete? How can you account for a specific family of drugs of which there is only a small number of them in the way larger dataset? Thank you

tlhr · September 6, 2023, 8:34am

REINVENT is trained on a very large set of drug-like molecules, but this mostly has the effect of the model learning the SMILES grammar – it can in fact generate molecules that are very different from anything seen in the training set. So if you want the model to generate only molecules with certain properties (e.g. a family with a common scaffold), you can use appropriate scores to guide the generation, such as some measure of similarity or common substructure.

Topic		Replies	Views
Convert small-molecule SMILES string to pdb file for docking HADDOCK	1	6312	June 26, 2018
Ligand conformer generator	7	27	January 6, 2026
PRODRG> Atom is bonded to multiple atoms HADDOCK	5	173	July 19, 2024
About pmx: Automated protein structure and topology generation for alchemical perturbations pmx	0	1621	June 10, 2016
Anywhere can generate a proper ligand PDB file? ChemDraw file is not working, and rewrite with pymol also not working HADDOCK support	3	84	March 7, 2025

About SMILES database

Related topics