Small Molecule Subgraph Detector (SMSD) toolkit

Syed Asad Rahman, Matthew Bashton, Gemma L Holliday, Rainer Schrader, Janet M Thornton

Research output: Contribution to journalArticlepeer-review

107 Citations (Scopus)
20 Downloads (Pure)


BACKGROUND: Finding one small molecule (query) in a large target library is a challenging task in computational chemistry. Although several heuristic approaches are available using fragment-based chemical similarity searches, they fail to identify exact atom-bond equivalence between the query and target molecules and thus cannot be applied to complex chemical similarity searches, such as searching a complete or partial metabolic pathway.In this paper we present a new Maximum Common Subgraph (MCS) tool: SMSD (Small Molecule Subgraph Detector) to overcome the issues with current heuristic approaches to small molecule similarity searches. The MCS search implemented in SMSD incorporates chemical knowledge (atom type match with bond sensitive and insensitive information) while searching molecular similarity. We also propose a novel method by which solutions obtained by each MCS run can be ranked using chemical filters such as stereochemistry, bond energy, etc.

RESULTS: In order to benchmark and test the tool, we performed a 50,000 pair-wise comparison between KEGG ligands and PDB HET Group atoms. In both cases the SMSD was shown to be more efficient than the widely used MCS module implemented in the Chemistry Development Kit (CDK) in generating MCS solutions from our test cases.

CONCLUSION: Presently this tool can be applied to various areas of bioinformatics and chemo-informatics for finding exhaustive MCS matches. For example, it can be used to analyse metabolic networks by mapping the atoms between reactants and products involved in reactions. It can also be used to detect the MCS/substructure searches in small molecules reported by metabolome experiments, as well as in the screening of drug-like compounds with similar substructures.Thus, we present a robust tool that can be used for multiple applications, including the discovery of new drug molecules. This tool is freely available on

Original languageEnglish
Article number12
Number of pages13
JournalJournal of Cheminformatics
Publication statusPublished - 10 Aug 2009
Externally publishedYes


Dive into the research topics of 'Small Molecule Subgraph Detector (SMSD) toolkit'. Together they form a unique fingerprint.

Cite this