This paper compares 22 different similarity coefficients when they are used for searching databases of 2D fragment bit-strings. Experiments with the National Cancer Institute's AIDS and IDAlert databases show that the coefficients fall into several well-marked clusters, in which the members of a cluster will produce comparable rankings of a set of molecules. These clusters provide a basis for selecting combinations of coefficients for use in data fusion experiments. The results of these experiments provide a simple way of increasing the effectiveness of fragment-based similarity searching systems.
Combinatorial Chemistry & High Throughput Screening publishes full length original research articles and reviews describing various topics in combinatorial chemistry (e.g. small molecules, peptide, nucleic acid or phage display libraries) and/or high throughput screening (e.g. developmental, practical or theoretical). Ancillary subjects of key importance, such as robotics and informatics, will also be covered by the journal. In these respective subject areas, Combinatorial Chemistry & High Throughput Screening is intended to function as the most comprehensive and up-to-date medium available. The journal should be of value to individuals engaged in the process of drug discoveryand development, in the settings of industry, academia or government.