The Art of Data Mining the Minefields of Toxicity Databases to Link Chemistry to Biology

Authors: Yang, Chihae; Richard, Ann M.; Cross, Kevin P.

Source: Current Computer - Aided Drug Design, Volume 2, Number 2, June 2006 , pp. 135-150(16)

Publisher: Bentham Science Publishers

Buy & download fulltext article:

OR

Price: $62.88 plus tax (Refund Policy)

Abstract:

Toxicity databases have a special role in predictive toxicology, providing ready access to historical information throughout the workflow of discovery, development, and product safety processes in drug development as well as in review by regulatory agencies. To provide accurate information within a hypothesesbuilding environment, the content of the databases needs to be rigorously modeled using standards and controlled vocabulary. The utilitarian purposes of databases widely vary, ranging from a source for (Q)SAR datasets for modelers to a basis for "read-across" for regulators. Many tasks involved in the use of databases are closely tied to data mining, hence database and data mining are essential technology pairs. To understand chemically-induced toxicity, chemical structures must be integrated into the toxicity databases. Data mining these "structure-integrated toxicity databases" requires techniques for handling both chemical structures and textual toxicity information. Structure data mining is similar with some modifications to that conventionally employed for large chemical databases, while data mining of toxicity endpoints is not well developed. This review presents a general strategy to data mine structure-integrated toxicity databases to link chemical structures to biological endpoints. Iterative probing of the chemical domain with toxicity endpoint descriptors and the biological domain with chemical descriptors enables linking of the two domains. Data mining steps to elucidate the hidden relationships between the target organs and chemical classes are presented as an example. Work is in progress in the public domain toward the linking of chemistry to biology by providing databases that can be mined.

Keywords: Bioinformatics; chemoinformatics; database; data mining; informatics; linking chemistry to biology; predictive toxicology; QSAR; toxicity

Document Type: Research article

Affiliations: 1: Leadscope Inc., 1393 Dublin Road, Columbus, OH 43235, USA.

Publication date: 2006-06-01

More about this publication?
  • Current Computer-Aided Drug Design aims to publish all the latest developments in drug design based on computational techniques. The field of computer-aided drug design has had extensive impact in the area of drug design. Current Computer-Aided Drug Design is an essential journal for all medicinal chemists who wish to be kept informed and up-to-date with all the latest and important developments in computer-aided methodologies and their applications in drug discovery. Each issue contains a series of timely, in-depth reviews written by leaders in the field, covering a range of computational techniques for drug design, screening, ADME studies, etc., providing excellent rationales for drug development.
Related content

Tools

Key

Free Content
Free content
New Content
New content
Open Access Content
Open access content
Subscribed Content
Subscribed content
Free Trial Content
Free trial content

Text size:

A | A | A | A
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages. print icon Print this page