A New Method Based on Entropy Theory for Genomic Sequence Analysis
Source: Acta Biotheoretica, Volume 50, Number 3, 2002 , pp. 155-165(11)
We have refined entropy theory to explore the meaning of the increasing sequence data on nucleic acids and proteins more conveniently. The concept of selection constraint was not introduced, only the analyzed sequences themselves were considered. The refined theory serves as a basis for deriving a method to analyze non-coding regions (NCRs) as well as coding regions. Positions with maximal entropy might play the most important role in genome functions as opposed to positions with minimal entropy. This method was tested in the well-characterized coding regions of 12 strains of Classical Swine Fever Virus (CSFV) and non-coding regions of 20 strains of CSFV. It is suitable to analyze nucleic acid sequences of a complete genome and to detect sensitive positions for mutagenesis. As such, the method serves to formulate the basis for elucidating the functional mechanism.
Document Type: Research article
Affiliations: 1: Institute of Virology, College of Life Sciences, Wuhan University, Wuhan, 430072, China 2: Department of Physics, Wuhan University, Wuhan, 430072, China. E-mail: email@example.com or firstname.lastname@example.org
Publication date: 2002-01-01