Skip to main content

Utilization of the N-Gram Modeling to Construct Phylogenetic Tree for Proteins

Buy Article:

$107.14 + tax (Refund Policy)

With the availability of ever-increasing gene and protein sequence data across a large number of species, reconstruction of phylogenetic trees to reveal the evolutionary relationship among those species becomes more and more important. In this paper, N-gram modeling and the Bhattacharyya distance are employed to analyze the phylogeny of protein sequences. The phylogenetic trees on the two data sets have illustrated that the proposed approach performs equally well as the other methods do and is more efficient than some of the methods. So our method may be used to complement phylogenetic analysis.

Keywords: BHATTACHARYYA DISTANCE; MAXIMUM LIKELIHOOD ESTIMATION; N-GRAMS; PHYLOGENETIC TREE; PROTEIN PRIMARY STRUCTURES; SIMILARITY; WORD FREQUENCY

Document Type: Research Article

Publication date: 01 November 2011

More about this publication?
  • Journal of Computational and Theoretical Nanoscience is an international peer-reviewed journal with a wide-ranging coverage, consolidates research activities in all aspects of computational and theoretical nanoscience into a single reference source. This journal offers scientists and engineers peer-reviewed research papers in all aspects of computational and theoretical nanoscience and nanotechnology in chemistry, physics, materials science, engineering and biology to publish original full papers and timely state-of-the-art reviews and short communications encompassing the fundamental and applied research.
  • Editorial Board
  • Information for Authors
  • Submit a Paper
  • Subscribe to this Title
  • Terms & Conditions
  • Ingenta Connect is not responsible for the content or availability of external websites
  • Access Key
  • Free content
  • Partial Free content
  • New content
  • Open access content
  • Partial Open access content
  • Subscribed content
  • Partial Subscribed content
  • Free trial content