Phylogenetic Clustering of Protein Sequences Using Recurrence Quantification Analysis
Molecular phylogeny analysis (MPA) investigates differences in molecular sequences to analyse the evolutionary relationships of organisms or their bio-macromolecules. Currently there are two categories of methods for MPA viz. distance based and character based. All the methods require
multiple sequence alignment (MSA) as a prerequisite to MPA and are usually followed by bootstrap analysis. MSA is efficient in terms of time, computations and memory requirement only when size and number of sequences are small. MSA of whole proteomes is computationally intensive and time consuming.
In this paper, an attempt is made to propose a new alignment free approach of MPA using Recurrence Quantification Analysis (RQA). The protein sequence is converted into a numeric sequence by assigning a unique score in the form of real numbers to each amino acid and various RQA features are
extracted from the numeric sequence. These features are used as co-ordinates for calculating the distance between the sequences using an appropriate distance function. The distance matrix, thus obtained is used for clustering using Neighbour-Joining method. Requirement of time for clustering
and tree construction is observed to be significantly reduced in comparison with the alignment-based algorithms. As an example, a test case involving an application of this method for clustering of 59 polyprotein sequences of family Flaviviridae is demonstrated and phylogenetic tree
thus obtained was found to be fairly accurate with 95% accuracy. Only 3 misclassifications out of 59 were observed. Thus, the proposed method has potential to be a reasonable alternative for existing MPA methods.
Document Type: Research Article
Publication date: May 1, 2013
- ADVANCED SCIENCE LETTERS is an international peer-reviewed journal with a very wide-ranging coverage, consolidates research activities in all areas of (1) Physical Sciences, (2) Biological Sciences, (3) Mathematical Sciences, (4) Engineering, (5) Computer and Information Sciences, and (6) Geosciences to publish original short communications, full research papers and timely brief (mini) reviews with authors photo and biography encompassing the basic and applied research and current developments in educational aspects of these scientific areas.
- Editorial Board
- Information for Authors
- Subscribe to this Title
- Ingenta Connect is not responsible for the content or availability of external websites
- Access Key
- Free content
- Partial Free content
- New content
- Open access content
- Partial Open access content
- Subscribed content
- Partial Subscribed content
- Free trial content