Advanced Science Letters, 19(5), 1336-1339 (2013). DOI: 10.1166/asl.2013.4499

Phylogenetic Clustering of Protein Sequences Using Recurrence Quantification Analysis

A. Yadav, V. K. Jayaraman, M. Kale, U. Kulkarni-Kale

Molecular phylogeny analysis (MPA) examines differences among molecular sequences to infer the evolutionary relationships of organisms or their biomacromolecules. Current MPA methods fall into two categories: distance-based and character-based. Both require a multiple sequence alignment (MSA) as a prerequisite and are usually followed by bootstrap analysis. MSA is efficient in time, computation and memory only when the sequences are short and few in number; MSA of whole proteomes is computationally intensive and time-consuming. This paper proposes a new alignment-free approach to MPA based on Recurrence Quantification Analysis (RQA). Each protein sequence is converted into a numeric sequence by assigning a unique real-valued score to each amino acid, and various RQA features are extracted from the numeric sequence. These features serve as coordinates for calculating the distance between sequences with an appropriate distance function. The resulting distance matrix is used for clustering with the Neighbour-Joining method. The time required for clustering and tree construction is observed to be significantly lower than with alignment-based algorithms. As a test case, the method is applied to the clustering of 59 polyprotein sequences of the family Flaviviridae: the resulting phylogenetic tree has about 95% accuracy, with only 3 misclassifications out of 59. The proposed method therefore has the potential to be a reasonable alternative to existing MPA methods.
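
The pipeline described in the abstract (numeric encoding, RQA features, distance matrix) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say which amino-acid scores, RQA features, recurrence threshold or distance function were used. Here the score table SCORES is an arbitrary placeholder (alphabetical rank of the one-letter code), the features are recurrence rate, determinism and a normalised longest diagonal line computed without phase-space embedding, and the distance is Euclidean. All function names are hypothetical.

```python
import math

# Placeholder scores: alphabetical rank of each one-letter code.
# ASSUMPTION: the paper's actual real-valued scores are not given in the abstract.
SCORES = {aa: float(i) for i, aa in enumerate("ACDEFGHIKLMNPQRSTVWY")}


def encode(seq):
    """Map a protein sequence to a numeric series, skipping unknown symbols."""
    return [SCORES[aa] for aa in seq.upper() if aa in SCORES]


def rqa_features(x, eps=0.5, lmin=2):
    """Return (recurrence rate, determinism, longest diagonal / length).

    Points i and j recur when |x[i] - x[j]| <= eps. With the placeholder
    integer scores and eps=0.5, only identical residues recur.
    The recurrence matrix is symmetric, so only the diagonals above the
    main diagonal are scanned.
    """
    n = len(x)
    total = 0      # recurrence points above the main diagonal
    in_lines = 0   # points on diagonal lines of length >= lmin
    longest = 0    # longest diagonal line found
    for k in range(1, n):
        run = 0
        for i in range(n - k + 1):
            # The extra iteration (i == n - k) closes a run at the diagonal's end.
            hit = i < n - k and abs(x[i] - x[i + k]) <= eps
            if hit:
                run += 1
                continue
            total += run
            if run >= lmin:
                in_lines += run
            longest = max(longest, run)
            run = 0
    pairs = n * (n - 1) // 2
    rr = total / pairs if pairs else 0.0
    det = in_lines / total if total else 0.0
    lmax = longest / n if n else 0.0
    return (rr, det, lmax)


def distance_matrix(seqs, eps=0.5, lmin=2):
    """Euclidean distances between the RQA feature vectors of the sequences."""
    feats = [rqa_features(encode(s), eps, lmin) for s in seqs]
    return [[math.dist(a, b) for b in feats] for a in feats]
```

The matrix returned by distance_matrix would then be handed to a Neighbour-Joining implementation to build the tree. That step is omitted here to keep the sketch short.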



