Protein Bioinformatics and Mixtures of Bivariate von Mises Distributions for Angular Data


A fundamental problem in bioinformatics is to characterize the secondary structure of a protein, which has traditionally been carried out by examining a scatterplot (Ramachandran plot) of the conformational angles. We examine two natural bivariate von Mises distributions—referred to as Sine and Cosine models—which have five parameters and, for concentrated data, tend to a bivariate normal distribution. These are analyzed and their main properties derived. Conditions on the parameters are established which result in bimodal behavior for the joint density and the marginal distribution, and we note an interesting situation in which the joint density is bimodal but the marginal distributions are unimodal. We carry out comparisons of the two models, and it is seen that the Cosine model may be preferred. Mixture distributions of the Cosine model are fitted to two representative protein datasets using the expectation maximization algorithm, which results in an objective partition of the scatterplot into a number of components. Our results are consistent with empirical observations; new insights are discussed.
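The abstract does not state the densities themselves, so the following is only a minimal sketch under an assumed parameterization: the forms commonly given in the directional-statistics literature for the Sine and Cosine models, each with two mean angles, two concentrations, and one interaction parameter (five parameters in total). The function names, argument names, and the grid-based normalizer are illustrative choices, not the authors' implementation.

```python
import numpy as np

def cosine_log_kernel(phi, psi, mu, nu, k1, k2, k3):
    """Unnormalized log-density of the Cosine model (assumed form)."""
    return (k1 * np.cos(phi - mu) + k2 * np.cos(psi - nu)
            - k3 * np.cos(phi - mu - psi + nu))

def sine_log_kernel(phi, psi, mu, nu, k1, k2, lam):
    """Unnormalized log-density of the Sine model (assumed form)."""
    return (k1 * np.cos(phi - mu) + k2 * np.cos(psi - nu)
            + lam * np.sin(phi - mu) * np.sin(psi - nu))

def grid_normalizer(log_kernel, params, n=200):
    """Approximate the normalizing constant by a Riemann sum over the torus.

    This numerical shortcut is an assumption of the sketch; it avoids
    relying on any closed-form or series expression for the constant.
    """
    grid = np.linspace(-np.pi, np.pi, n, endpoint=False)
    phi, psi = np.meshgrid(grid, grid, indexing="ij")
    cell_area = (2.0 * np.pi / n) ** 2
    return np.exp(log_kernel(phi, psi, *params)).sum() * cell_area

def density(log_kernel, phi, psi, params, n=200):
    """Evaluate the normalized density at the angles (phi, psi), in radians."""
    return np.exp(log_kernel(phi, psi, *params)) / grid_normalizer(log_kernel, params, n)
```

With the interaction parameter set to zero, both kernels reduce to a product of two independent univariate von Mises densities, which gives a simple sanity check on the normalizer. A mixture fitted by expectation maximization would weight several such component densities, each with its own parameter tuple.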

Keywords: Bivariate angular data; Bivariate circular mixture; Directional statistics; Distribution on torus; Myoglobin; Protein conformational angles; Ramachandran plots

Document Type: Research Article


Affiliations: 1: School of Mathematics, University of Leeds, Leeds, LS2 9JT, U.K. 2: AT&T Labs Research, Florham Park, New Jersey 07932, U.S.A.

Publication date: June 1, 2007

