Exact Distribution of the Distances between Any Occurrences of a Set of Words

Authors: Robin S.; Daudin J.-J.

Source: Annals of the Institute of Statistical Mathematics, Volume 53, Number 4, 2001 , pp. 895-905(11)

Publisher: Springer

Buy & download fulltext article:

OR

Price: $47.00 plus tax (Refund Policy)

Abstract:

The distribution of the distance between two (or more) successive occurrences of a specific word in a random sequence of letters is known under different models. In this paper, a more general problem is studied: the distribution of the distance between two (or more) successive occurrences of any word of a given set under a Markov model for the sequence. The generating function and a recurrence for obtaining the probabilities are given. These results are applied to study the distribution of the "CHI" motif in the genome sequence of Haemophilus influenzae.

Keywords: Distance between occurrences; genome sequence analysis; semi Markov process

Language: English

Document Type: Regular paper

Affiliations: 1: INA-PG — INRA, 16 rue Claude Bernard, 75231 Paris, France

Publication date: 2001-01-01

Related content

Key

Free Content
Free content
New Content
New content
Open Access Content
Open access content
Subscribed Content
Subscribed content
Free Trial Content
Free trial content

Text size:

A | A | A | A
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages. print icon Print this page