Exact Distribution of the Distances between Any Occurrences of a Set of Words
Authors: Robin S.; Daudin J.-J.
Source: Annals of the Institute of Statistical Mathematics, Volume 53, Number 4, 2001 , pp. 895-905(11)
Publisher: Springer
Abstract:
The distribution of the distance between two (or more) successive occurrences of a specific word in a random sequence of letters is known under different models. In this paper, a more general problem is studied: the distribution of the distance between two (or more) successive occurrences of any word of a given set under a Markov model for the sequence. The generating function and a recurrence for obtaining the probabilities are given. These results are applied to study the distribution of the "CHI" motif in the genome sequence of Haemophilus influenzae.
Keywords: Distance between occurrences; genome sequence analysis; semi Markov process
Language: English
Document Type: Regular paper
Affiliations: 1: INA-PG INRA, 16 rue Claude Bernard, 75231 Paris, France
Publication date: 2001-01-01
- In this: publication
- By this: publisher
- In this Subject: Mathematics and Statistics
- By this author: Robin S. ; Daudin J.-J.

Shopping cart
Receive new issue alert