Mining sequential patterns from data streams: a centroid approach

Authors: Marascu, Alice; Masseglia, Florent

Source: Journal of Intelligent Information Systems, Volume 27, Number 3, November 2006, pp. 291-307(17)

Publisher: Springer

Abstract:

In recent years, emerging applications have introduced new constraints for data mining methods. These constraints are typical of a new kind of data: data streams. In data stream processing, memory usage is restricted, new elements are generated continuously and have to be processed in linear time, no blocking operator can be used, and the data can be examined only once. At this time, only a few methods have been proposed for mining sequential patterns in data streams. We argue that the main reason is the combinatorial nature of sequential pattern mining. In this paper, we propose an algorithm based on sequence alignment for mining approximate sequential patterns in Web usage data streams. To meet the one-scan constraint, a greedy clustering algorithm combined with an alignment method is proposed. We show that our proposal is able to extract relevant sequences with very low thresholds.
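
To make the one-scan idea concrete, here is a minimal sketch of greedy clustering over a stream of navigation sequences. It is not the authors' algorithm: the abstract does not specify the similarity measure or how centroids are maintained, so this sketch assumes a similarity based on the longest common subsequence (LCS) and simply keeps the first sequence of each cluster as its centroid. The names `lcs_length`, `similarity`, `greedy_cluster` and the `threshold` parameter are all hypothetical.

```python
from typing import Iterable, List, Sequence


def lcs_length(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest common subsequence of a and b (dynamic programming)."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Assumed measure: LCS length divided by the longer sequence's length."""
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))


def greedy_cluster(stream: Iterable[Sequence[str]],
                   threshold: float = 0.5) -> List[List[Sequence[str]]]:
    """Single pass: each sequence is seen once and placed immediately.

    A sequence joins the cluster whose centroid is most similar, provided
    the similarity reaches `threshold`; otherwise it starts a new cluster.
    Assumption: the centroid is the cluster's first sequence and is never
    updated (the paper instead relies on an alignment method).
    """
    centroids: List[Sequence[str]] = []
    clusters: List[List[Sequence[str]]] = []
    for seq in stream:
        best, best_sim = -1, 0.0
        for i, centroid in enumerate(centroids):
            s = similarity(seq, centroid)
            if s > best_sim:
                best, best_sim = i, s
        if best >= 0 and best_sim >= threshold:
            clusters[best].append(seq)
        else:
            centroids.append(seq)
            clusters.append([seq])
    return clusters
```

Because each sequence is compared only against the current centroids and never revisited, the stream is scanned once and memory grows with the number of clusters rather than with the number of sequences seen (the `clusters` lists are kept here only so the result can be inspected).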

Keywords: Data streams; Sequential patterns; Web usage mining; Clustering; Sequences alignment

Document Type: Research article

DOI: http://dx.doi.org/10.1007/s10844-006-9954-6

Affiliations: 1: Email: amarascu@sophia.inria.fr

Publication date: 2006-11-01
