A Bayesian Framework for XML Information Retrieval: Searching and Learning with the INEX Collection

Authors: Piwowarski, Benjamin1; Gallinari, Patrick2

Source: Information Retrieval, Volume 8, Number 4, December 2005 , pp. 655-681(27)

Publisher: Springer

Key:
Free Content - Free Content
New Content - New Content
Subscribed Content - Subscribed Content
Free Trial Content - Free Trial Content

Abstract:

Most recent document standards like XML rely on structured representations. On the other hand, current information retrieval systems have been developed for flat document representations and cannot be easily extended to cope with more complex document types. The design of such systems is still an open problem. We present a new model for structured document retrieval which allows computing scores of document parts. This model is based on Bayesian networks whose conditional probabilities are learnt from a labelled collection of structured documents—which is composed of documents, queries and their associated assessments. Training these models is a complex machine learning task and is not standard. This is the focus of the paper: we propose here to train the structured Bayesian Network model using a cross-entropy training criterion. Results are presented on the INEX corpus of XML documents.

Keywords: Bayesian Networks; structured information retrieval; XML; machine learning for structured retrieval

Document Type: Research article

DOI: 10.1007/s10791-005-0751-6

Affiliations: 1: Center for Web Research, DCC, Universidad de Chile, Blanco Encalada 2120, Santiago, Chile, Email: bpiwowar@dcc.uchile.cl 2: LIP6, 8, rue du capitaine Scott, 75015, Paris, France, Email: gallinar@poleia.lip6.fr

The full text electronic article is available for purchase. You will be able to download the full text electronic article after payment.

$47.00 plus tax      Refund Policy

 

OR

Back to top

Key:
Free Content - Free Content
New Content - New Content
Subscribed Content - Subscribed Content
Free Trial Content - Free Trial Content
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages.
Page Help Click here for Page Help
Shopping cart
Tools
Sign in






Need to register?
Sign up here
Text size: A | A | A | A