Diversity in logarithmic opinion pools

Authors: Smith, Andrew; Osborne, Miles

Source: Lingvisticae Investigationes, Volume 30, Number 1, 2007 , pp. 27-47(21)

Publisher: John Benjamins Publishing Company

Buy & download fulltext article:

OR

Price: $37.41 plus tax (Refund Policy)

Abstract:

Conditional random fields are state-of-the-art models for sequencing tasks such as named entity recognition. However, being globally conditioned, they have a tendency to overfit to a greater extent than other sequencing models. We introduce an approach to combat this overfitting called a logarithmic opinion pool (LOP). A LOP consists of a weighted combination of constituent models. We present the theory behind LOPs, and show that effective LOPs require constituent models that are diverse from one another. We examine different ways to introduce such diversity, including an approach that involves training the constituent models together, interactively. Our results show that, as expected from the underlying theory, explicitly optimising for constituent model diversity can improve performance over standard approaches to regularisation.

Keywords: NAMED ENTITY RECOGNITION; LOGARITHMIC OPINION POOLS; CONDITIONAL RANDOM FIELDS

Document Type: Research article

DOI: http://dx.doi.org/10.1075/li.30.1.04smi

Publication date: 2007-01-01

Related content

Tools

Key

Free Content
Free content
New Content
New content
Open Access Content
Open access content
Subscribed Content
Subscribed content
Free Trial Content
Free trial content

Text size:

A | A | A | A
Share this item with others: These icons link to social bookmarking sites where readers can share and discover new web pages. print icon Print this page