On the use of automatically acquired examples for all-nouns Word Sense Disambiguation
AuthorMartinez, D; de Lacalle, OL; Agirre, E
Source TitleJOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH
PublisherAI ACCESS FOUNDATION
University of Melbourne Author/sMartinez, David
AffiliationComputer Science and Software Engineering
Document TypeJournal Article
CitationsMartinez, D., de Lacalle, O. L. & Agirre, E. (2008). On the use of automatically acquired examples for all-nouns Word Sense Disambiguation. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 33 (1), pp.79-107. https://doi.org/10.1613/jair.2395.
Access StatusThis item is currently not available from this repository
<jats:p>This article focuses on Word Sense Disambiguation (WSD), which is a Natural Language Processing task that is thought to be important for many Language Technology applications, such as Information Retrieval, Information Extraction, or Machine Translation. One of the main issues preventing the deployment of WSD technology is the lack of training examples for Machine Learning systems, also known as the Knowledge Acquisition Bottleneck. A method which has been shown to work for small samples of words is the automatic acquisition of examples. We have previously shown that one of the most promising example acquisition methods scales up and produces a freely available database of 150 million examples from Web snippets for all polysemous nouns in WordNet. This paper focuses on the issues that arise when using those examples, all alone or in addition to manually tagged examples, to train a supervised WSD system for all nouns. The extensive evaluation on both lexical-sample and all-words Senseval benchmarks shows that we are able to improve over commonly used baselines and to achieve top-rank performance. The good use of the prior distributions from the senses proved to be a crucial factor.</jats:p>
KeywordsArtificial Intelligence and Image Processing
- Click on "Export Reference in RIS Format" and choose "open with... Endnote".
- Click on "Export Reference in RIS Format". Login to Refworks, go to References => Import References