Drexel University Home Pagewww.drexel.edu DREXEL UNIVERSITY LIBRARIES HOMEPAGE >>
iDEA DREXEL ARCHIVES >>

iDEA: Drexel E-repository and Archives > Drexel Academic Community > College of Information Science and Technology > Faculty Research and Publications (IST) > Using term sense to improve language modeling approach to genomic IR

Please use this identifier to cite or link to this item: http://hdl.handle.net/1860/921

Title: Using term sense to improve language modeling approach to genomic IR
Authors: Zhou, Xiaohua
Zhang, Xiaodan
Hu, Xiaohua
Issue Date: Apr-2006
Citation: Proceedings of the 28th European Conference on IR Research, ECIR 2006, London, UK, April 10-12, 2006. Lecture Notes in Computer Science 3936 (http://springerlink.metapress.com/link.asp?id=105633). Retrieved 6/26/2006 from http://www.ischool.drexel.edu/faculty/thu/My%20Publication/Conference-papers/ECIR06.pdf.
Abstract: Genomic IR, characterized by its highly specific information need, severe synonym and polysemy problem, long term name and rapid growing lit-erature size, is challenging IR community. In this paper, we are focused on ad-dressing the synonym and polysemy issue under the language modeling frame-work. Unlike the ways translation model and traditional query expansion tech-niques approach to this issue, we incorporate term sense into the basic language model, a more fundamental approach to the synonym and polysemy issue in IR. The sense approach not only maintains the simplicity of language models, but also makes the document ranking efficient and effective. A comparative ex-periment on the TREC 2004 Genomic Track data shows significant improve-ment of retrieval performance after incorporating the term sense into a basic language model. The MAP (mean average precision) is significantly raised from 29.17% (the baseline system) to 36.94%. The performance of the sense approach is also significantly superior to the mean (21.72%) of official runs participated in TREC 2004 Genomic Track and is comparable to the best work (40.75%) of the track. Most runs in the track extensively use various query ex-pansion and pseudo relevance feedback techniques while our approach does nothing except the incorporation of term sense, which evidences the view that semantic smoothing, i.e. the incorporation of synonym and sense information into the language models, is a more standard approach to achieving the effects traditional query expansion and pseudo-relevance feedback techniques are working for.
URI: http://hdl.handle.net/1860/921
Appears in Collections:Faculty Research and Publications (IST)

Files in This Item:

File Description SizeFormat
2006150024.pdf305.85 kBAdobe PDFView/Open
View Statistics

Items in iDEA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! iDEA Software Copyright © 2002-2010  Duraspace - Feedback