
Please use this identifier to cite or link to this item: http://hdl.handle.net/1860/2740

Title: Utilization of global ranking information in graph-based biomedical literature clustering
Authors: Zhang, Xiaodan
Hu, Xiaohua
Xia, Jiali
Zhou, Xiaohua
Achananuparp, Palakorn
Keywords: Document Clustering; Term Graph; Global Ranking
Issue Date: 3-Sep-2007
Citation: Paper presented at the 9th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2007, Regensburg, Germany.
Abstract: In this paper, we explore how a global ranking method, used in conjunction with a local density method, helps identify meaningful term clusters from an ontology-enriched graph representation of a biomedical literature corpus. A major problem in document clustering is how to discount the effects of class-unspecific general terms and strengthen the effects of class-specific core terms. We claim that running a global ranking method on a well-constructed term graph can identify class-specific core terms. Specifically, PageRank and HITS are applied to a directed abstract-title graph to target class-specific core terms. Then k dense term clusters (graphs) are identified from these terms. Finally, each document is assigned to the closest term graph. A series of experiments is conducted on a document corpus collected from PubMed. Experimental results show that our approach is very effective at identifying class-specific core terms and thus helps document clustering.
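The global ranking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that edges run from each abstract term to each title term of the same document (the abstract does not specify the construction), and it uses a plain power-iteration PageRank. The function names, the toy documents, and the damping value of 0.85 are all hypothetical choices made for illustration.

```python
def build_term_graph(docs):
    """Build a directed term graph from (title_terms, abstract_terms) pairs.

    ASSUMPTION: an edge points from an abstract term to a title term of the
    same document. The abstract does not state the actual construction.
    """
    edges = {}
    for title_terms, abstract_terms in docs:
        # Register every term as a node, even if it has no out-links.
        for term in list(title_terms) + list(abstract_terms):
            edges.setdefault(term, set())
        for a in abstract_terms:
            for t in title_terms:
                if a != t:
                    edges[a].add(t)
    return edges


def pagerank(edges, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict mapping node -> set of successors."""
    nodes = sorted(edges)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Rank held by dangling nodes (no out-links) is spread uniformly.
        dangling = sum(rank[v] for v in nodes if not edges[v])
        new_rank = {v: (1.0 - damping) / n + damping * dangling / n
                    for v in nodes}
        for v in nodes:
            out = edges[v]
            if out:
                share = damping * rank[v] / len(out)
                for w in out:
                    new_rank[w] += share
        rank = new_rank
    return rank


# Hypothetical toy corpus: (title terms, abstract terms) per document.
docs = [
    (["gene"], ["gene", "expression", "study"]),
    (["gene", "mutation"], ["mutation", "analysis", "study"]),
]
graph = build_term_graph(docs)
ranks = pagerank(graph)

# Terms sorted by descending rank; high-ranked terms are core-term candidates.
for term in sorted(ranks, key=ranks.get, reverse=True):
    print(f"{term}: {ranks[term]:.3f}")
```

In this toy graph, a general term such as "study" receives no in-links and therefore ranks low, while title terms accumulate rank. The later steps in the abstract (finding k dense term clusters and assigning each document to the closest term graph) are not covered by this sketch.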
URI: http://hdl.handle.net/1860/2740
Appears in Collections: Faculty Research and Publications (IST)

Files in This Item:

File: 2006175422.pdf
Size: 106.25 kB
Format: Adobe PDF

Items in iDEA are protected by copyright, with all rights reserved, unless otherwise indicated.

 
