Show simple item record

dc.contributor.advisorNelson, Paul
dc.creatorKini, Ananth Ullal
dc.date.accessioned2006-04-12T16:02:45Z
dc.date.available2006-04-12T16:02:45Z
dc.date.created2005-12
dc.date.issued2006-04-12
dc.identifier.urihttps://hdl.handle.net/1969.1/3116
dc.description.abstractCluster-based information retrieval systems often use a similarity measure to compute the association among text documents. In this thesis, we focus on a class of similarity measures named Query-Sensitive Similarity (QSS) measures. Recent studies have shown QSS measures to positively influence the outcome of a clustering procedure. These studies have used QSS measures in conjunction with the ltc term-weighting scheme. Several term-weighting schemes have superseded the ltc term-weighing scheme and demonstrated better retrieval performance relative to the latter. We test whether introducing one of these schemes, INQUERY, will offer any benefit over the ltc scheme when used in the context of QSS measures. The testing procedure uses the Nearest Neighbor (NN) test to quantify the clustering effectiveness of QSS measures and the corresponding term-weighting scheme. The NN tests are applied on certain standard test document collections and the results are tested for statistical significance. On analyzing results of the NN test relative to those obtained for the ltc scheme, we find several instances where the INQUERY scheme improves the clustering effectiveness of QSS measures. To be able to apply the NN test, we designed a software test framework, Ferret, by complementing the features provided by dtSearch, a search engine. The test framework automates the generation of NN coefficients by processing standard test document collection data. We provide an insight into the construction and working of the Ferret test framework.en
dc.format.extent286564 bytesen
dc.format.mediumelectronicen
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.publisherTexas A&M University
dc.subjectInformation Retrievalen
dc.subjectTerm-Weightingen
dc.subjectSimilarity Measureen
dc.subjectINQUERYen
dc.subjectltcen
dc.subjectQuery-Sensitive Similarityen
dc.subjectClusteringen
dc.subjectNearest Neighborsen
dc.subjectDocument Collectionen
dc.subjectTexten
dc.subjectSearchen
dc.titleOn the effect of INQUERY term-weighting scheme on query-sensitive similarity measuresen
dc.typeBooken
dc.typeThesisen
thesis.degree.departmentComputer Scienceen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorTexas A&M Universityen
thesis.degree.nameMaster of Scienceen
thesis.degree.levelMastersen
dc.contributor.committeeMemberBurchill, William E.
dc.contributor.committeeMemberLeggett, John J.
dc.type.genreElectronic Thesisen
dc.type.materialtexten
dc.format.digitalOriginborn digitalen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record