Original Article
Information Visualization advance online publication 9 April 2009; doi: 10.1057/ivs.2009.3
Visualising the structure of document search results: A comparison of graph theoretic approaches
Timothy Cribbin1
1Department of Information Systems and Computing, Brunel University, Kingston Lane, Uxbridge, Middlesex UB8 3PH, UK
Correspondence: Timothy Cribbin, E-mail: timothy.cribbin@brunel.ac.uk
Received 7 August 2008; Revised 9 February 2009; Accepted 11 February 2009; Published online 9 April 2009.
Abstract
Previous work has shown that distance-similarity visualisation or 'spatialisation' can provide a potentially useful context in which to browse the results of a query search, enabling the user to adopt a simple local foraging or 'cluster growing' strategy to navigate through the retrieved document set. However, faithfully mapping feature-space models to visual space can be problematic owing to their inherent high dimensionality and non-linearity. Conventional linear approaches to dimension reduction tend to fail at this kind of task, sacrificing local structural in order to preserve a globally optimal mapping. In this paper the clustering performance of a recently proposed algorithm called isometric feature mapping (Isomap), which deals with non-linearity by transforming dissimilarities into geodesic distances, is compared to that of non-metric multidimensional scaling (MDS). Various graph pruning methods, for geodesic distance estimation, are also compared. Results show that Isomap is significantly better at preserving local structural detail than MDS, suggesting it is better suited to cluster growing and other semantic navigation tasks. Moreover, it is shown that applying a minimum-cost graph pruning criterion can provide a parameter-free alternative to the traditional K-neighbour method, resulting in spatial clustering that is equivalent to or better than that achieved using an optimal-K criterion.
Keywords:
information retrieval, document visualisation, multidimensional scaling, isometric feature mapping, minimum spanning tree, pathfinder associative network




