For Authors_For Subscribers_For Librarians_For SocietiesFor Advertisers

Home | About Us | Contact Us | Site Map | FAQs

journal home
 
Services for Readers
Services for authors
Customer Services


September 2003, Volume 2, Number 3, Pages 160-170
Table of contents   Previous  Full text  Next   PDF
Original Article
Co-Citation count vs correlation for influence network visualization
Steven Noel1, Chee-Hung Henry Chu2 and Vijay Raghavan2

1Center for Secure Information Systems, George Mason University, Fairfax, VA, U.S.A.

2Center for Advanced Computer Studies, The University of Louisiana at Lafayette, Lafayette, LA, U.S.A.

Correspondence to: Henry Chu, Center for Advanced Computer Studies, The University of Louisiana at Lafayette, PO Box 44330, Lafayette, LA 70504-4330, U.S.A. Tel: +1 337 482 6309; Fax: +1 337 482 5791; E-mail: cice@cacs.louisiana.edu

Abstract

Visualization of author or document influence networks as a two-dimensional image can provide key insights into the direct influence of authors or documents on each other in a document collection. The influence network is constructed based on the minimum spanning tree, in which the nodes are documents and an edge is the most direct influence between two documents. Influence network visualizations have typically relied on co-citation correlation as a measure of document similarity. That is, the similarity between two documents is computed by correlating the sets of citations to each of the two documents. In a different line of research, co-citation count (the number of times two documents are jointly cited) has been applied as a document similarity measure. In this work, we demonstrate the impact of each of these similarity measures on the document influence network. We provide examples, and analyze the significance of the choice of similarity measure. We show that correlation-based visualizations exhibit chaining effects (low average vertex degree), a manifestation of multiple minor variations in document similarities. These minor similarity variations are absent in count-based visualizations. The result is that count-based influence network visualizations are more consistent with the intuitive expectation of authoritative documents being hubs that directly influence large numbers of documents.

Information Visualization (2003) 2, 160-170. doi:10.1057/palgrave.ivs.9500049

Keywords

Document collection visualization; co-citation analysis; influence networks; minimum spanning tree; graph layout

Received 14 March 2003; revised 5 September 2003; accepted 6 September 2003
Table of contents   Previous  Full text  Next   PDF