Original Article
Information Visualization (2007) 6, 139–154. doi:10.1057/palgrave.ivs.9500153
Comparison of visualization methods for an atlas of gene expression data sets
Jarkko Venna1 and Samuel Kaski1
1Helsinki Institute of Information Technology and Adaptive Informatics Research Centre, Helsinki University of Technology, Finland
Correspondence: Samuel Kaski, Laboratory of Computer and Information Science, Helsinki University of Technology, P.O. Box 5400, FI-02015 TKK, Finland. Tel.: +358 9 451 8203; Fax: +358 9 451 3277; E-mail: samuel.kaski@tkk.fi
Received 29 March 2005; Revised 11 June 2006; Accepted 15 June 2006; Published online 17 May 2007.
Abstract
This paper has two intertwined goals: (i) to study the feasibility of an atlas of gene expression data sets as a visual interface to expression databanks, and (ii) to study which dimensionality reduction methods would be suitable for visualizing very high-dimensional data sets. Several new methods have been recently proposed for the estimation of data manifolds or embeddings, but they have so far not been compared in the task of visualization. In visualizations the dimensionality is constrained, in addition to the data itself, by the presentation medium. It turns out that an older method, curvilinear component analysis, outperforms the new ones in terms of trustworthiness of the projections. In a sample databank on gene expression, the main sources of variation were the differences between data sets, different labs, and different measurement methods. This hints at a need for better methods for making the data sets commensurable, in accordance with earlier studies. The good news is that the visualized overview, expression atlas, reveals many of these subsets. Hence, we conclude that dimensionality reduction even from 1339 to 2 can produce a useful interface to gene expression databanks.
Keywords:
Gene expression, manifold extraction, nonlinear dimensionality reduction, visualization
MORE ARTICLES LIKE THIS
These links to content published by Palgrave Macmillan are automatically generated.
RESEARCH
Comparison of visualization methods for an atlas of gene expression data setsInformation Visualization Original Article
Data transformations and representations for computation and visualizationInformation Visualization Original Article
Visual cluster analysis of trajectory data with interactive Kohonen mapsInformation Visualization Original Article


