Original Article

Information Visualization (2007) 6, 139–154. doi:10.1057/palgrave.ivs.9500153

Comparison of visualization methods for an atlas of gene expression data sets

Jarkko Venna1 and Samuel Kaski1

1Helsinki Institute of Information Technology and Adaptive Informatics Research Centre, Helsinki University of Technology, Finland

Correspondence: Samuel Kaski, Laboratory of Computer and Information Science, Helsinki University of Technology, P.O. Box 5400, FI-02015 TKK, Finland. Tel.: +358 9 451 8203; Fax: +358 9 451 3277; E-mail: samuel.kaski@tkk.fi

Received 29 March 2005; Revised 11 June 2006; Accepted 15 June 2006; Published online 17 May 2007.

Abstract

This paper has two intertwined goals: (i) to study the feasibility of an atlas of gene expression data sets as a visual interface to expression databanks, and (ii) to study which dimensionality reduction methods are suitable for visualizing very high-dimensional data sets. Several new methods have recently been proposed for estimating data manifolds or embeddings, but they have not so far been compared in the task of visualization. In visualization, the dimensionality is constrained not only by the data itself but also by the presentation medium. It turns out that an older method, curvilinear component analysis, outperforms the newer ones in terms of the trustworthiness of the projections. In a sample databank of gene expression data, the main sources of variation were differences between data sets, between labs, and between measurement methods. This hints at a need for better methods for making the data sets commensurable, in accordance with earlier studies. The good news is that the visualized overview, the expression atlas, reveals many of these subsets. Hence, we conclude that dimensionality reduction, even from 1339 dimensions to 2, can produce a useful interface to gene expression databanks.
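
The abstract does not define trustworthiness, so the following is only a minimal sketch under the commonly used rank-based definition: a projection is penalized whenever a point among the k nearest neighbors in the low-dimensional display is not among the k nearest neighbors in the original space, with the penalty growing with the point's original-space rank. The function name, the Euclidean metric, the choice of k, the random data, and the use of PCA as a stand-in projection are all assumptions made for illustration; none of them is taken from the paper.

```python
import numpy as np


def pairwise_sq_dists(A):
    """Squared Euclidean distances between all rows of A."""
    sq = np.sum(A * A, axis=1)
    D = sq[:, None] + sq[None, :] - 2.0 * (A @ A.T)
    D = np.maximum(D, 0.0)        # clip tiny negatives from round-off
    np.fill_diagonal(D, -1.0)     # make each point sort first in its own row
    return D


def trustworthiness(X, Y, k=5):
    """Rank-based trustworthiness of projection Y for original data X.

    Assumed definition (not quoted from the paper):
    T(k) = 1 - 2 / (n k (2n - 3k - 1)) * sum_i sum_{j in U_k(i)} (r(i, j) - k),
    where U_k(i) holds points that are k-neighbors of i in Y but not in X,
    and r(i, j) is the rank of j among the neighbors of i in X.
    """
    n = X.shape[0]
    if Y.shape[0] != n:
        raise ValueError("X and Y must have the same number of rows")
    if not 0 < k < n / 2:
        raise ValueError("k must satisfy 0 < k < n / 2")

    # ranks_X[i, j] = rank of j as a neighbor of i in the original space
    # (0 for i itself, 1 for its nearest neighbor, and so on).
    order_X = np.argsort(pairwise_sq_dists(X), axis=1)
    rows = np.arange(n)[:, None]
    ranks_X = np.empty_like(order_X)
    ranks_X[rows, order_X] = np.arange(n)[None, :]

    # k nearest neighbors of each point in the projection (column 0 is self).
    nn_Y = np.argsort(pairwise_sq_dists(Y), axis=1)[:, 1:k + 1]

    # Neighbors with original rank <= k are true neighbors and add nothing.
    penalty = np.sum(np.maximum(ranks_X[rows, nn_Y] - k, 0))
    return 1.0 - 2.0 / (n * k * (2 * n - 3 * k - 1)) * penalty


# Illustration on synthetic data: 200 points in 1339 dimensions,
# projected to 2 dimensions with PCA (a stand-in, not the paper's method).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 1339))
Xc = X - X.mean(axis=0)
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)
Y = Xc @ Vt[:2].T
print("trustworthiness of the 2-D PCA projection:", trustworthiness(X, Y, k=10))
```

A value of 1 means every neighbor shown in the display is also a neighbor in the original space; lower values indicate that the display places unrelated points close together.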

Keywords:

Gene expression, manifold extraction, nonlinear dimensionality reduction, visualization
