Quality-based guidance for exploratory dimensionality reduction

Sara Johansson Fernstad, Jane Shaw, Jimmy Johansson

Research output: Contribution to journal › Article › peer-review

18 Citations (Scopus)

Abstract

High-dimensional data sets containing hundreds of variables are difficult to explore, as traditional visualization methods often are unable to represent such data effectively. This is commonly addressed by employing dimensionality reduction prior to visualization. Numerous dimensionality reduction methods are available. However, few reduction approaches take the importance of several structures into account and few provide an overview of structures existing in the full high-dimensional data set. For exploratory analysis, as well as for many other tasks, several structures may be of interest. Exploration of the full high-dimensional data set without reduction may also be desirable. This paper presents flexible methods for exploratory analysis and interactive dimensionality reduction. Automated methods are employed to analyse the variables, using a range of quality metrics, providing one or more measures of ‘interestingness’ for individual variables. Through ranking, a single value of interestingness is obtained, based on several quality metrics, that is usable as a threshold for the most interesting variables. An interactive environment is presented in which the user is provided with many possibilities to explore and gain understanding of the high-dimensional data set. Guided by this, the analyst can explore the high-dimensional data set and interactively select a subset of the potentially most interesting variables, employing various methods for dimensionality reduction. The system is demonstrated through a use-case analysing data from a DNA sequence-based study of bacterial populations.
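The ranking step described in the abstract can be illustrated with a minimal sketch. The abstract does not name the quality metrics or the combination rule, so everything here is an assumption made for illustration: the two metrics (variance and maximum absolute Pearson correlation with any other variable), the equal-weight rank averaging, and the helper names `rank_scores`, `abs_correlation`, `interestingness` and `select_variables`. It is not the authors' implementation.

```python
from math import sqrt
from statistics import mean, pvariance


def rank_scores(values):
    """Map raw metric values to ranks in [0, 1]; the largest value gets 1.0.
    Ties are broken by input order (a simplification)."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    for position, idx in enumerate(order):
        ranks[idx] = position / (n - 1) if n > 1 else 1.0
    return ranks


def abs_correlation(x, y):
    """Absolute Pearson correlation; 0.0 if either variable is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy / sqrt(sxx * syy))


def interestingness(columns, weights=(0.5, 0.5)):
    """Combine two assumed quality metrics into one score per variable."""
    names = list(columns)
    variance = [pvariance(columns[n]) for n in names]
    max_corr = [
        max((abs_correlation(columns[n], columns[m])
             for m in names if m != n), default=0.0)
        for n in names
    ]
    var_rank = rank_scores(variance)
    corr_rank = rank_scores(max_corr)
    return {
        n: weights[0] * var_rank[i] + weights[1] * corr_rank[i]
        for i, n in enumerate(names)
    }


def select_variables(columns, threshold):
    """Keep variables whose combined score reaches the threshold,
    most interesting first."""
    scores = interestingness(columns)
    kept = [n for n in scores if scores[n] >= threshold]
    return sorted(kept, key=lambda n: scores[n], reverse=True)


# Hypothetical toy data: three variables, five observations each.
data = {
    "a": [1, 2, 3, 4, 5],
    "b": [2, 4, 6, 8, 10],
    "c": [3, 3, 3, 3, 4],
}
selected = select_variables(data, threshold=0.5)
print(selected)
```

Averaging ranks rather than raw metric values keeps metrics with different scales comparable, which is one plausible reason to rank before thresholding. In an interactive setting the weights and the threshold would be the parameters an analyst adjusts.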
Original language: English
Pages (from-to): 44-64
Journal: Information Visualization
Volume: 12
Issue number: 1
Publication status: Published - Jan 2013
