Abstract
Projection of a high-dimensional dataset onto a two-dimensional space is a useful tool to visualise structures and relationships in the dataset. However, a single two-dimensional visualisation may not display all the intrinsic structure. Therefore, hierarchical/multi-level visualisation methods have been used to extract more detailed understanding of the data. Here we propose a multi-level Gaussian process latent variable model (MLGPLVM). MLGPLVM works by segmenting data (with e.g. K-means, Gaussian mixture model or interactive clustering) in the visualisation space and then fitting a visualisation model to each subset. To measure the quality of multi-level visualisation (with respect to parent and child models), metrics such as trustworthiness, continuity, mean relative rank errors, visualisation distance distortion and the negative log-likelihood per point are used. We evaluate the MLGPLVM approach on the ‘Oil Flow’ dataset and a dataset of protein electrostatic potentials for the ‘Major Histocompatibility Complex (MHC) class I’ of humans. In both cases, visual observation and the quantitative quality measures have shown better visualisation at lower levels.
Original language | English |
---|---|
Title of host publication | IVAPP 2014 |
Subtitle of host publication | proceedings of the 5th international conference on Information Visualization Theory and Applications |
Editors | Robert S. Laramee, Andreas Kerren, José Braz |
Place of Publication | Lisbon (PT) |
Publisher | SciTePress |
Pages | 122-129 |
Number of pages | 8 |
ISBN (Print) | 978-989-758-005-5 |
Publication status | Published - 2014 |
Event | 5th international conference on Information Visualization Theory and Applications - Lisbon, Portugal Duration: 5 Jan 2014 → 8 Jan 2014 |
Conference
Conference | 5th international conference on Information Visualization Theory and Applications |
---|---|
Abbreviated title | IVAPP 2014 |
Country/Territory | Portugal |
City | Lisbon |
Period | 5/01/14 → 8/01/14 |
Keywords
- continuity
- Gaussian mixture model
- K-means
- major histocompatibility complex
- mean relative rank errors
- multi-level Gaussian process latent variable model
- negative log-likelihood
- trustworthiness
- visualisation distance distortion