TY - GEN
T1 - Guiding local regression using visualisation
AU - Maniyar, Dharmesh M.
AU - Nabney, Ian T.
PY - 2005/12
Y1 - 2005/12
N2 - Solving many scientific problems requires effective regression and/or classification models for large high-dimensional datasets. Experts from these problem domains (e.g. biologists, chemists, financial analysts) have insights into the domain which can be helpful in developing powerful models but they need a modelling framework that helps them to use these insights. Data visualisation is an effective technique for presenting data and requiring feedback from the experts. A single global regression model can rarely capture the full behavioural variability of a huge multi-dimensional dataset. Instead, local regression models, each focused on a separate area of input space, often work better since the behaviour of different areas may vary. Classical local models such as Mixture of Experts segment the input space automatically, which is not always effective and it also lacks involvement of the domain experts to guide a meaningful segmentation of the input space. In this paper we addresses this issue by allowing domain experts to interactively segment the input space using data visualisation. The segmentation output obtained is then further used to develop effective local regression models.
AB - Solving many scientific problems requires effective regression and/or classification models for large high-dimensional datasets. Experts from these problem domains (e.g. biologists, chemists, financial analysts) have insights into the domain which can be helpful in developing powerful models but they need a modelling framework that helps them to use these insights. Data visualisation is an effective technique for presenting data and requiring feedback from the experts. A single global regression model can rarely capture the full behavioural variability of a huge multi-dimensional dataset. Instead, local regression models, each focused on a separate area of input space, often work better since the behaviour of different areas may vary. Classical local models such as Mixture of Experts segment the input space automatically, which is not always effective and it also lacks involvement of the domain experts to guide a meaningful segmentation of the input space. In this paper we addresses this issue by allowing domain experts to interactively segment the input space using data visualisation. The segmentation output obtained is then further used to develop effective local regression models.
KW - regression models
KW - classification models
KW - large high-dimensional datasets
UR - http://www.scopus.com/inward/record.url?scp=33645996049&partnerID=8YFLogxK
UR - http://link.springer.com/chapter/10.1007%2F11559887_6
U2 - 10.1007/11559887_6
DO - 10.1007/11559887_6
M3 - Conference publication
SN - 3-540-29073-7
SN - 978-3-540-29073-5
T3 - Lecture Notes in Computer Science
SP - 98
EP - 109
BT - Deterministic and statistical methods in machine learning
A2 - Winkler, Joab
A2 - Niranjan, Mahesan
A2 - Lawrence, Neil
PB - Springer
CY - Berlin (DE)
T2 - 1st International Workshop on Deterministic and Statistical Methods in Machine Learning
Y2 - 7 September 2004 through 10 September 2004
ER -