Abstract
In the medical domain, data are often collected over time, evolving from simple to refined categories. The data and the underlying structures of the medical data as to how they have grown to today's complexity can be decomposed into crude forms when data collection starts. For instance, the cancer dataset is labeled either benign or malignant at its simplest or perhaps the earliest form. As medical knowledge advances and/or more data become available, the dataset progresses from binary class to multi-class, having more labels of sub-categories of the disease added. In machine learning, inducing a multi-class model requires more computational power. Model optimization is enforced over the multi-class models for the highest possible accuracy, which of course, is necessary for life-and-death decision making. This model optimization task consumes an extremely long model training time. In this paper, a novel strategy called Group-of-Single-Class prediction (GOSC) coupled with majority voting and model transfer is proposed for achieving maximum accuracy by using only a fraction of the model training time. The main advantage is the ability to achieve an optimized multi-class classification model that has the highest possible accuracy near to the absolute maximum, while the training time could be saved by up to 70%. Experiments on machine learning over liver dataset classification and deep learning over COVID19 lung CT images were tested. Preliminary results suggest the feasibility of this new approach.
Original language | English |
---|---|
Pages (from-to) | 10-22 |
Number of pages | 13 |
Journal | Future Generation Computer Systems |
Volume | 133 |
Early online date | 17 Mar 2022 |
DOIs | |
Publication status | Published - 1 Aug 2022 |
Bibliographical note
© 2022, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/Funding Information:
This work was supported in part by 2018 Guangzhou Science and Technology Innovation and Development of Special Funds , via Grant no. EF003/FST-FSJ/2019/GSTIC , and code no. 201907010001, and also VC Research ( VCR 0000149 ).
Keywords
- Algorithm
- Classification model training
- Deep learning
- Machine learning
- Medical dataset
- Multi-class classification
- Parameter optimization
- Radiological images recognition