File(s) not publicly available

Multiple strong and balanced cluster-based ensemble of deep learners

journal contribution
posted on 23.03.2021, 02:16 by Muhammad Zohaib Jan, Brijesh Verma
© 2020 Convolutional Neural Networks (CNNs), also known as deep learners have seen much success in the last few years due to the availability of large amounts of data and high-performance computational resources. A CNN can be trained effectively if large amounts of data are available as it enables a CNN to find the optimal set of features and weights that can achieve the highest generalization performance. However, due to the requirement of large data size, CNNs require a lot of resources for example running time and computational resources to achieve a reasonable performance. Additionally, unbalanced data makes it difficult to train a CNN effectively that can achieve good generalization performance. In order to alleviate these limitations, in this paper, we propose a novel ensemble of deep learners that learns by combining multiple deep learners trained on small strongly class associated input data effectively. We propose a novel methodology of generating random subspace through clustering input data and propose a measure which can classify each cluster as a strong data cluster and a balanced data cluster. A methodology is also proposed that balances all strong data clusters in the pool so that an architecturally simple CNN can be trained on all balanced data clusters simultaneously. Classification decisions on all trained CNNs are then fused through majority voting to generate class decisions of the ensemble. The performance of the proposed ensemble approach is evaluated on UCI benchmark datasets, and results are compared with existing state-of-the-art ensemble approaches. Significance testing was conducted to further validate the efficacy of the results and a significance test analysis is presented.

Funding

Category 1 - Australian Competitive Grants (this includes ARC, NHMRC)

History

Volume

107

Start Page

1

End Page

11

Number of Pages

11

eISSN

1873-5142

ISSN

0031-3203

Publisher

Elsevier

Language

en

Peer Reviewed

Yes

Open Access

No

Acceptance Date

04/05/2020

Author Research Institute

Centre for Intelligent Systems

Era Eligible

Yes

Journal

Pattern Recognition

Article Number

107420