A supervised learning approach for imbalanced data sets

Giang H. Nguyen, Abdesselam Bouzerdoum, Son L. Phung

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

36 Citations (Scopus)

Abstract

This paper presents a new learning approachfor pattern classification applications involving imbalanced data sets. In this approach, a clustering technique is employed to resample the original training set into a smaller set of representative training exemplars, represented by weighted cluster centers and their target outputs. Based on the proposed learning approach, four training algorithms are derived for feedjorward neural networks. These algorithms are implemented and tested on three benchmark data sets. Experimental results show that with the proposed learning approach, it is possible to design networks to tackle the class imbalance problem, without compromising the overall classification performance.

Original languageEnglish
Title of host publication2008 19th International Conference on Pattern Recognition, ICPR 2008
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Print)9781424421756
DOIs
Publication statusPublished - 2008
Externally publishedYes

Publication series

NameProceedings - International Conference on Pattern Recognition
ISSN (Print)1051-4651

Fingerprint

Dive into the research topics of 'A supervised learning approach for imbalanced data sets'. Together they form a unique fingerprint.

Cite this