TY - GEN
T1 - A supervised learning approach for imbalanced data sets
AU - Nguyen, Giang H.
AU - Bouzerdoum, Abdesselam
AU - Phung, Son L.
PY - 2008
Y1 - 2008
N2 - This paper presents a new learning approachfor pattern classification applications involving imbalanced data sets. In this approach, a clustering technique is employed to resample the original training set into a smaller set of representative training exemplars, represented by weighted cluster centers and their target outputs. Based on the proposed learning approach, four training algorithms are derived for feedjorward neural networks. These algorithms are implemented and tested on three benchmark data sets. Experimental results show that with the proposed learning approach, it is possible to design networks to tackle the class imbalance problem, without compromising the overall classification performance.
AB - This paper presents a new learning approachfor pattern classification applications involving imbalanced data sets. In this approach, a clustering technique is employed to resample the original training set into a smaller set of representative training exemplars, represented by weighted cluster centers and their target outputs. Based on the proposed learning approach, four training algorithms are derived for feedjorward neural networks. These algorithms are implemented and tested on three benchmark data sets. Experimental results show that with the proposed learning approach, it is possible to design networks to tackle the class imbalance problem, without compromising the overall classification performance.
UR - http://www.scopus.com/inward/record.url?scp=77957960090&partnerID=8YFLogxK
U2 - 10.1109/icpr.2008.4761278
DO - 10.1109/icpr.2008.4761278
M3 - Conference contribution
AN - SCOPUS:77957960090
SN - 9781424421756
T3 - Proceedings - International Conference on Pattern Recognition
BT - 2008 19th International Conference on Pattern Recognition, ICPR 2008
PB - Institute of Electrical and Electronics Engineers Inc.
ER -