TY - JOUR
T1 - A statistical based feature extraction method for breast cancer diagnosis in digital mammogram using multiresolution representation
AU - Meselhy Eltoukhy, Mohamed
AU - Faye, Ibrahima
AU - Belhaouari Samir, Brahim
PY - 2012/1
Y1 - 2012/1
N2 - This paper presents a method for breast cancer diagnosis in digital mammogram images. Multiresolution representations, wavelet or curvelet, are used to transform the mammogram images into a long vector of coefficients. A matrix is constructed by putting wavelet or curvelet coefficients of each image in row vector, where the number of rows is the number of images, and the number of columns is the number of coefficients. A feature extraction method is developed based on the statistical t-test method. The method is ranking the features (columns) according to its capability to differentiate the classes. Then, a dynamic threshold is applied to optimize the number of features, which can achieve the maximum classification accuracy rate. The method depends on extracting the features that can maximize the ability to discriminate between different classes. Thus, the dimensionality of data features is reduced and the classification accuracy rate is improved. Support vector machine (SVM) is used to classify between the normal and abnormal tissues and to distinguish between benign and malignant tumors. The proposed method is validated using 5-fold cross validation. The obtained classification accuracy rates demonstrate that the proposed method could contribute to the successful detection of breast cancer.
AB - This paper presents a method for breast cancer diagnosis in digital mammogram images. Multiresolution representations, wavelet or curvelet, are used to transform the mammogram images into a long vector of coefficients. A matrix is constructed by putting wavelet or curvelet coefficients of each image in row vector, where the number of rows is the number of images, and the number of columns is the number of coefficients. A feature extraction method is developed based on the statistical t-test method. The method is ranking the features (columns) according to its capability to differentiate the classes. Then, a dynamic threshold is applied to optimize the number of features, which can achieve the maximum classification accuracy rate. The method depends on extracting the features that can maximize the ability to discriminate between different classes. Thus, the dimensionality of data features is reduced and the classification accuracy rate is improved. Support vector machine (SVM) is used to classify between the normal and abnormal tissues and to distinguish between benign and malignant tumors. The proposed method is validated using 5-fold cross validation. The obtained classification accuracy rates demonstrate that the proposed method could contribute to the successful detection of breast cancer.
KW - Breast cancer detection
KW - Curvelet transform
KW - Digital mammogram
KW - Feature extraction
KW - Wavelet transform
UR - http://www.scopus.com/inward/record.url?scp=83855162264&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2011.10.016
DO - 10.1016/j.compbiomed.2011.10.016
M3 - Article
C2 - 22115076
AN - SCOPUS:83855162264
SN - 0010-4825
VL - 42
SP - 123
EP - 128
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
IS - 1
ER -