Abstract
The concept of sparsity as more approptiate characteristic of the data representation than the number of features used was discussed. A feature ranking and a feature selection method based on the linear support vector machines (SVM) that was used in conjunction with the SVM classifier was also proposed. This method can be combined with other classification algorithms. The results show that, at the same level of vector sparcity, feature selection based on SVM normals yields better classification performance than odds ratio or information gain based feature selection when linear SVM classifiers are used.
Original language | English |
---|---|
Title of host publication | Data Mining III |
Editors | A. Zanasi, C.A. Brebbia, N.F.F.E. Ebecken, P. Melli |
Publisher | WITPress |
Pages | 261-273 |
Number of pages | 13 |
Volume | 6 |
ISBN (Print) | 1853128309 |
Publication status | Published - 2002 |
Externally published | Yes |
Event | Third International Conference on Data Mining, Data Mining III - Bologna, Italy Duration: 25 Sept 2002 → 27 Sept 2002 |
Conference
Conference | Third International Conference on Data Mining, Data Mining III |
---|---|
Country/Territory | Italy |
City | Bologna |
Period | 25/09/02 → 27/09/02 |