1994 IrrelevantFeaturesAndTheSubSelProb

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Feature Subset Selection Algorithm.

Notes

Quotes

  • We address the problem of finding a subset of features that allows a supervised induction algorithm to induce small high-accuracy concepts. We examine notions of relevance and irrelevance, and show that the definitions used in the machine learning literature do not adequately partition the features into useful categories of relevance. We present definitions for irrelevance and for two degrees of relevance. These definitions improve our understanding of the behavior of previous subset selection algorithms, and help define the subset of features that should be sought. The features selected should depend not only on the features and the target concept, but also on the induction algorithm. We describe a method for feature subset selection using cross-validation that is applicable to any induction algorithm, and discuss experiments conducted with ID3 and C4.5 on artificial and real datasets,


 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
1994 IrrelevantFeaturesAndTheSubSelProbGeorge H. John
Ron Kohavi
Karl Pflege
Irrelevant Features and the Subset Selection Problemhttp://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.30.3875&rep=rep1&type=pdf10.1.1.30.3875&rep=rep1&type=pdf