2010 EnsemblePruningviaIndividualCon

From GM-RKB
Jump to navigation Jump to search

Subject Headings:

Notes

Cited By

Quotes

Author Keywords

Abstract

An ensemble is a set of learned models that make decisions collectively. Although an ensemble is usually more accurate than a single learner, existing ensemble methods often tend to construct unnecessarily large ensembles, which increases the memory consumption and computational cost. Ensemble pruning tackles this problem by selecting a subset of ensemble members to form subensembles that are subject to less resource consumption and response time with accuracy that is similar to or better than the original ensemble. In this paper, we analyze the accuracy/diversity trade-off and prove that classifiers that are more accurate and make more predictions in the minority group are more important for subensemble construction. Based on the gained insights, a heuristic metric that considers both accuracy and diversity is proposed to ly|explicitly evaluate each individual classifier's contribution to the whole ensemble. By incorporating ensemble members in decreasing order of their contributions, subensembles are formed such that users can select the top [math]\displaystyle{ p }[/math] percent of ensemble members, depending on their resource availability and tolerable waiting time, for predictions. Experimental results on 26 UCI data sets show that subensembles formed by the proposed EPIC (Ensemble Pruning via Individual Contribution ordering) algorithm outperform the original ensemble and a state-of-the-art ensemble pruning method, Orientation Ordering (OO).

References

,

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2010 EnsemblePruningviaIndividualConXindong Wu
Xingquan Zhu
Zhenyu Lu
Josh Bongard
Ensemble Pruning via Individual Contribution OrderingKDD-2010 Proceedingshttp://www.cs.uvm.edu/~jbongard/papers/2010 KDD Lu.pdf10.1145/1835804.18359142010