2010 PracticalVeryLargeScaleCRFs

From GM-RKB
Jump to navigation Jump to search

Subject Headings: CRF Training Algorithm.

Notes

Cited By

Quotes

Author Keywords

Abstract

Conditional Random Fields (CRFs) are a widely-used approach for supervised sequence labelling, notably due to their ability to handle large description spaces and to integrate structural dependency between labels. Even for the simple linear-chain model, taking structure into account implies a number of parameters and a computational effort that grows quadratically with the cardinality of the label set. In this paper, we address the issue of training very large CRFs, containing up to hundreds output labels and several billion features. Efficiency stems here from the sparsity induced by the use of a l1 penalty term. Based on our own implementation, we compare three recent proposals for implementing this regularization strategy. experiments demonstrate that very large CRFs can be trained efficiently and that very large models are able to improve the accuracy, while delivering compact parameter sets.



References

,

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2010 PracticalVeryLargeScaleCRFsThomas Lavergne
Olivier Cappé
François Yvon
Practical Very Large Scale CRFshttp://www.aclweb.org/anthology-new/P/P10/P10-1052.pdf2010