Publication
INTERSPEECH 2011
Conference paper
Shrinkage-based features for natural language call routing
Abstract
The feature set used with a classifier can have a large impact on classification performance. This paper presents a set of shrinkage-based features for Maximum Entropy and other classifiers in the exponential family. These features are inspired by the exponential class-based language model, Model M. We motivate the use of these features for the task of text classification and evaluate them on a natural language call routing task. The proposed features along with a new word clustering method result in significant improvements in action classification accuracy over typical word-based features, particularly for small amounts of training data. Copyright © 2011 ISCA.