Building text classifiers using positive and unlabeled examples
Abstract
This paper studies the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there are no negative examples available for learning. Recently, a few techniques for solving this problem have been proposed in the literature. These techniques share the same underlying idea, which builds a classifier in two steps. Each existing technique uses a different method at each step. In this paper, we first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods across the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the existing techniques. © 2003 IEEE.
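As a rough sketch (the notation below is assumed for illustration, not quoted from the paper), the biased SVM idea treats every unlabeled document as a tentative negative and penalizes errors on the two sets asymmetrically, with the penalty weights chosen by validation:

% Sketch of a biased SVM formulation (assumed notation):
% positives P receive label y_i = +1; unlabeled documents U receive the
% tentative label y_i = -1, but with a smaller error penalty C_-.
\begin{aligned}
\min_{w,\,b,\,\xi}\quad & \tfrac{1}{2}\,\lVert w\rVert^{2}
  + C_{+}\sum_{i \in P}\xi_{i}
  + C_{-}\sum_{i \in U}\xi_{i} \\
\text{s.t.}\quad & y_{i}\,(w \cdot x_{i} + b) \ge 1 - \xi_{i},
  \qquad \xi_{i} \ge 0 \ \text{for all } i .
\end{aligned}

Setting C_{+} > C_{-} makes misclassifying a known positive more costly than misclassifying an unlabeled (and possibly positive) document, which is what biases the learned boundary toward the positive class.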