DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
UNIVERSITY OF CALIFORNIA, SAN DIEGO
Training sets of positive and unlabeled records
Here are the P, Q, and N sets used in the paper Learning Classifiers from Only Positive and Unlabeled Data by Charles Elkan and Keith Noto. Each file is a Zip archive of SwissProt records stored as plain text.
Most recently updated on June 18, 2008 by Charles Elkan, elkan@cs.ucsd.edu