com.sun.labs.minion.classification
Interface BulkClassifier

All Known Implementing Classes:
Rocchio

public interface BulkClassifier

An interface for classifiers that can do bulk classification.


Method Summary
 float[][] classify(java.lang.String fromField, ClassifierDiskPartition cdp, DiskPartition sdp)
          Evaluates all of the classifiers in the given classifier disk partition against all of the new documents in the given disk partition.
 FeatureClusterSet getFeatures()
           
 

Method Detail

getFeatures

FeatureClusterSet getFeatures()

classify

float[][] classify(java.lang.String fromField,
                   ClassifierDiskPartition cdp,
                   DiskPartition sdp)
Evaluates all of the classifiers in the given classifier disk partition against all of the new documents in the given disk partition.

Parameters:
fromField - the field from which the terms should be gathered.
cdp - A partition of classifiers to evaluate
sdp - A partition of documents to evaluate the classifiers against
Returns:
a two dimensional array of evaluation scores. Element i,j of the array is the score for document with ID j in the new partition for the classifier with document ID i in the classifier partition.