Four different feature selection methods are discussed,including Document Frequency(DF),Mutual Information(MI),X2 test(CHI),Correlation Coefficient(CC),and the correction of text categorization is compared using the algorithm of K nearest neighbor.

  • 考察了文档频率DF、互信息MI、CHI统计、CC统计四种不同的特征选择方法;并结合K近邻算法进行分类精度上的比较.
目录 查词历史