The statistical characteristics of dimensionality in latent semantic analysis LSA space were studied to realize automatic document clustering under different concept levels.
英
美
- 另外,在基于潜在语义分析的文档聚类算法中,采用文档自检索矩阵的行向量,代替低维文档向量作为聚类对象,获得了更好的聚类准确率。