Clustering is a sub-area of data mining that groups similar data records together. We therefore propose applying this technique to the detection of approximately duplicate data records.

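  • Clustering gathers highly similar data objects into a single class, so we propose using this technique to discover approximately duplicate records.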
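
The idea of clustering similar records to surface approximate duplicates can be illustrated with a minimal sketch. It is not the method of the quoted source, which gives no algorithmic detail. It assumes records are plain strings, uses the standard-library `difflib.SequenceMatcher` ratio as the similarity measure, and merges records by single linkage. The function names, the sample records, and the 0.85 threshold are all hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a similarity ratio in [0, 1] between two normalized strings."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def cluster_near_duplicates(records, threshold=0.85):
    """Group records linked by pairwise similarity >= threshold (single linkage).

    The threshold is an assumed value; a real data set would need tuning.
    """
    parent = list(range(len(records)))  # union-find forest over record indices

    def find(i):
        # Follow parent links to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Compare every pair; merge the two clusters when the records are similar.
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            if similarity(records[i], records[j]) >= threshold:
                parent[find(j)] = find(i)

    # Collect records by their cluster root.
    clusters = {}
    for i, record in enumerate(records):
        clusters.setdefault(find(i), []).append(record)
    return list(clusters.values())


# Hypothetical sample data: the first two records differ by one character.
records = [
    "John Smith, 12 Main Street",
    "Jon Smith, 12 Main Street",
    "Alice Brown, 5 Oak Avenue",
]
clusters = cluster_near_duplicates(records)
```

Any cluster with more than one member is a candidate set of approximate duplicates. The all-pairs comparison is quadratic in the number of records, so a larger data set would likely need a blocking or sorting step first.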