Clustering is a sub-area of data mining, which congregates similar data records in a group.So we put forward applying this technology into detecting approximately duplicate data records.
英
美
- 聚类是将相似度高的数据对象聚集到一个类中,于是我们提出将该技术用于近似重复记录的发现上。