Journal of Shanghai Jiaotong University ›› 2015, Vol. 49 ›› Issue (08): 1075-1083.

• Automation Technique, Computer Technology •     Next Articles

An Attribute Similarity Adjusting Algorithm Based on Functional Dependency

TAN Mingchaoa,DIAO Xingchuna,CAO Jianjuna,FENG Jingb   

  1. (a. College of Command Information Systems, Nanjing  210007, China; b. College of Meteorology and Oceanography, PLA University of Science and Technology,  Nanjing  211101, China)
  • Received:2014-10-27 Online:2015-08-31 Published:2015-08-31

Abstract:

Abstract: The accuracy of attribute similarity is one of the important factors affecting the precision of entity resolution (ER). To improve the accuracy of attribute similarity, the relation between attribute similarity and functional dependency (FD) was analyzed and the principles for attribute similarity adjusting were suggested. The FD based methods for similarity partition, similarity transitively adjusting and cost computing of similarity adjusting were proposed. An algorithm for attribute similarity adjusting with FD (SAWFD) was put forward to improve the accuracy of attribute similarity. The experiment results show that the algorithm can better distinguish matching and unmatching records, and get higher scores of recall, precision and F1 measure.

Key words: entity resolution, attribute similarity, functional dependencies

CLC Number: