Prediction of Datasets sameAs Interlinking on Web of Data
DOI: 10.23977/jwsa.2017.11005 | Downloads: 17 | Views: 3054
Jintao Tang 1, Ting Wang 1, Haichi Liu 1
1 College of Computer, National University of Defense Technology, Changsha, Hunan Province, China
Corresponding AuthorHaichi Liu
In order to be considered as Linked Data, the datasets on the web must be linked to other datasets. We focus on predicting the possible links between datasets with the most important RDF link type, owl:sameAs using link prediction and classification techniques. Since the goal is to discriminate between linked dataset pairs against not-linked ones, we formulate the link prediction problem as a classification problem. We adopt Random Forest as the basic classifier to incorporate features of the scores output by unsupervised predictors, and apply the bagging technique to combine multiple forests to reduce variance and improve the accuracy. Experiments show we can improve the prediction performance by about 10% in AUROC compared with the best unsupervised predictor.
KEYWORDSLinked data, Dataset, sameAs interlinking, Link Prediction.
CITE THIS PAPER
Haichi, L. , Ting, W. , Jintao, T. (2017) Prediction of Datasets sameAs Interlinking on Web of Data. Journal of Web Systems and Applications (2017) 1: 25-29.
 Schmachtenberg M, Bizer C, Paulheim H. Adoption of the Linked Data Best Practices in Different Topical Domains[J]. Lecture Notes in Computer Science, 2014:245-260.
 S. Bechhofer, F. van Harmelen, J. Hendler, I. Horrocks, D. McGuinness, P. Patel- Schneider, and L. A. Stein. OWL Web Ontology Language Reference. W3C Recom- mendation. www.w3.org/TR/owl-ref (2004).
 Nikolov A, Motta E. What Should I Link to? Identifying Relevant Sources and Classes for Data Linking[J]. Lecture Notes in Computer Science, 2012:284-299.
 Liu H. et al. (2016) Identifying Linked Data Datasets for sameAs Interlinking Using Recommendation Techniques. In: Cui B., Zhang N., Xu J., Lian X., Liu D. (eds) Web-Age Information Management. WAIM 2016. Lecture Notes in Computer Science, vol 9658. Springer, Cham
 Lopes G. R., Leme L.A.P.P, Nunes B.P., et al. Two Approaches to the Dataset Interlinking Recommendation Problem. 2014. In: 15th International Conference on Web Information System Engineering (WISE 2014).71-74.
 HC Liu, PL Liu, JT Tang, H Ning, DP Wei, T Wang, Collaborative Datasets Retrieval for Interlinking on Web of Data, WWW 2015 Companion, May 18–22, 2015, Florence, Italy.
 Lü L, Zhou T. Link prediction in complex networks: A survey[J]. Physica A Statistical Mechanics & Its Applications, 2011, 390(6):1150–1170.