A hybrid method for detecting outdated information in wikipedia infoboxes

Wikipedia has grown fast and become a major information resource for users as well as for many knowledge bases derived from it. However it is still edited manually while the world is changing rapidly. In this paper, we propose a method to detect outdated attribute values in Wikipedia infoboxes by us...

全面介绍

Đã lưu trong:
书目详细资料
Những tác giả chính: Tran, Thong, Cao Hoang Tru
格式: Conference paper
语言:Vietnamese
出版: IEEE 2024
在线阅读:https://scholar.dlu.edu.vn/handle/123456789/3526
标签: 添加标签
没有标签, 成为第一个标记此记录!
Thư viện lưu trữ: Thư viện Trường Đại học Đà Lạt
实物特征
总结:Wikipedia has grown fast and become a major information resource for users as well as for many knowledge bases derived from it. However it is still edited manually while the world is changing rapidly. In this paper, we propose a method to detect outdated attribute values in Wikipedia infoboxes by using facts extracted from the general Web. Our proposed method extracts new information by combining pattern-based approach with entity-search-based approach to deal with the diversity of natural language presentation forms of facts on the Web. Our experimental results show that the achieved accuracies of the proposed method are 70% and 82% respectively on the chief-executive-officer attribute and the number-of-employees attribute in company infoboxes. It significantly improves the accuracy of the single pattern-based or entity-search-based method. The results also reveal the striking truth about the outdated …