A hybrid method for detecting outdated information in wikipedia infoboxes

Wikipedia has grown fast and become a major information resource for users as well as for many knowledge bases derived from it. However it is still edited manually while the world is changing rapidly. In this paper, we propose a method to detect outdated attribute values in Wikipedia infoboxes by us...

Mô tả đầy đủ

Đã lưu trong:
Chi tiết về thư mục
Những tác giả chính: Tran, Thong, Cao Hoang Tru
Định dạng: Conference paper
Ngôn ngữ:Vietnamese
Được phát hành: IEEE 2024
Truy cập trực tuyến:https://scholar.dlu.edu.vn/handle/123456789/3526
Các nhãn: Thêm thẻ
Không có thẻ, Là người đầu tiên thẻ bản ghi này!
Thư viện lưu trữ: Thư viện Trường Đại học Đà Lạt
id oai:scholar.dlu.edu.vn:123456789-3526
record_format dspace
spelling oai:scholar.dlu.edu.vn:123456789-35262024-07-17T06:26:14Z A hybrid method for detecting outdated information in wikipedia infoboxes Tran, Thong Cao Hoang Tru Wikipedia has grown fast and become a major information resource for users as well as for many knowledge bases derived from it. However it is still edited manually while the world is changing rapidly. In this paper, we propose a method to detect outdated attribute values in Wikipedia infoboxes by using facts extracted from the general Web. Our proposed method extracts new information by combining pattern-based approach with entity-search-based approach to deal with the diversity of natural language presentation forms of facts on the Web. Our experimental results show that the achieved accuracies of the proposed method are 70% and 82% respectively on the chief-executive-officer attribute and the number-of-employees attribute in company infoboxes. It significantly improves the accuracy of the single pattern-based or entity-search-based method. The results also reveal the striking truth about the outdated … 2024-07-09T08:06:30Z 2024-07-09T08:06:30Z 2013-10 Conference paper Bài báo đăng trên tạp chí thuộc ISI, bao gồm book chapter https://scholar.dlu.edu.vn/handle/123456789/3526 vi The 2013 RIVF International Conference on Computing & Communication Technologies-Research, Innovation, and Vision for Future (RIVF) IEEE
institution Thư viện Trường Đại học Đà Lạt
collection Thư viện số
language Vietnamese
description Wikipedia has grown fast and become a major information resource for users as well as for many knowledge bases derived from it. However it is still edited manually while the world is changing rapidly. In this paper, we propose a method to detect outdated attribute values in Wikipedia infoboxes by using facts extracted from the general Web. Our proposed method extracts new information by combining pattern-based approach with entity-search-based approach to deal with the diversity of natural language presentation forms of facts on the Web. Our experimental results show that the achieved accuracies of the proposed method are 70% and 82% respectively on the chief-executive-officer attribute and the number-of-employees attribute in company infoboxes. It significantly improves the accuracy of the single pattern-based or entity-search-based method. The results also reveal the striking truth about the outdated …
format Conference paper
author Tran, Thong
Cao Hoang Tru
spellingShingle Tran, Thong
Cao Hoang Tru
A hybrid method for detecting outdated information in wikipedia infoboxes
author_facet Tran, Thong
Cao Hoang Tru
author_sort Tran, Thong
title A hybrid method for detecting outdated information in wikipedia infoboxes
title_short A hybrid method for detecting outdated information in wikipedia infoboxes
title_full A hybrid method for detecting outdated information in wikipedia infoboxes
title_fullStr A hybrid method for detecting outdated information in wikipedia infoboxes
title_full_unstemmed A hybrid method for detecting outdated information in wikipedia infoboxes
title_sort hybrid method for detecting outdated information in wikipedia infoboxes
publisher IEEE
publishDate 2024
url https://scholar.dlu.edu.vn/handle/123456789/3526
_version_ 1813142621326934016