VIETNAMESE TEXT EXTRACTION FROM BOOK COVERS

Automatic information extraction from images reduces the cost, human interference, and timely processing. Converting printed book covers to readable text for later automation process would be useful for a wide range of users such as librarians, bookshop keepers, and individual users. In this paper,...

Mô tả đầy đủ

Đã lưu trong:
Chi tiết về thư mục
Những tác giả chính: Phan, Thị Thanh Nga, Nguyễn, Thị Huyền Trang, Nguyễn, Văn Phúc, Thái, Duy Quý, Võ, Phương Bình
Định dạng: Bài viết
Ngôn ngữ:English
Được phát hành: Trường Đại học Đà Lạt 2023
Truy cập trực tuyến:https://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/234
https://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/114239
Các nhãn: Thêm thẻ
Không có thẻ, Là người đầu tiên thẻ bản ghi này!
Thư viện lưu trữ: Thư viện Trường Đại học Đà Lạt
Miêu tả
Tóm tắt:Automatic information extraction from images reduces the cost, human interference, and timely processing. Converting printed book covers to readable text for later automation process would be useful for a wide range of users such as librarians, bookshop keepers, and individual users. In this paper, we present a novel method for the Vietnamese text extraction from images of scanned book covers. The proposed system accepts the book covers snapshot, filters the input image for an enhancement of quality, locates the regions with text, then utilizes the optical character recognizer (OCR) to extract the text. The last step is to filter the extracted text in accompany with at dictionary to achieve the final text result. Carrying out the experiments with the proposed system using our dataset delivered encouraging experimental results.