VIETNAMESE TEXT EXTRACTION FROM BOOK COVERS

Automatic information extraction from images reduces cost and human intervention and enables timely processing. Converting printed book covers to machine-readable text for later automated processing would be useful to a wide range of users, such as librarians, bookshop keepers, and individuals. In this paper,...

Bibliographic details
Main authors: Phan, Thị Thanh Nga; Nguyễn, Thị Huyền Trang; Nguyễn, Văn Phúc; Thái, Duy Quý; Võ, Phương Bình
Format: Article
Language: English
Published: Trường Đại học Đà Lạt 2023
Online access: https://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/234
https://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/114239
Holding library: Thư viện Trường Đại học Đà Lạt
id oai:scholar.dlu.edu.vn:DLU123456789-114239
record_format dspace
spelling oai:scholar.dlu.edu.vn:DLU123456789-114239
2023-10-27T14:43:53Z
2023-03-04T08:23:12Z
2017
Article
0866-787X
10.37569/DalatUniversity.7.2.234(2017)
en
Tạp chí Khoa học Đại học Đà Lạt, Vol. 7, No. 2; pp. 142-152
application/pdf
institution Thư viện Trường Đại học Đà Lạt
collection Thư viện số
language English
description Automatic information extraction from images reduces cost and human intervention and enables timely processing. Converting printed book covers to machine-readable text for later automated processing would be useful to a wide range of users, such as librarians, bookshop keepers, and individuals. In this paper, we present a novel method for extracting Vietnamese text from images of scanned book covers. The proposed system accepts a snapshot of a book cover, filters the input image to enhance its quality, locates the regions containing text, and then uses an optical character recognizer (OCR) to extract the text. The last step filters the extracted text with the aid of a dictionary to produce the final text result. Experiments with the proposed system on our dataset delivered encouraging results.
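The abstract does not say how the final dictionary-based filtering step works, so the following is only a minimal sketch of one plausible approach, not the authors' method. It assumes the OCR output arrives as a plain string and uses fuzzy matching from Python's standard `difflib` module against a word list. The names `DICTIONARY`, `normalize`, `correct_token`, and `filter_ocr_text`, the tiny word list, and the `cutoff` value are all hypothetical and chosen for illustration.

```python
import difflib
import unicodedata

# Hypothetical mini-dictionary (assumption); a real system would load
# a full Vietnamese word list instead.
DICTIONARY = ["sách", "giáo", "khoa", "toán", "học", "tiếng", "việt"]


def normalize(token):
    """Lowercase a token and compose its diacritics (Unicode NFC)."""
    return unicodedata.normalize("NFC", token).lower()


def correct_token(token, dictionary=DICTIONARY, cutoff=0.6):
    """Return the closest dictionary word, or None if nothing is close enough."""
    words = [normalize(w) for w in dictionary]
    word = normalize(token)
    if word in words:
        return word
    # get_close_matches ranks candidates by SequenceMatcher similarity ratio.
    matches = difflib.get_close_matches(word, words, n=1, cutoff=cutoff)
    return matches[0] if matches else None


def filter_ocr_text(raw_text, dictionary=DICTIONARY, cutoff=0.6):
    """Keep only tokens that match, or nearly match, a dictionary word."""
    kept = []
    for token in raw_text.split():
        fixed = correct_token(token, dictionary, cutoff)
        if fixed is not None:
            kept.append(fixed)
    return " ".join(kept)
```

For example, `filter_ocr_text("Sách ### giáo")` drops the noise token and keeps the two dictionary words. This sketch lowercases everything and discards unmatched tokens, which would lose capitalization and out-of-dictionary proper names such as author names; a practical system would need to handle those cases separately.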
format Article
author Phan, Thị Thanh Nga
Nguyễn, Thị Huyền Trang
Nguyễn, Văn Phúc
Thái, Duy Quý
Võ, Phương Bình
title VIETNAMESE TEXT EXTRACTION FROM BOOK COVERS
publisher Trường Đại học Đà Lạt
publishDate 2023
url https://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/234
https://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/114239
_version_ 1819837997722370048