TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE

Bibliographic details
Main author: Lê, Thị Minh Nguyện
Format: Article
Language: English
Published: Trường Đại học Đà Lạt 2023
Online access: https://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/114322
https://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536
Holding library: Thư viện Trường Đại học Đà Lạt
Description
Abstract: The development of the Internet has increased the need for daily online information storage. Finding the information we are interested in takes a lot of time, so techniques for organizing and processing text data are needed. These techniques are called text classification or text categorization. There are many text classification methods; in this paper we study and apply the Support Vector Machine (SVM) method and compare its effectiveness with that of the probabilistic Naïve Bayes method. In addition, before classification, we preprocessed the training set by extracting keywords with dimensionality reduction techniques to reduce the time needed for classification.