TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE

The development of the Internet has increased the need for daily online information storage. Finding the correct information that we are interested in takes a lot of time, so the use of techniques for organizing and processing text data are needed. These techniques are called text classification or...

Popoln opis

Shranjeno v:
Bibliografske podrobnosti
Glavni avtor: Lê, Thị Minh Nguyện
Format: Bài viết
Jezik:English
Izdano: Trường Đại học Đà Lạt 2023
Online dostop:https://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/114322
https://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536
Oznake: Označite
Brez oznak, prvi označite!
Thư viện lưu trữ: Thư viện Trường Đại học Đà Lạt
Opis
Izvleček:The development of the Internet has increased the need for daily online information storage. Finding the correct information that we are interested in takes a lot of time, so the use of techniques for organizing and processing text data are needed. These techniques are called text classification or text categorization. There are many methods of text classification, but for this paper we study and apply the Support Vector Machine (SVM) method and compare its effect with the Naïve Bayes probability method. In addition, before implementing text classification, we performed preprocessing steps on the training set by extracting keywords with dimensional reduction techniques to reduce the time needed in the classification process.