Record Details

USING WEKA FRAMEWORK IN DOCUMENT CLASSIFICATION

International Journal of Advanced Statistics and IT&c for Economics and Life Sciences

View Archive Info
 
 
Field Value
 
Title USING WEKA FRAMEWORK IN DOCUMENT CLASSIFICATION
 
Creator Crețulescu, Radu
Morariu, Daniel
Breazu, Macarie
 
Description Text document classification problem is a special case of a supervised data mining problem. In order to solve a text document classification problem some steps are required to fulfill. The common steps are: feature extraction, feature selection, classification, evaluation and visualization. The WEKA is a framework that helps us with all these steps. WEKA was initially developed as a library of java classes that help us to implement data mining applications. In the last years, in order to avoid java programming skills, the components from WEKA are also available into a visual form inside “WEKA Knowledge Flow Environment”. We have studied and present in this paper some of the most important visual components that are available in the WEKA framework for the previously presented steps. These components are: “Arff Loader”, “Attribute Selection”, “Normalize”, “Train Test Split Maker”, a lot of classifier algorithms, “Performance Evaluator” and “Text Viewer”. In order to prove the functionality of the visual framework in text document classification we have made and present some experiments. The most important advantage of the visual WEKA framework is the possibility to test different approaches without programming abilities.
 
Publisher Lucia Blaga University of Sibiu
 
Contributor
 
Date 2016-12-02
 
Type info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Peer-reviewed Article
 
Format application/pdf
 
Identifier http://magazines.ulbsibiu.ro/ijasitels/index.php/IJASITELS/article/view/11
 
Source International Journal of Advanced Statistics and IT&c for Economics and Life Sciences; Vol 6, No 2 (2016): IJASITELS
2067-354X
 
Language eng
 
Relation http://magazines.ulbsibiu.ro/ijasitels/index.php/IJASITELS/article/view/11/14
 
Rights Copyright (c) 2017 International Journal of Advanced Statistics and IT&c for Economics and Life Sciences