Computer Science & Electrical

Computer Science & Electrical

Archive
Join as an Editor/Reviewer

Text Classification using KNN with different Features Selection Methods

Volume: 8  ,  Issue: 1 , July    Published Date: 23 July 2018
Publisher Name: IJRP
Views: 853  ,  Download: 546

Authors

# Author Name
1 Rajshree Jodha, Gaur Sanjay BC, K.R Chowdhary, Amit Mishra

Abstract

This paper presents a fast and efficient approach for text classification using KNN for different feature selection method. Typically, this approach evaluates the performance of the system for minimum number of features required to classify the text documents. 20 Newsgroup dataset collected by Ken Lang, have been taken to check performance of the KNN classifier algorithm. The above dataset is separated into two parts viz. training set (60%) and test set (40%).

  The KNN classifier has been implemented against the different number of stemmed and unstemmed features for CHI (Chi-Squared Statistic), IG (Information Gain) and MI (Mutual Information). The Accuracy, Precision, Recall and F1-Score are used to test the system.

Keywords

  • KNN
  • Text Classification