IJERT-EMS
IJERT-EMS

A Simple Approach for Scientific Document Categorization


A Simple Approach for Scientific Document Categorization
Authors : Arlina D'cunha, Dr. A. K. Sen
Publication Date: 03-09-2015

Authors

Author(s):  Arlina D'cunha, Dr. A. K. Sen

Published in:   International Journal of Engineering Research & Technology

License:  This work is licensed under a Creative Commons Attribution 4.0 International License.

Website: www.ijert.org

Volume/Issue:   Volume. 4 - Issue. 09 , September - 2015

e-ISSN:   2278-0181

 DOI:  http://dx.doi.org/10.17577/IJERTV4IS090027

Abstract

Classification is the alignment of data or items in predefined labeled groups based on resemblances. Exponential progression amount of scientific documents leads to uncontrollable physical classification. Feature extraction is the crucial condition of automatic document classification. TF-IDF (term frequency-inverse document frequency) is frequently used to represent the text feature weight. This paper proposes a new yet simple feature weighting scheme by modifying TF-IDF formula. The experimental results show that the modified method improves the accuracy and other parameters.

Citations

Number of Citations for this article:  Data not Available

Keywords

Key Word(s):    

Downloads

Number of Downloads:     133
Similar-Paper

7   Paper(s) Found related to your topic:    

Call for Papers - May - 2017

        

 

                 Call for Thesis - 2017 

     Publish your Ph.D/Master's Thesis Online

              Publish Ph.D Master Thesis Online as Book