Computer Science Faculty Publications
Information Fusion for Text Classification –– an Experimental Comparison
Document Type
Article
Abstract
This article reports on our experiments and results on the effectiveness of different feature sets and information fusion from some combinations of them in classifying free text documents into a given number of categories. We use different feature sets and integrate neural network learning into the method. The feature sets are based on the “latent semantics” of a reference library — a collection of documents adequately representing the desired concepts. We found that a larger reference library is not necessarily better. Information fusion almost always gives better results than the individual constituent feature sets, with certain combinations doing better than the others.
Repository Citation
Dasigi, Venu; Mann, Reinhold C.; and Protopopescu, Vladimir A., "Information Fusion for Text Classification –– an Experimental Comparison" (2001). Computer Science Faculty Publications. 4.
https://scholarworks.bgsu.edu/comp_sci_pub/4
Publication Date
2001
Publication Title
Pattern Recognition
DOI
https://doi.org/10.1016/S0031-3203(00)00171-0
Start Page No.
2413
End Page No.
2425