Classification of News Document in English Based on Ontology

Muflikhah, Lailil and Murdianto, Aldi Sunantyo Ali (2016) Classification of News Document in English Based on Ontology. British Journal of Applied Science & Technology, 17 (4). pp. 1-9. ISSN 22310843

[thumbnail of Muflikhah1742015BJAST27571.pdf] Text
Muflikhah1742015BJAST27571.pdf - Published Version

Download (162kB)

Abstract

Aims: This paper aims to propose ontology method of news document classification. The common method of document classification is based on morphology of term, without considering the meaning. It is impact to the number of term-document and computational time. Furthermore, the performance is decrease, even though the number of training data is increase.

Methodology: The main idea of ontology is to handle the similarity of terms that have different morphological form but the same meaning (synonym). The ontology is built using WordNet database to find similary of meaning among terms-document. The terms that have similar meaning are merged including their term frequency to be constructed in vector space model. After that, the unknown document is classified using cosine similarity measurement of the weight-term. The text document that is used is English news text in general topic, such as interest, money-fx, trade, and crude. The experiment is compared to the conventional method which is document classification without ontology.

Results: Classification of news document can be implemented using cosine similarity method based on ontology. The performance measure of this method including precission, recall and f-measure has increased eventhough the number of terms is reduced.

Item Type: Article
Subjects: Apsci Archives > Multidisciplinary
Depositing User: Unnamed user with email support@apsciarchives.com
Date Deposited: 15 Jun 2023 06:34
Last Modified: 15 Jan 2024 04:23
URI: http://eprints.go2submission.com/id/eprint/1151

Actions (login required)

View Item
View Item