Εμφάνιση απλής εγγραφής

dc.creatorAkritidis L., Fevgas A., Bozanis P., Alamaniotis M.en
dc.date.accessioned2023-01-31T07:30:38Z
dc.date.available2023-01-31T07:30:38Z
dc.date.issued2019
dc.identifier10.1109/IISA.2019.8900751
dc.identifier.isbn9781728149592
dc.identifier.urihttp://hdl.handle.net/11615/70357
dc.description.abstractNews aggregators are on-line services that collect articles from numerous reputable media and news providers and reorganize them in a convenient manner with the aim of assisting their users to access the information they seek. One of the most important tools offered by news aggregators is based on the classification of the articles into a fixed set of categories. In this article, we introduce a supervised classification method for news articles that analyzes their titles and constructs multiple types of tokens including single words and n-grams of variable sizes. In the sequel, it employs several statistics, such as frequencies and token-class correlations, to assign two importance scores to each token. These scores reflect the ambiguity of a token; namely, how significant it is for the classification of an article to a category. The tokens and their scores are stored in a support structure that is subsequently used to classify the unlabeled articles. In addition, we propose a dimensionality reduction approach that reduces the size of the model without significant degradation of its classification performance. The algorithm is experimentally evaluated by employing a popular dataset of news articles and is found to outperform standard classification methods. © 2019 IEEE.en
dc.language.isoenen
dc.source10th International Conference on Information, Intelligence, Systems and Applications, IISA 2019en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85075873782&doi=10.1109%2fIISA.2019.8900751&partnerID=40&md5=5b0e0bd384208269cc1a902a46fbe672
dc.subjectData miningen
dc.subjectLearning systemsen
dc.subjectMachine learningen
dc.subjectSupervised learningen
dc.subjectClassification methodsen
dc.subjectClassification modelsen
dc.subjectClassification performanceen
dc.subjectDimensionality reductionen
dc.subjectNewsen
dc.subjectNews aggregatorsen
dc.subjectSupervised classificationen
dc.subjectSupport structuresen
dc.subjectClassification (of information)en
dc.subjectInstitute of Electrical and Electronics Engineers Inc.en
dc.titleA Self-Pruning Classification Model for Newsen
dc.typeconferenceItemen


Αρχεία σε αυτό το τεκμήριο

ΑρχείαΜέγεθοςΤύποςΠροβολή

Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Εμφάνιση απλής εγγραφής