Εμφάνιση απλής εγγραφής

dc.creatorKolias V., Anagnostopoulos I., Kayafas E.en
dc.date.accessioned2023-01-31T08:43:36Z
dc.date.available2023-01-31T08:43:36Z
dc.date.issued2015
dc.identifier10.1109/BDC.2014.17
dc.identifier.isbn9781479918973
dc.identifier.urihttp://hdl.handle.net/11615/74971
dc.description.abstractWith the ever increasing production of data from various heterogeneous sources in modern information societies, the need for scalable data-intensive processing is increasing. MapReduce quickly became the de facto framework for large scale data analysis, due to its simple and abstract programming model and its efficient underlying execution system. However, this simplicity comes with a price: its unidirectional communication model and the lack of support for iterations, makes repeated querying of datasets difficult and imposes limitations in many fields including Machine Learning. In this paper we describe the implementation of a classification rule induction algorithm based on MapReduce, with the aim of building a classification model within as few iterations as possible. After a thorough description of the algorithm, we evaluate its performance from three perspectives: its accuracy, its parallel performance and the communication costs. The evaluations indicate that the approach is scalable and since it produces a comprehensive human-readable model it can be proven valuable for a wide range of applications. © 2014 IEEE.en
dc.language.isoenen
dc.sourceProceedings - 2014 International Symposium on Big Data Computing, BDC 2014en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84962826277&doi=10.1109%2fBDC.2014.17&partnerID=40&md5=8a1747ac9f0af14709226115d5d1f893
dc.subjectAlgorithmsen
dc.subjectArtificial intelligenceen
dc.subjectClassification (of information)en
dc.subjectData handlingen
dc.subjectLearning systemsen
dc.subjectClassification modelsen
dc.subjectCommunication modelingen
dc.subjectHeterogeneous sourcesen
dc.subjectIncreasing productionen
dc.subjectLarge-scale data analysisen
dc.subjectMap-reduceen
dc.subjectParallel performanceen
dc.subjectRule inductionen
dc.subjectBig dataen
dc.subjectInstitute of Electrical and Electronics Engineers Inc.en
dc.titleA Covering Classification Rule Induction Approach for Big Datasetsen
dc.typeconferenceItemen


Αρχεία σε αυτό το τεκμήριο

ΑρχείαΜέγεθοςΤύποςΠροβολή

Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Εμφάνιση απλής εγγραφής