Convolutional neural networks for toxic comment classification

Georgakopoulos S.V., Vrahatis A.G., Tasoulis S.K., Plagianakos V.P.

dc.creator	Georgakopoulos S.V., Vrahatis A.G., Tasoulis S.K., Plagianakos V.P.	en
dc.date.accessioned	2023-01-31T07:40:23Z
dc.date.available	2023-01-31T07:40:23Z
dc.date.issued	2018
dc.identifier	10.1145/3200947.3208069
dc.identifier.isbn	9781450364331
dc.identifier.uri	http://hdl.handle.net/11615/72074
dc.description.abstract	Flood of information is produced in a daily basis through the global internet usage arising from the online interactive communications among users. While this situation contributes significantly to the quality of human life, unfortunately it involves enormous dangers, since online texts with high toxicity can cause personal attacks, online harassment and bullying behaviors. This has triggered both industrial and research community in the last few years while there are several attempts to identify an efficient model for online toxic comment prediction. However, these steps are still in their infancy and new approaches and frameworks are required. On parallel, the data explosion that appears constantly, makes the construction of new machine learning computational tools for managing this information, an imperative need. Thankfully advances in hardware, cloud computing and big data management allow the development of Deep Learning approaches appearing very promising performance so far. For text classification in particular the use of Convolutional Neural Networks (CNN) have recently been proposed approaching text analytics in a modern manner emphasizing in the structure of words in a document. In this work, we employ this approach to discover toxic comments in a large pool of documents provided by a current Kaggle’s competition regarding Wikipedia’s talk page edits. To justify this decision we choose to compare CNNs against the traditional bag-of-words approach for text analysis combined with a selection of algorithms proven to be very effective in text classification. The reported results provide enough evidence that CNN enhance toxic comment classification reinforcing research interest towards this direction. © 2018 Association for Computing Machinery.	en
dc.language.iso	en	en
dc.source	ACM International Conference Proceeding Series	en
dc.source.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85052022425&doi=10.1145%2f3200947.3208069&partnerID=40&md5=7b890648c9dca80289581d5726f4aedc
dc.subject	Big data	en
dc.subject	Classification (of information)	en
dc.subject	Convolution	en
dc.subject	Data mining	en
dc.subject	Deep learning	en
dc.subject	Industrial research	en
dc.subject	Information management	en
dc.subject	Information retrieval systems	en
dc.subject	Neural networks	en
dc.subject	Tellurium compounds	en
dc.subject	Convolutional neural network	en
dc.subject	Embeddings	en
dc.subject	Text classification	en
dc.subject	Text mining	en
dc.subject	Word2vec	en
dc.subject	Text processing	en
dc.subject	Association for Computing Machinery	en
dc.title	Convolutional neural networks for toxic comment classification	en
dc.type	conferenceItem	en

Αρχεία σε αυτό το τεκμήριο

Αρχεία	Μέγεθος	Τύπος	Προβολή
Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ. [19735]

Εμφάνιση απλής εγγραφής