dc.creator | Nebaba A.N., Savvas I.K., Butakova M.A., Chernov A.V., Shevchuk P.S. | en |
dc.date.accessioned | 2023-01-31T09:40:05Z | |
dc.date.available | 2023-01-31T09:40:05Z | |
dc.date.issued | 2021 | |
dc.identifier | 10.1145/3501774.3501789 | |
dc.identifier.isbn | 9781450385060 | |
dc.identifier.uri | http://hdl.handle.net/11615/77130 | |
dc.description.abstract | Machine learning approaches and algorithms are spreading in wide areas in research and technology. Cybersecurity breaches are the common anomalies for networked and distributed infrastructures which are monitored, registered, and described carefully. However, the description of each security breaches episode and its classification is still a difficult problem, especially in highly complex telecommunication infrastructure. Railway information infrastructure usually has a large scale and large diversity of possible security breaches. Today's situation shows the registering of the security breaches has a mature and stable character, but the problem of their automated classification is not solved completely. Many studies on security breaches multiclass classification show inadequate accuracy of classification. We investigated the origins of this problem and suggested the possible roots consist in disbalance the datasets used for machine learning multiclass classification. Thus, we proposed an approach to improve the accuracy of the classification and verified our approach on the really collected datasets with cybersecurity breaches in railway telecommunication infrastructure. We analyzed the results of applying three imbalanced learning methodologies, namely random oversampling, synthetic minority oversampling technique, and the last one with Tomek links. We have implemented three machine learning algorithms, namely Naïve Bayes, K-means, and support vector machine, on disbalances and balanced data to estimate imbalance learning methodologies with comparing results. The proposed approach demonstrated the increase of the accuracy for multiclass classification in the range from 30 to 41%, depending on the imbalanced learning technique. © 2021 ACM. | en |
dc.language.iso | en | en |
dc.source | ACM International Conference Proceeding Series | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85127530847&doi=10.1145%2f3501774.3501789&partnerID=40&md5=43301b3167dcc14bd599ced9aa7fd1d4 | |
dc.subject | Classification (of information) | en |
dc.subject | Cybersecurity | en |
dc.subject | K-means clustering | en |
dc.subject | Learning algorithms | en |
dc.subject | Railroad transportation | en |
dc.subject | Railroads | en |
dc.subject | Cyber security | en |
dc.subject | Cybersecurity breach | en |
dc.subject | Disbalance | en |
dc.subject | Imbalanced Learning | en |
dc.subject | Machine learning algorithms | en |
dc.subject | Machine learning approaches | en |
dc.subject | Multi-class classification | en |
dc.subject | Railway infrastructure | en |
dc.subject | Security breaches | en |
dc.subject | Telecommunications infrastructures | en |
dc.subject | Support vector machines | en |
dc.subject | Association for Computing Machinery | en |
dc.title | Improving Multiclass Classification of Cybersecurity Breaches in Railway Infrastructure using Imbalanced Learning | en |
dc.type | conferenceItem | en |