Overlapped Sound Event Classification via Multi-Channel Sound Separation Network
dc.creator | Giannoulis P., Potamianos G., Maragos P. | en |
dc.date.accessioned | 2023-01-31T07:42:13Z | |
dc.date.available | 2023-01-31T07:42:13Z | |
dc.date.issued | 2021 | |
dc.identifier | 10.23919/EUSIPCO54536.2021.9616131 | |
dc.identifier.isbn | 9789082797060 | |
dc.identifier.issn | 22195491 | |
dc.identifier.uri | http://hdl.handle.net/11615/72373 | |
dc.description.abstract | Overlapped sound event classification (SEC) can be a challenging task, especially in scenarios where the number of possible event classes or the number of simultaneous events occurring (polyphony level) are large. In such cases, the effective training of a multi-label SEC neural network can be challenging, as enough and diverse data need to be available for each of the combinatorially many possible event sets. To alleviate this problem, we examine in this paper the combination and joint training of a multi-channel sound source separation network with a multi-label SEC network. With the separation module acting as a pre-processing step, the task can be approximately reduced to isolated SEC, therefore avoiding the training complexity of overlapped scenarios. In addition, we introduce a multi-channel polyphony detection module that is trained to selectively apply the separation network only in overlapping instances during testing. We evaluate our approaches on a multi-channel dataset of overlapping sound events originating from 50 different classes. Under moderate reverberation conditions, the proposed method achieves up to 7.7% absolute improvement in terms of Fscore in the overlapped scenarios, compared to the baseline approach with traditional multi-label training. © 2021 European Signal Processing Conference. All rights reserved. | en |
dc.language.iso | en | en |
dc.source | European Signal Processing Conference | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85123176711&doi=10.23919%2fEUSIPCO54536.2021.9616131&partnerID=40&md5=6d9d824937189624e63223807a853ef4 | |
dc.subject | Separation | en |
dc.subject | Event class | en |
dc.subject | Multi channel | en |
dc.subject | Multi-labels | en |
dc.subject | Multichannel sounds | en |
dc.subject | Neural-networks | en |
dc.subject | Overlapping event | en |
dc.subject | Separation network | en |
dc.subject | Sound event classification | en |
dc.subject | Sound separation | en |
dc.subject | Universal sound separation | en |
dc.subject | Source separation | en |
dc.subject | European Signal Processing Conference, EUSIPCO | en |
dc.title | Overlapped Sound Event Classification via Multi-Channel Sound Separation Network | en |
dc.type | conferenceItem | en |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |