Zur Kurzanzeige

dc.creatorGiannoulis P., Potamianos G., Maragos P.en
dc.date.accessioned2023-01-31T07:42:13Z
dc.date.available2023-01-31T07:42:13Z
dc.date.issued2021
dc.identifier10.23919/EUSIPCO54536.2021.9616131
dc.identifier.isbn9789082797060
dc.identifier.issn22195491
dc.identifier.urihttp://hdl.handle.net/11615/72373
dc.description.abstractOverlapped sound event classification (SEC) can be a challenging task, especially in scenarios where the number of possible event classes or the number of simultaneous events occurring (polyphony level) are large. In such cases, the effective training of a multi-label SEC neural network can be challenging, as enough and diverse data need to be available for each of the combinatorially many possible event sets. To alleviate this problem, we examine in this paper the combination and joint training of a multi-channel sound source separation network with a multi-label SEC network. With the separation module acting as a pre-processing step, the task can be approximately reduced to isolated SEC, therefore avoiding the training complexity of overlapped scenarios. In addition, we introduce a multi-channel polyphony detection module that is trained to selectively apply the separation network only in overlapping instances during testing. We evaluate our approaches on a multi-channel dataset of overlapping sound events originating from 50 different classes. Under moderate reverberation conditions, the proposed method achieves up to 7.7% absolute improvement in terms of Fscore in the overlapped scenarios, compared to the baseline approach with traditional multi-label training. © 2021 European Signal Processing Conference. All rights reserved.en
dc.language.isoenen
dc.sourceEuropean Signal Processing Conferenceen
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85123176711&doi=10.23919%2fEUSIPCO54536.2021.9616131&partnerID=40&md5=6d9d824937189624e63223807a853ef4
dc.subjectSeparationen
dc.subjectEvent classen
dc.subjectMulti channelen
dc.subjectMulti-labelsen
dc.subjectMultichannel soundsen
dc.subjectNeural-networksen
dc.subjectOverlapping eventen
dc.subjectSeparation networken
dc.subjectSound event classificationen
dc.subjectSound separationen
dc.subjectUniversal sound separationen
dc.subjectSource separationen
dc.subjectEuropean Signal Processing Conference, EUSIPCOen
dc.titleOverlapped Sound Event Classification via Multi-Channel Sound Separation Networken
dc.typeconferenceItemen


Dateien zu dieser Ressource

DateienGrößeFormatAnzeige

Zu diesem Dokument gibt es keine Dateien.

Das Dokument erscheint in:

Zur Kurzanzeige