Show simple item record

dc.creatorThermos S., Potamianos G.en
dc.date.accessioned2023-01-31T10:08:16Z
dc.date.available2023-01-31T10:08:16Z
dc.date.issued2017
dc.identifier10.1109/SLT.2016.7846321
dc.identifier.isbn9781509049035
dc.identifier.urihttp://hdl.handle.net/11615/79699
dc.description.abstractMotivated by increasing popularity of depth visual sensors, such as the Kinect device, we investigate the utility of depth information in audio-visual speech activity detection. A two-subject scenario is assumed, allowing to also consider speech overlap. Two sensory setups are employed, where depth video captures either a frontal or profile view of the subjects, and is subsequently combined with the corresponding planar video and audio streams. Further, multi-view fusion is regarded, using audio and planar video from a sensor at the complementary view setup. Support vector machines provide temporal speech activity classification for each visually detected subject, fusing the available modality streams. Classification results are further combined to yield speaker diarization. Experiments are reported on a suitable audio-visual corpus recorded by two Kinects. Results demonstrate the benefits of depth information, particularly in the frontal depth view setup, reducing speech activity detection and speaker diarization errors over systems that ignore it. © 2016 IEEE.en
dc.language.isoenen
dc.source2016 IEEE Workshop on Spoken Language Technology, SLT 2016 - Proceedingsen
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85016000416&doi=10.1109%2fSLT.2016.7846321&partnerID=40&md5=1d74fca461c773aea90f19566e4a0ef6
dc.subjectSpeechen
dc.subjectSpeech analysisen
dc.subjectAudio-visual fusionen
dc.subjectKinecten
dc.subjectSpeaker diarizationen
dc.subjectSpeech activity detectionsen
dc.subjectVisual depthen
dc.subjectSpeech recognitionen
dc.subjectInstitute of Electrical and Electronics Engineers Inc.en
dc.titleAudio-visual speech activity detection in a two-speaker scenario incorporating depth information from a profile or frontal viewen
dc.typeconferenceItemen


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record