Mostrar el registro sencillo del ítem

dc.creatorGalatas, G.en
dc.creatorPotamianos, G.en
dc.creatorMakedon, F.en
dc.date.accessioned2015-11-23T10:26:54Z
dc.date.available2015-11-23T10:26:54Z
dc.date.issued2012
dc.identifier10.1145/2413097.2413100
dc.identifier.isbn9781450313001
dc.identifier.urihttp://hdl.handle.net/11615/27629
dc.description.abstractIn this paper we build on our recent work, where we successfully incorporated facial depth data of a speaker captured by the Microsoft Kinect device, as a third data stream in an audio-visual automatic speech recognizer. In particular, we focus our interest on whether the depth stream provides sufficient speech information that can improve system robustness to noisy audio-visual conditions, thus studying system operation beyond the traditional scenarios, where noise is applied to the audio signal alone. For this purpose, we consider four realistic visual modality degradations at various noise levels, and we conduct small-vocabulary recognition experiments on an appropriate, previously collected, audiovisual database. Our results demonstrate improved system performance due to the depth modality, as well as considerable accuracy increase, when using both the visual and depth modalities over audio only speech recognition.en
dc.source.urihttp://www.scopus.com/inward/record.url?eid=2-s2.0-84871979378&partnerID=40&md5=1c3fed620a063e661a88537e93fab25d
dc.subjectAudio-visual speech recognitionen
dc.subjectDepth informationen
dc.subjectMicrosoft Kinecten
dc.subjectVideo noiseen
dc.subjectAudio signalen
dc.subjectAudio visual speech recognitionen
dc.subjectAudio-visualen
dc.subjectAudio-visual databaseen
dc.subjectAutomatic speech recognizersen
dc.subjectData streamen
dc.subjectMicroSoften
dc.subjectNoise levelsen
dc.subjectSpeech informationen
dc.subjectSystem operationen
dc.subjectSystem robustnessen
dc.subjectVisual modalitiesen
dc.subjectAcoustic noiseen
dc.subjectAudio acousticsen
dc.subjectSpeech recognitionen
dc.titleAudio-visual speech recognition using depth information from the Kinect in noisy video conditionsen
dc.typeconferenceItemen


Ficheros en el ítem

FicherosTamañoFormatoVer

No hay ficheros asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem