Browse by subject "Discrete cosine transforms"
Showing items 1-3 of 3
-
Audio-visual speech recognition incorporating facial depth information captured by the Kinect
(2012) We investigate the use of facial depth data of a speaking subject, captured by the Kinect device, as an additional speech-informative modality to incorporate into a traditional audiovisual automatic speech recognizer. We ... -
Robust multi-modal speech recognition in two languages utilizing video and distance information from the Kinect
(2013) We investigate the performance of our audio-visual speech recognition system in both English and Greek under the influence of audio noise. We present the architecture of our recently built system that utilizes information ... -
Scattering vs. Discrete Cosine Transform Features in Visual Speech Processing
(2015) Appearance-based feature extraction constitutes the dominant approach for visual speech representation in a variety of problems, such as automatic speechreading, visual speech detection, and others. To obtain the necessary ...