Browse by subject "Discrete cosine transforms"

Showing items 1-3 of 3

Audio-visual speech recognition incorporating facial depth information captured by the Kinect

Galatas, G.; Potamianos, G.; Makedon, F. (2012)

We investigate the use of facial depth data of a speaking subject, captured by the Kinect device, as an additional speech-informative modality to incorporate into a traditional audiovisual automatic speech recognizer. We ...

Robust multi-modal speech recognition in two languages utilizing video and distance information from the Kinect

Galatas, G.; Potamianos, G.; Makedon, F. (2013)

We investigate the performance of our audio-visual speech recognition system in both English and Greek under the influence of audio noise. We present the architecture of our recently built system that utilizes information ...

Scattering vs. Discrete Cosine Transform Features in Visual Speech Processing

Marcheret, E.; Potamianos, G.; Vopicka, J.; Goel, V. (2015)

Appearance-based feature extraction constitutes the dominant approach for visual speech representation in a variety of problems, such as automatic speechreading, visual speech detection, and others. To obtain the necessary ...