Browsing by Subject "Encoder-decoder"

Results 1-3 of 3

Exploiting 3D Hand Pose Estimation in Deep Learning-Based Sign Language Recognition from RGB Videos

Parelli M., Papadimitriou K., Potamianos G., Pavlakos G., Maragos P. (2020)

In this paper, we investigate the benefit of 3D hand skeletal information to the task of sign language (SL) recognition from RGB videos, within a state-of-the-art, multiple-stream, deep-learning recognition system. As most ...

Joint Object Affordance Reasoning and Segmentation in RGB-D Videos

Thermos S., Potamianos G., Daras P. (2021)

Understanding human-object interaction is a fundamental challenge in computer vision and robotics. Crucial to it is the ability to infer 'object affordances' from visual data, namely the types of interaction supported by ...

Multimodal sign language recognition via temporal deformable convolutional sequence learning

Papadimitriou K., Potamianos G. (2020)

In this paper we address the challenging problem of sign language recognition (SLR) from videos, introducing an end-to-end deep learning approach that relies on the fusion of a number of spatio-temporal feature streams, ...