Show simple item record

dc.creatorPapadimitriou K., Potamianos G.en
dc.date.accessioned2023-01-31T09:42:21Z
dc.date.available2023-01-31T09:42:21Z
dc.date.issued2020
dc.identifier10.21437/Interspeech.2020-2691
dc.identifier.issn2308457X
dc.identifier.urihttp://hdl.handle.net/11615/77586
dc.description.abstractIn this paper we address the challenging problem of sign language recognition (SLR) from videos, introducing an end-to-end deep learning approach that relies on the fusion of a number of spatio-temporal feature streams, as well as a fully convolutional encoder-decoder for prediction. Specifically, we examine the contribution of optical flow, human skeletal features, as well as appearance features of handshapes and mouthing, in conjunction with a temporal deformable convolutional attention-based encoder-decoder for SLR. To our knowledge, this is the first use in this task of a fully convolutional multi-step attention-based encoder-decoder employing temporal deformable convolutional block structures. We conduct experiments on three sign language datasets and compare our approach to existing state-of-the-art SLR methods, demonstrating its superiority. © 2020 ISCAen
dc.language.isoenen
dc.sourceProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECHen
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85098115814&doi=10.21437%2fInterspeech.2020-2691&partnerID=40&md5=e02589f0f9e04137218118f5947adef4
dc.subjectComputer hardware description languagesen
dc.subjectDecodingen
dc.subjectDeep learningen
dc.subjectDeformationen
dc.subjectOptical flowsen
dc.subjectSignal encodingen
dc.subjectSpeech communicationen
dc.subjectBlock structuresen
dc.subjectConvolutional encodersen
dc.subjectEncoder-decoderen
dc.subjectLearning approachen
dc.subjectSequence learningen
dc.subjectSign Language recognitionen
dc.subjectSpatio temporal featuresen
dc.subjectState of the arten
dc.subjectConvolutionen
dc.subjectInternational Speech Communication Associationen
dc.titleMultimodal sign language recognition via temporal deformable convolutional sequence learningen
dc.typeconferenceItemen


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record