dc.creator | Papadimitriou K., Potamianos G. | en |
dc.date.accessioned | 2023-01-31T09:42:22Z | |
dc.date.available | 2023-01-31T09:42:22Z | |
dc.date.issued | 2019 | |
dc.identifier | 10.23919/EUSIPCO.2019.8902541 | |
dc.identifier.isbn | 9789082797039 | |
dc.identifier.issn | 22195491 | |
dc.identifier.uri | http://hdl.handle.net/11615/77587 | |
dc.description.abstract | Fingerspelling is a crucial part of sign-based communication, however its recognition remains a challenging and mostly overlooked computer vision problem. To address it, this paper presents a system that recognizes the 24 static fingerspelled alphabet signs of the American Sign Language. The system consists of two algorithmic stages, comprising an efficient preprocessing phase that generates candidate hand-region proposals, followed by their deep-learning based classification. Specifically, the first stage exploits own earlier work on hand detection and segmentation in videos that also contain the signer's face, allowing face detection to drive skin-tone based hand segmentation, with motion further utilized to localize hands, extending it with a peak detection module that yields proposal regions likely to contain the signs of interest. These regions are then classified by a variant of a convolutional neural network that extends traditional convolutions to quadratic operations on the inputs, being, to our knowledge, the first application of such architecture to this task. Both system stages are evaluated on three well-known fingerspelling corpora, significantly outperforming a number of alternative approaches under both multi-signer and signer-independent experimental frameworks. © 2019 IEEE | en |
dc.language.iso | en | en |
dc.source | European Signal Processing Conference | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85075600100&doi=10.23919%2fEUSIPCO.2019.8902541&partnerID=40&md5=57b0bafe0e58c16de67eeb82bfea285c | |
dc.subject | Classification (of information) | en |
dc.subject | Convolution | en |
dc.subject | Deep learning | en |
dc.subject | Error detection | en |
dc.subject | Neural networks | en |
dc.subject | American sign language | en |
dc.subject | Computer vision problems | en |
dc.subject | Convolutional neural network | en |
dc.subject | Fingerspelling | en |
dc.subject | Hand detection | en |
dc.subject | Peak detection | en |
dc.subject | Preprocessing phase | en |
dc.subject | Sign recognition | en |
dc.subject | Face recognition | en |
dc.subject | European Signal Processing Conference, EUSIPCO | en |
dc.title | Fingerspelled alphabet sign recognition in upper-body videos | en |
dc.type | conferenceItem | en |