Show simple item record

dc.creator: Tsiami A., Filntisis P.P., Efthymiou N., Koutras P., Potamianos G., Maragos P. [en]
dc.date.accessioned: 2023-01-31T10:13:00Z
dc.date.available: 2023-01-31T10:13:00Z
dc.date.issued: 2018
dc.identifier: 10.1109/ICASSP.2018.8462425
dc.identifier.isbn: 9781538646588
dc.identifier.issn: 15206149
dc.identifier.uri: http://hdl.handle.net/11615/79911
dc.description.abstract: Human-robot interaction (HRI) is a research area of growing interest with a multitude of applications for both children and adult user groups, as, for example, in edutainment and social robotics. Crucial, however, to its wider adoption remains the robust perception of HRI scenes in natural, untethered, and multi-party interaction scenarios, across user groups. Towards this goal, we investigate three focal HRI perception modules operating on data from multiple audio-visual sensors that observe the HRI scene from the far-field, thus bypassing limitations and platform-dependency of contemporary robotic sensing. In particular, the developed modules fuse intra- and/or inter-modality data streams to perform: (i) audio-visual speaker localization; (ii) distant speech recognition; and (iii) visual recognition of hand-gestures. Emphasis is also placed on ensuring high speech and gesture recognition rates for both children and adults. Development and objective evaluation of the three modules is conducted on a corpus of both user groups, collected by our far-field multisensory setup, for an interaction scenario of a question-answering 'guess-the-object' collaborative HRI game with a 'Furhat' robot. In addition, evaluation of the game incorporating the three developed modules is reported. Our results demonstrate robust far-field audio-visual perception of the multi-party HRI scene. © 2018 IEEE. [en]
dc.language.iso: en [en]
dc.source: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings [en]
dc.source.uri: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85054223151&doi=10.1109%2fICASSP.2018.8462425&partnerID=40&md5=a530a777fc2b48a9faad3889cf368139
dc.subject: Institute of Electrical and Electronics Engineers Inc. [en]
dc.title: Far-field audio-visual scene perception of multi-party human-robot interaction for children and adults [en]
dc.type: conferenceItem [en]


Files in this item


There are no files associated with this item.

This item appears in the following collections
