Εμφάνιση απλής εγγραφής

dc.creatorThermos S., Papadopulos G.T., Daras P., Potamianos G.en
dc.date.accessioned2023-01-31T10:08:15Z
dc.date.available2023-01-31T10:08:15Z
dc.date.issued2018
dc.identifier10.1109/ICIP.2018.8451158
dc.identifier.isbn9781479970612
dc.identifier.issn15224880
dc.identifier.urihttp://hdl.handle.net/11615/79698
dc.description.abstractSensorimotor learning, namely the process of understanding the physical world by combining visual and motor information, has been recently investigated, achieving promising results for the task of 2D/3D object recognition. Following the recent trend in computer vision, powerful deep neural networks (NNs) have been used to model the 'sensory' and 'motor' information, namely the object appearance and affordance. However, the existing implementations cannot efficiently address the spatio-temporal nature of the human-object interaction. Inspired by recent work on attention-based learning, this paper introduces an attention-enhanced NN-based model that learns to selectively focus on parts of the physical interaction where the object appearance is corrupted by occlusions and deformations. The model's attention mechanism relies on the confidence of classifying an object based solely on its appearance. Three metrics are used to measure the latter, namely the prediction entropy, the average N-best likelihood difference, and the N-best likelihood dispersion. Evaluation of the attention-enhanced model on the SOR3D dataset reports 33% and 26% relative improvement over the appearance-only and the spatio-temporal fusion baseline models, respectively. © 2018 IEEE.en
dc.language.isoenen
dc.sourceProceedings - International Conference on Image Processing, ICIPen
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85062919222&doi=10.1109%2fICIP.2018.8451158&partnerID=40&md5=cbde6ad1ed54310419f5c2e98614802b
dc.subjectDeep neural networksen
dc.subjectAttention mechanismsen
dc.subjectAttention-based learningen
dc.subjectBaseline modelsen
dc.subjectHuman-object interactionen
dc.subjectNeural networks (NNS)en
dc.subjectObject appearanceen
dc.subjectPhysical interactionsen
dc.subjectSpatio-temporal fusionsen
dc.subjectObject recognitionen
dc.subjectIEEE Computer Societyen
dc.titleAttention-Enhanced Sensorimotor Object Recognitionen
dc.typeconferenceItemen


Αρχεία σε αυτό το τεκμήριο

ΑρχείαΜέγεθοςΤύποςΠροβολή

Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Εμφάνιση απλής εγγραφής