
dc.creator: Spyrou E., Vernikos I., Nikopoulou R., Mylonas P.
dc.date.accessioned: 2023-01-31T10:01:35Z
dc.date.available: 2023-01-31T10:01:35Z
dc.date.issued: 2019
dc.identifier: 10.1109/IISA.2018.8633644
dc.identifier.isbn: 9781538637319
dc.identifier.uri: http://hdl.handle.net/11615/79346
dc.description.abstract: One of the most important issues in several aspects of human-computer interaction is understanding the user's emotional state. In applications such as monitoring humans in assistive living environments, or assessing students' affective state during a course, it is imperative to use an unobtrusive method, so as to avoid discomforting or distracting the user. Thus, one should opt for approaches that use either visual or audio sensors, which may observe users without any kind of direct contact. In this work, our goal is to recognize the emotional state of humans using only the non-linguistic aspect of speech, i.e., the acoustic properties of the signal. Therefore, we propose an emotion classification approach based on the bag-of-visual-words model, which has previously been applied in many computer vision tasks. A given audio segment is transformed into a spectrogram, i.e., a visual representation of its spectrum. From this representation we first extract SURF features and, using a previously constructed visual vocabulary, quantize them into a set of visual words. A histogram is then constructed per image; these feature vectors are used to train SVM classifiers. Our approach is evaluated using a) 3 publicly available datasets that contain speech from different languages and b) a custom dataset constructed during a real-life classroom experiment involving middle-school students. ©2018 IEEE
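The quantization step the abstract describes — mapping local descriptors extracted from a spectrogram onto a fixed visual vocabulary and counting occurrences per image — can be sketched as below. This is a minimal illustration, not the authors' code: `build_bovw_histogram` is a hypothetical helper, the vocabulary is assumed to have been built beforehand (e.g. by k-means over training descriptors), and SURF extraction itself is left out.

```python
import numpy as np

def build_bovw_histogram(descriptors, vocabulary):
    """Quantize local descriptors (n x d) against a visual vocabulary
    (k x d) and return a unit-sum bag-of-visual-words histogram (k,)."""
    # Euclidean distance from every descriptor to every visual word
    dists = np.linalg.norm(
        descriptors[:, None, :] - vocabulary[None, :, :], axis=2
    )
    # Assign each descriptor to its nearest visual word
    words = np.argmin(dists, axis=1)
    # Count word occurrences and normalize to a histogram
    hist = np.bincount(words, minlength=len(vocabulary)).astype(float)
    return hist / hist.sum()
```

The resulting fixed-length histograms are what a standard SVM implementation could then be trained on, one histogram per spectrogram.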
dc.language.iso: en
dc.source: 2018 9th International Conference on Information, Intelligence, Systems and Applications, IISA 2018
dc.source.uri: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85062861017&doi=10.1109%2fIISA.2018.8633644&partnerID=40&md5=013f27a62a341e06db33c59b306230e7
dc.subject: Acoustic properties
dc.subject: Classification (of information)
dc.subject: Human computer interaction
dc.subject: Linguistics
dc.subject: Students
dc.subject: Bag-of-visual-words
dc.subject: Emotion classification
dc.subject: Human emotion recognition
dc.subject: Linguistic approach
dc.subject: Middle school students
dc.subject: Speech information
dc.subject: Visual representations
dc.subject: Visual vocabularies
dc.subject: Speech recognition
dc.subject: Institute of Electrical and Electronics Engineers Inc.
dc.title: A non-linguistic approach for human emotion recognition from speech
dc.type: conferenceItem


Files in this item


There are no files associated with this item.
