ATHENA: A Greek multi-sensory database for home automation control
dc.creator | Tsiami, A. | en |
dc.creator | Rodomagoulakis, I. | en |
dc.creator | Giannoulis, P. | en |
dc.creator | Katsamanis, A. | en |
dc.creator | Potamianos, G. | en |
dc.creator | Maragos, P. | en |
dc.date.accessioned | 2015-11-23T10:51:20Z | |
dc.date.available | 2015-11-23T10:51:20Z | |
dc.date.issued | 2014 | |
dc.identifier.issn | 2308457X | |
dc.identifier.uri | http://hdl.handle.net/11615/33882 | |
dc.description.abstract | In this paper we present a Greek speech database with real multi-modal data in a smart home two-room environment. In total, 20 speakers were recorded in 240 one-minute long sessions. The recordings include utterances of activation keywords and commands for home automation control, but also phonetically rich sentences and conversational speech. Audio, speaker movements and gestures were captured by 20 condenser microphones installed on the walls and ceiling, 6 MEMS microphones, 2 close-talk microphones and one Kinect camera. The new publicly available database exhibits adverse noise conditions because of background noises and acoustic events performed during the recordings to better approximate a realistic everyday home scenario. Thus, it is suitable for experimentation on voice activity and event detection, source localization, speech enhancement and far-field speech recognition. We present the details of the corpus as well as baseline results on multi-channel voice activity detection and spoken command recognition. Copyright © 2014 ISCA. | en |
dc.source.uri | http://www.scopus.com/inward/record.url?eid=2-s2.0-84910049750&partnerID=40&md5=15beb6851e805cf876b03bac7b3e2de7 | |
dc.subject | Data collection | en |
dc.subject | Smart homes | en |
dc.subject | Speech database | en |
dc.subject | Acoustic noise | en |
dc.subject | Audio recordings | en |
dc.subject | Automation | en |
dc.subject | Database systems | en |
dc.subject | Intelligent buildings | en |
dc.subject | Microphones | en |
dc.subject | Modal analysis | en |
dc.subject | Speech | en |
dc.subject | Speech communication | en |
dc.subject | Speech enhancement | en |
dc.subject | Command recognition | en |
dc.subject | Condenser microphone | en |
dc.subject | Conversational speech | en |
dc.subject | Source localization | en |
dc.subject | Voice activity detection | en |
dc.subject | Speech recognition | en |
dc.title | ATHENA: A Greek multi-sensory database for home automation control | en |
dc.type | conferenceItem | en |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |