
dc.creator | Giannoulis P., Potamianos G., Maragos P. | en
dc.date.accessioned | 2023-01-31T07:42:13Z |
dc.date.available | 2023-01-31T07:42:13Z |
dc.date.issued | 2019 |
dc.identifier | 10.1186/s13636-019-0158-8 |
dc.identifier.issn | 16874714 |
dc.identifier.uri | http://hdl.handle.net/11615/72375 |
dc.description.abstract | Voice-enabled interaction systems in domestic environments have attracted significant interest recently, being the focus of smart home research projects and commercial voice assistant home devices. Within the multi-module pipelines of such systems, speech activity detection (SAD) constitutes a crucial component, providing input to their activation and speech recognition subsystems. In typical multi-room domestic environments, SAD may also convey spatial intelligence to the interaction, in addition to its traditional temporal segmentation output, by assigning speech activity at the room level. Such room-localized SAD can, for example, disambiguate user command referents, allow localized system feedback, and enable parallel voice interaction sessions by multiple subjects in different rooms. In this paper, we investigate a room-localized SAD system for smart homes equipped with multiple microphones distributed in multiple rooms, significantly extending our earlier work. The system employs a two-stage algorithm, incorporating a set of hand-crafted features specially designed to discriminate room-inside vs. room-outside speech at its second stage, refining SAD hypotheses obtained at its first stage by traditional statistical modeling and acoustic front-end processing. Both algorithmic stages exploit multi-microphone information, combining it at the signal, feature, or decision level. The proposed approach is extensively evaluated on both simulated and real data recorded in a multi-room, multi-microphone smart home, significantly outperforming alternative baselines. Further, it remains robust to reduced microphone setups, while also comparing favorably to deep learning-based alternatives. © 2019, The Author(s). | en
dc.language.iso | en | en
dc.source | Eurasip Journal on Audio, Speech, and Music Processing | en
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85071630220&doi=10.1186%2fs13636-019-0158-8&partnerID=40&md5=57b484838f86afd53c59ec2bf9e83a70 |
dc.subject | Automation | en
dc.subject | Deep learning | en
dc.subject | Intelligent buildings | en
dc.subject | Microphones | en
dc.subject | Refining | en
dc.subject | Speech | en
dc.subject | Active room selection | en
dc.subject | Microphone arrays | en
dc.subject | Multi channel | en
dc.subject | Smart homes | en
dc.subject | Speech activity detections | en
dc.subject | Speech recognition | en
dc.subject | Springer International Publishing | en
dc.title | Room-localized speech activity detection in multi-microphone smart homes | en
dc.type | journalArticle | en


Files in this item

Files | Size | Format | View

There are no files associated with this item.
