Future Speech Interfaces with Sensors and Machine Intelligence

Download Url(s)
https://mdpi.com/books/pdfview/book/6990Contributor(s)
Denby, Bruce (editor)
Gábor Csapó, Tamás (editor)
Wand, Michael (editor)
Language
EnglishAbstract
Speech is the most spontaneous and natural means of communication, as well as the preferred modality for interacting with mobile or fixed electronic devices, but speech in-terfaces have drawbacks, such as a lack of user privacy; non-inclusivity for certain users; poor robustness in noisy conditions; and the difficulty of creating complex man–machine interfaces. The Special Issue “Future Speech Interfaces with Sensors and Machine Intelligence” assembles eleven contributions that cover multimodal and silent speech interfaces; lip reading applications; novel sensors for speech interfaces; and enhanced speech inclusivity tools for future speech interfaces. The articles make important improvements beyond the state of the art, advancing the state of the art to new frontiers in some cases. Short summaries of all articles, grouped by topic, are presented, followed by a global commentary and evaluation.
Keywords
neural machine translation (NMT); transformer; Arabic dialects; modern standard Arabic; subword units; multi-head attention; shared vocabulary; self-attention; 3D densely connected CNN; 3D multi-layer feature fusion CNN; convolutional neural network; deep learning; lipreading; speech recognition; visual speech recognition; silent speech; continuous-wave radar; European Portuguese; machine learning; multimodal speech; lip reading; ultrasound tongue imaging; pose estimation; speech kinematics; keypoints; landmarks; audio-visual speech recognition; lip-reading; application programming interface; multi-modal interaction; deep neural networks; multi-view VSR; attention mechanism; spatial attention module; local self-attention; connectionist temporal classification; text-to-lip; speech synthesis; text-to-speech; speech-to-lip; zero-shot adaptation; generative models; artificial intelligence; objective measures; hybrid models; end-to-end recognition; reliability measures; decision fusion net; articulation-to-speech synthesis; silent speech interface; speaker adaption; voice conversion; audiovisual speech recognition; multimodal interaction; edutainment; virtual aquarium; speech processing; ultrasound imaging; silent speech interfaces; speech sensorsWebshop link
https://mdpi.com/books/pdfview ...ISBN
9783036569383, 9783036569390Publisher website
www.mdpi.com/booksPublication date and place
Basel, 2023Classification
Technology: general issues
History of engineering and technology