This book presents a summary of the cognitively inspired basis behind multimodal speech enhancement, covering the relationship between audio and visual modalities in speech, as well as recent research into audiovisual speech correlation. A number of audiovisual speech filtering approaches that make use of this relationship are also discussed. A novel multimodal speech enhancement system, making use of both visual and audio information to filter speech, is presented, and this book explores the extension of this system with the use of fuzzy logic to demonstrate an initial implementation of an autonomous, adaptive, and context aware multimodal system. This work also discusses the challenges presented with regard to testing such a system, the limitations with many current audiovisual speech corpora, and discusses a suitable approach towards development of a corpus designed to test this novel, cognitively inspired, speech filtering system.                                                                                
Read more
This book presents a summary of the cognitively inspired basis behind multimodal speech enhancement, covering the relationship between audio and visual modalities in speech, as well as recent research into audiovisual speech correlation.
Read more
Introduction.-  Audio and visual speech relationship.-  The research context.- 4 A two stage multimodal speech enhancement system.-  Experiments, results, and analysis.-  Towards fuzzy logic based multimodal speech filtering.-  Evaluation of fuzzy logic proof of concept.-  Conclusions and future work.                                                                       
Read more
                                                                                                                             
                                                                                                                                    
State-of-the-art summary of multimodal speech filtering literature Novel interdisciplinary cognitive inspiration in audio and visual aspects of speech A novel approach to combining audio and visual speech filtering for real-world applications
Read more
GPSR Compliance The European Union's (EU) General Product Safety Regulation (GPSR) is a set of rules that requires consumer products to be safe and our obligations to ensure this. If you have any concerns about our products you can contact us on ProductSafety@springernature.com. In case Publisher is established outside the EU, the EU authorized representative is: Springer Nature Customer Service Center GmbH Europaplatz 3 69115 Heidelberg, Germany ProductSafety@springernature.com
Read more

Product details

ISBN
9783319135083
Published
2015-08-19
Publisher
Springer International Publishing AG
Height
235 mm
Width
155 mm
Age
Research, P, 06
Language
Product language
Engelsk
Format
Product format
Heftet

Biographical note