Time-Domain Beamforming and Blind Source Separation: Speech Input in the Car Environment: 3 2009th Edition
Author(s): Julien Bourgeois (Author), Wolfgang Minker (Author)
Publisher: Springer
Publication Date: 14 April 2009
Edition: 2009th
Language: English
Print length: 237 pages
ISBN-10: 9780387688350
ISBN-13: 9780387688350
Book Description
The development of computer and telecommunication technologies led to a revolutioninthewaythatpeopleworkandcommunicatewitheachother.One of the results is that large amount of information will increasingly be held in a form that is natural for users, as speech in natural language. In the presented work, we investigate the speech signal capture problem, which includes the separation of multiple interfering speakers using microphone arrays. Adaptive beamforming is a classical approach which has been developed since the seventies. However it requires a double-talk detector (DTD) that interrupts the adaptation when the target is active, since otherwise target cancelation occurs. The fact that several speakers may be active simulta- ouslymakesthisdetectiondi?cult,andifadditionalbackgroundnoiseoccurs, even less reliable. Our proposed approaches address this separation problem using continuous, uninterrupted adaptive algorithms. The advantage seems twofold:Firstly,thealgorithmdevelopmentismuchsimplersincenodetection mechanism needs to be designed and no threshold is to be tuned. Secondly, the performance may be improved due to the adaptation during periods of double-talk. In the ?rst part of the book, we investigate a modi?cation of the widely usedNLMSalgorithm,termedImplicitLMS(ILMS),whichimplicitlyincludes an adaptation control and does not require any threshold. Experimental ev- uations reveal that ILMS mitigates the target signal cancelation substantially with the distributed microphone array. However, in the more di?cult case of the compact microphone array, this algorithm does not su?ciently reduce the target signal cancelation. In this case, more sophisticated blind source se- ration techniques (BSS) seem necessary.
Editorial Reviews
From the Back Cover
Time-Domain Beamforming and Blind Source Separation addresses the problem of separating spontaneous multi-party speech by way of microphone arrays (beamformers) and adaptive signal processing techniques. While existing techniques require a Double-Talk Detector (DTD) that interrupts the adaptation when the target is active, the described method addresses the separation problem using continuous, uninterrupted adaptive algorithms. With this approach, algorithm development is much simpler since no detection mechanism needs to be designed and needs no threshold to be tuned. Also, performance can be improved due to the adaptation during periods of double-talk.
The authors use two techniques to achieve these results: implicit beamforming, which requires the position of the target speaker to be known; and time-domain blind-source separation (BSS), which exploits second-order statistics of the source signals. In combination, beamforming and BSS can be used to develop novel algorithms. Emphasis is placed on the development of an algorithm that combines the benefits of both approaches. The book presents experimental results obtained with real in-car microphone recordings involving simultaneous speech of the driver and the co-driver. In addition, experiments with background noise have been carried out in order to assess the robustness of the considered methods in noisy conditions.