Blind Speech Separation 2007th Edition

Blind Speech Separation 2007th Edition book cover

Blind Speech Separation 2007th Edition

Author(s): Shoji Makino (Editor), Te-Won Lee (Editor), Hiroshi Sawada (Editor)

  • Publisher: Springer
  • Publication Date: September 20, 2007
  • Edition: 2007th
  • Language: English
  • Print length: 448 pages
  • ISBN-10: 1402064780
  • ISBN-13: 9781402064784

Book Description

We are surrounded by sounds. Such a noisy environment makes it di?cult to obtain desired speech and it is di?cult to converse comfortably there. This makes it important to be able to separate and extract a target speech signal from noisy observations for both man–machine and human–human communication. Blindsourceseparation(BSS)isanapproachforestimatingsourcesignals using only information about their mixtures observed in each input channel. The estimation is performed without possessing information on each source, such as its frequency characteristics and location, or on how the sources are mixed. The use of BSS in the development of comfortable acoustic com- nication channels between humans and machines is widely accepted. Some books have been published on BSS, independent component ana- sis (ICA), and related subjects. There, ICA-based BSS has been well studied in the statistics and information theory ?elds, for applications to a variety of disciplines including wireless communication and biomedicine. However, as speech and audio signal mixtures in a real reverberant environment are generally convolutive mixtures, they involve a structurally much more ch- lenging task than instantaneous mixtures, which are prevalent in many other applications.

Editorial Reviews

From the Back Cover

This is the first book to provide a cutting edge reference to the fascinating topic of blind source separation (BSS) for convolved speech mixtures. Through contributions by the foremost experts on the subject, the book provides an up-to-date account of research findings, explains the underlying theory, and discusses potential applications. The individual chapters are designed to be tutorial in nature with specific emphasis on an in-depth treatment of state of the art techniques.

Blind Speech Separation 2007th Edition is divided into three parts:

Part 1 presents overdetermined or critically determined BSS. Here the main technology is independent component analysis (ICA). ICA is a statistical method for extracting mutually independent sources from their mixtures. This approach utilizes spatial diversity to discriminate between desired and undesired components, i.e., it reduces the undesired components by forming a spatial null towards them. It is, in fact, a blind adaptive beamformer realized by unsupervised adaptive filtering.

Part 2 addresses underdetermined BSS, where there are fewer microphones than source signals. Here, the sparseness of speech sources is very useful; we can utilize time-frequency diversity, where sources are active in different regions of the time-frequency plane.

Part 3 presents monaural BSS where there is only one microphone. Here, we can separate a mixture by using the harmonicity and temporal structure of the sources. We can build a probabilistic framework by assuming a source model, and separate a mixture by maximizing the a posteriori probability of the sources.

About the Author

Dr. Shoji Makino is an IEEE Fellow, Associate Editor of the IEEE Transactions on Speech & Audio Processing, and Executive Manager NTT Communication Science Laboratories. Dr. Makino was also co-editor on the succesful 2005 Springer book: Benesty – Speech Enhancement.

View on Amazon

电子书代发PDF格式价格30我要求助
未经允许不得转载:Wow! eBook » Blind Speech Separation 2007th Edition