TITLE: Energy-based multi-speaker voice activity detection with an ad hoc microphone array
AUTHORS: Alexander Bertrand and Marc Moonen
ABSTRACT:
In this paper, we propose an energy-based technique to track the power of multiple simultaneous speakers using an ad hoc microphone array with unknown microphone positions. By considering the short-term power of the microphone signals, the problem can be converted into a non-negative blind source separation (NBSS) problem. By exploiting the prior knowledge that the source signals are non-negative and well-grounded, very efficient algorithms can be used to solve this NBSS problem, based only on second order statistics. We provide simulation results that demonstrate the effectiveness of the presented algorithm.
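As a rough illustration of the first step named in the abstract, the sketch below computes the short-term power of each microphone signal over non-overlapping frames. If the sources are uncorrelated, these frame powers are approximately a non-negative mixture of the source frame powers, which is what suggests an NBSS formulation. The function name `short_term_power`, the frame length, the gain matrix, and the synthetic signals are all illustrative assumptions and are not taken from the paper. The instantaneous, attenuation-only mixing is also a simplification that ignores reverberation.

```python
import numpy as np

def short_term_power(signals, frame_len):
    """Mean squared amplitude per non-overlapping frame.

    signals: array of shape (n_channels, n_samples)
    returns: array of shape (n_channels, n_frames), entries >= 0
    """
    signals = np.asarray(signals, dtype=float)
    n_channels, n_samples = signals.shape
    n_frames = n_samples // frame_len
    # Drop trailing samples that do not fill a whole frame.
    trimmed = signals[:, :n_frames * frame_len]
    frames = trimmed.reshape(n_channels, n_frames, frame_len)
    return np.mean(frames ** 2, axis=2)

# Synthetic data (hypothetical): 2 sources, 3 microphones.
rng = np.random.default_rng(0)
n_samples, frame_len = 16000, 400
sources = rng.standard_normal((2, n_samples))
sources[0, 12000:] = 0.0   # source 0 is silent in the last quarter
sources[1, :4000] = 0.0    # source 1 is silent in the first quarter

# Assumed attenuation from each source (column) to each microphone (row).
gains = np.array([[1.0, 0.3],
                  [0.5, 0.8],
                  [0.2, 1.0]])
mics = gains @ sources

P_mic = short_term_power(mics, frame_len)     # shape (3, 40)
P_src = short_term_power(sources, frame_len)  # shape (2, 40)

# For uncorrelated sources the cross-terms average out, so the
# microphone powers are roughly a non-negative mix of source powers.
P_model = (gains ** 2) @ P_src
```

In this toy setup `P_mic` and `P_model` agree exactly in frames where only one source is active and approximately where both are active; recovering `P_src` from `P_mic` alone, without knowing `gains`, is the NBSS problem the abstract refers to.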
STATUS: Published in Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Dallas, Texas, USA, March 2010, pp. 85-88.
REPORT NUMBER: 09-186
Errata
The paper as published by IEEE contains an error in formula (6): the numerator and denominator must be swapped. This error is corrected in the version that is downloadable on this page.