This paper introduces a new technique of multi-microphone processing which aims to provide featuresfor the extraction of fundamental frequency and for the classification of voiced/unvoiced segments in distant-talking speech.A multichannel periodicity function (MPF) is derived from an adaptive weighting of normalized and compressed magnitude spectra.This function highlights periodic clues of the given speech signals, even under noisy and reverberant conditions.The resulting MPF features are then exploited for voiced/unvoiced classification based on Hidden Markov Models.Experiments, conducted both on simulated data and on real seminar recordings based on a network of reversed T-shaped arrays,showed the robustness of the proposed technique.
Adaptive weighting of microphone arrays for distant-talking F0 and voiced/unvoiced estimation
Flego, Federico;Zieger, Christian;Omologo, Maurizio
2007-01-01
Abstract
This paper introduces a new technique of multi-microphone processing which aims to provide featuresfor the extraction of fundamental frequency and for the classification of voiced/unvoiced segments in distant-talking speech.A multichannel periodicity function (MPF) is derived from an adaptive weighting of normalized and compressed magnitude spectra.This function highlights periodic clues of the given speech signals, even under noisy and reverberant conditions.The resulting MPF features are then exploited for voiced/unvoiced classification based on Hidden Markov Models.Experiments, conducted both on simulated data and on real seminar recordings based on a network of reversed T-shaped arrays,showed the robustness of the proposed technique.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.