Weighted Autocorrelation-Based F0 Estimation for Distant-Talking Interaction with a Distributed Microphone Network
Omologo, Maurizio
2004-01-01
Abstract
A distant-talking scenario is addressed, in which a distributed microphone network provides multi-channel input sequences to be processed for speaker modeling purposes. Related applications are possible in noisy and reverberant environments with one or more speakers. The paper investigates the use of a multi-channel version of a Weighted Autocorrelation (WAUTOC)-based F0 estimation method, with the purpose of deriving a common excitation model. Experiments conducted on a real database show the advantages and robustness of the proposed method in extracting the fundamental frequency regardless of microphone position, talker position, and head orientation.
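The abstract does not spell out the WAUTOC formulation or how the channels are combined, so the following is only a minimal sketch under stated assumptions. It uses one commonly described form of weighted autocorrelation, in which the autocorrelation at each lag is divided by the average magnitude difference function (AMDF) plus a small constant `k`, and it assumes that the per-channel functions are simply summed before peak picking. The function names, the parameter `k`, and the F0 search range are hypothetical choices for illustration and may differ from the method used in the paper.

```python
import math


def weighted_autocorrelation(frame, min_lag, max_lag, k=1.0):
    """Return {lag: acf(lag) / (amdf(lag) + k)} for one frame of samples.

    Assumed formulation: autocorrelation weighted by the inverse AMDF.
    """
    n = len(frame)
    if n <= max_lag:
        raise ValueError("frame must be longer than max_lag")
    scores = {}
    for lag in range(min_lag, max_lag + 1):
        span = n - lag
        # Autocorrelation: large when the frame matches itself shifted by lag.
        acf = sum(frame[i] * frame[i + lag] for i in range(span)) / span
        # AMDF: small when the frame matches itself shifted by lag.
        amdf = sum(abs(frame[i] - frame[i + lag]) for i in range(span)) / span
        scores[lag] = acf / (amdf + k)
    return scores


def estimate_f0_multichannel(frames, sample_rate, f0_min=60.0, f0_max=400.0, k=1.0):
    """Estimate one F0 (Hz) from time-aligned frames of several microphones.

    Assumption: per-channel weighted autocorrelations are summed, then the
    lag with the highest combined score is taken as the pitch period.
    """
    min_lag = int(sample_rate / f0_max)
    max_lag = int(sample_rate / f0_min)
    combined = {}
    for frame in frames:
        channel_scores = weighted_autocorrelation(frame, min_lag, max_lag, k)
        for lag, score in channel_scores.items():
            combined[lag] = combined.get(lag, 0.0) + score
    best_lag = max(combined, key=combined.get)
    return sample_rate / best_lag


if __name__ == "__main__":
    # Synthetic 100 Hz tone seen by two microphones with different
    # delay and attenuation (illustrative data, not from the paper).
    sr = 8000
    mic_a = [math.sin(2 * math.pi * 100 * i / sr) for i in range(400)]
    mic_b = [0.5 * math.sin(2 * math.pi * 100 * (i - 13) / sr) for i in range(400)]
    print(estimate_f0_multichannel([mic_a, mic_b], sr))
```

Because the pitch period is the same at every microphone while delay and attenuation differ, summing lag-domain functions does not require the channels to be phase-aligned, which is one plausible reason for combining them in this domain. A real system would also need a voicing decision and frame-by-frame tracking, which this sketch omits.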