This work reports on preliminary activity at ITC-irst on the problem of acoustic segmentation, classification and clustering of an Italian audio broadcast news corpus. The approach is based on the following stages. First, the input data stream is segmented by detecting spectral changes through the Bayesian Information Criterion (BIC). Second, segments are classified in terms of acoustic conditions, modeled by mixtures of Gaussians. Finally, segments from the same speakers are clustered, by using again the BIC. The scheme proposed for the automatic segmentation, classification and clustering causes a degradation of the recognition error rate, with respect to the fully supervisioned experiment, equal to 1.3% before adaptation, and 3.4% after adaptation.
Segmentation, Classification and Clustering of an Italian Broadcast News Corpus
Cettolo, Mauro
2000-01-01
Abstract
This work reports on preliminary activity at ITC-irst on the problem of acoustic segmentation, classification and clustering of an Italian audio broadcast news corpus. The approach is based on the following stages. First, the input data stream is segmented by detecting spectral changes through the Bayesian Information Criterion (BIC). Second, segments are classified in terms of acoustic conditions, modeled by mixtures of Gaussians. Finally, segments from the same speakers are clustered, by using again the BIC. The scheme proposed for the automatic segmentation, classification and clustering causes a degradation of the recognition error rate, with respect to the fully supervisioned experiment, equal to 1.3% before adaptation, and 3.4% after adaptation.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.