News Monitor (NeMo) is an environment in which the Human Language Technology research unit at FBK brings together its technologies pertaining to Automatic Speech Recognition (ASR), Machine Translation (MT) and Natural Language Processing (NLP). In this view it is a dynamic framework where we can share ideas and technologies, refine algorithms, see and discuss performance and errors of our algorithms that are daily applied on fresh data. In this paper we describe a framework in which a set of parallel news streams in different languages are automatically transcribed and translated. The architecture of the system utilizes modules that perform ASR, MT and NLP. The development of the various modules relies upon a continuous acquisition activity of parallel data (both audio and texts) in different languages. In particular, the availability of large corpora of aligned multi-lingual text/audio data has allowed to implement unsupervised Acoustic Model (AM) training approaches.

NeMo: a Platform for Multilingual News Monitoring

Girardi, Christian;Gretter, Roberto;Falavigna, Giuseppe Daniele;Brugnara, Fabio;Giuliani, Diego;Federico, Marcello
2011-01-01

Abstract

News Monitor (NeMo) is an environment in which the Human Language Technology research unit at FBK brings together its technologies pertaining to Automatic Speech Recognition (ASR), Machine Translation (MT) and Natural Language Processing (NLP). In this view it is a dynamic framework where we can share ideas and technologies, refine algorithms, see and discuss performance and errors of our algorithms that are daily applied on fresh data. In this paper we describe a framework in which a set of parallel news streams in different languages are automatically transcribed and translated. The architecture of the system utilizes modules that perform ASR, MT and NLP. The development of the various modules relies upon a continuous acquisition activity of parallel data (both audio and texts) in different languages. In particular, the availability of large corpora of aligned multi-lingual text/audio data has allowed to implement unsupervised Acoustic Model (AM) training approaches.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/54012
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact