In this paper we present the multilingual VoIP (Voice over Internet Protocol networks) corpora collected for the second showcase of the NESPOLE! project in the tourism and medical domains. The corpora comprise over 20 hours of human-to-human monolingual dialogues in English, French, German and Italian: 66 dialogues in the tourism domain and 49 in the medical domain. We describe in detail the data collection (technical set-up, scenarios for each domain, recording procedure and data transcription), as well as statistically illustrated corpora and preliminary data analysis
The NESPOLE! VoIP Multilingual Corpora in Tourism and Medical Domains
Mana, Nadia;Cattoni, Roldano;
2003-01-01
Abstract
In this paper we present the multilingual VoIP (Voice over Internet Protocol networks) corpora collected for the second showcase of the NESPOLE! project in the tourism and medical domains. The corpora comprise over 20 hours of human-to-human monolingual dialogues in English, French, German and Italian: 66 dialogues in the tourism domain and 49 in the medical domain. We describe in detail the data collection (technical set-up, scenarios for each domain, recording procedure and data transcription), as well as statistically illustrated corpora and preliminary data analysisFile in questo prodotto:
Non ci sono file associati a questo prodotto.
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.