This paper describes a context annotation language, called IF (Interchange Format), which is being used for coding dialogues in spontaneous speech and as information exchange protocol within the speech-to-speech translation systems of the international C-STAR consortium. A characteristic of IF is that it results from an effort to approximate the balance point between the conflicting requirements of high expressive power and reduced formal complexity, so as to preserve the possibility of good quality translation while pursuing robustness of the processors. IF captures all the pieces of information which are necessary for a conversation to go on successfully. Therefore it may be used as interlingua, since it need bot be supplemented with extra annotations or paired with other types of representation, and as content annotation for dialogue databases, since some fields in its labels easily provide keywords. IF labels consist of four fields containing an indication of the speaker role in the dialogue, the speech act, a list of domain concepts describing the informational focus of the encoded fragment and a list of attribute-value pairs carrying more specific information

A content annotation language for spoken dialogues

1999-01-01

Abstract

This paper describes a context annotation language, called IF (Interchange Format), which is being used for coding dialogues in spontaneous speech and as information exchange protocol within the speech-to-speech translation systems of the international C-STAR consortium. A characteristic of IF is that it results from an effort to approximate the balance point between the conflicting requirements of high expressive power and reduced formal complexity, so as to preserve the possibility of good quality translation while pursuing robustness of the processors. IF captures all the pieces of information which are necessary for a conversation to go on successfully. Therefore it may be used as interlingua, since it need bot be supplemented with extra annotations or paired with other types of representation, and as content annotation for dialogue databases, since some fields in its labels easily provide keywords. IF labels consist of four fields containing an indication of the speaker role in the dialogue, the speech act, a list of domain concepts describing the informational focus of the encoded fragment and a list of attribute-value pairs carrying more specific information
1999
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/1828
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact