We describe effort to improve machine translation of item titles found in a large e-commerce inventory through topic modeling and adaptation. Item titles are short texts which typically contain brand names that do not have to be translated, and item attributes whose translation often depends on the context. Both issues call for robust methods to integrate context infor- mation in the machine translation process in order to reduce translation ambiguity. We survey both existing topic adaptation approaches and propose novel methods that augment the standard phrase-table models with sparse features and dense features measuring the topic match between each phrase-pair and the input text. We report extensive experiments on the translation of item titles from English into Brazilian Portuguese, and show the impact of topic adaptation both with and without domain adaptation.

Topic Adaptation for Machine Translation of E-Commerce Content

Mathur, Prashant;Federico, Marcello;
2015-01-01

Abstract

We describe effort to improve machine translation of item titles found in a large e-commerce inventory through topic modeling and adaptation. Item titles are short texts which typically contain brand names that do not have to be translated, and item attributes whose translation often depends on the context. Both issues call for robust methods to integrate context infor- mation in the machine translation process in order to reduce translation ambiguity. We survey both existing topic adaptation approaches and propose novel methods that augment the standard phrase-table models with sparse features and dense features measuring the topic match between each phrase-pair and the input text. We report extensive experiments on the translation of item titles from English into Brazilian Portuguese, and show the impact of topic adaptation both with and without domain adaptation.
File in questo prodotto:
File Dimensione Formato  
MTS-2015-Mathur.pdf

accesso aperto

Tipologia: Documento in Post-print
Licenza: Dominio pubblico
Dimensione 656.9 kB
Formato Adobe PDF
656.9 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/309035
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact