Improving Transformer-Based Neural Machine Translation with Prior Alignments

Complexity 2021:1-10 (2021)

Abstract

The transformer is a neural machine translation model that has revolutionized machine translation. Compared with traditional statistical machine translation models and other neural machine translation models, the recently proposed transformer model fundamentally changes machine translation through its self-attention and cross-attention mechanisms, which effectively model token alignments between source and target sentences. It has been reported that the transformer model provides accurate posterior alignments. In this work, we empirically demonstrate the reverse effect: prior alignments help transformer models produce better translations. Experimental results on a Vietnamese-English news translation task show that manually annotated alignments have a positive effect on transformer models and, more surprisingly, that statistically constructed alignments, reinforced with the flexibility of token-type selection, outperform manual alignments in improving transformer models. Specifically, statistically constructed word-to-lemma alignments are used to train a word-to-word transformer model. This novel hybrid transformer model improves on the baseline transformer model and on the transformer model trained with manual alignments by 2.53 and 0.79 BLEU, respectively. In addition to BLEU scores, we conduct a limited human evaluation of the translation results. A strong correlation between human and machine judgments confirms our findings.

Similar books and articles

Artificial Intelligence: Machine Translation Accuracy in Translating French-Indonesian Culinary Texts. Hasyim Muhammad - 2021 - International Journal of Advanced Computer Science and Applications 12 (3):186-191.
Machine vs. Human Translation. Elona Limaj - 2014 - Journal of Turkish Studies 9 (Volume 9 Issue 6):783-783.
Understanding from Machine Learning Models. Emily Sullivan - 2022 - British Journal for the Philosophy of Science 73 (1):109-133.

Analytics

Added to PP
2021-05-09



Author Profiles

Thien Le Le Nguyen
Universität Osnabrück
H. Nguyen
Northwestern University
