Beyond Shared Vocabulary: Increasing Representational Word Similarities across Languages for Multilingual Machine Translation

Di Wu; Christof Monz

doi:10.18653/v1/2023.emnlp-main.605

Abstract

Using a shared vocabulary is common practice in Multilingual Neural Machine Translation (MNMT). In addition to its simple design, shared tokens play an important role in positive knowledge transfer, which manifests naturally when the shared tokens refer to similar meanings across languages. However, when word overlap is small, e.g., when languages use different writing systems, transfer is inhibited. In this paper, we propose a re-parameterized method for building embeddings to alleviate this problem. More specifically, we define word-level information transfer pathways via word equivalence classes and rely on graph networks to fuse word embeddings across languages. Our experiments demonstrate the advantages of our approach: 1) the semantics of embeddings are better aligned across languages, 2) our method achieves evident BLEU improvements on high- and low-resource MNMT, and 3) less than 1.0% additional trainable parameters are required, with a limited increase in computational cost, while the inference time is identical to that of the baselines.
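
The idea of fusing embeddings along word equivalence pathways can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single hop of unweighted mean aggregation over a graph whose edges link equivalent words, whereas the paper relies on graph networks with trainable parameters. The names `build_adjacency` and `fuse_embeddings`, the toy vocabulary, and the equivalence pair are all hypothetical.

```python
from collections import defaultdict


def build_adjacency(vocab_size, equivalent_pairs):
    """Build symmetric neighbor sets (with self-loops) from pairs of token ids."""
    neighbors = defaultdict(set)
    for i in range(vocab_size):
        neighbors[i].add(i)  # self-loop: each word keeps part of its own embedding
    for a, b in equivalent_pairs:
        neighbors[a].add(b)
        neighbors[b].add(a)
    return neighbors


def fuse_embeddings(embeddings, neighbors):
    """One hop of mean aggregation over each word's neighbors (assumed, simplified)."""
    dim = len(embeddings[0])
    fused = []
    for i in range(len(embeddings)):
        group = sorted(neighbors[i])
        fused.append(
            [sum(embeddings[j][d] for j in group) / len(group) for d in range(dim)]
        )
    return fused


# Toy vocabulary: ids 0 and 1 stand for a hypothetical pair of equivalent words
# in two languages; id 2 has no equivalent and is left unchanged.
embeddings = [[1.0, 0.0], [0.0, 1.0], [4.0, 4.0]]
neighbors = build_adjacency(3, [(0, 1)])
fused = fuse_embeddings(embeddings, neighbors)
```

In this toy setting the two linked words end up with the same fused vector, while the unlinked word keeps its original embedding. Because the fused table can be computed once before decoding, a scheme of this kind is compatible with the abstract's statement that inference time matches the baselines.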

Anthology ID: 2023.emnlp-main.605
Volume: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month: December
Year: 2023
Address: Singapore
Editors: Houda Bouamor, Juan Pino, Kalika Bali
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 9749–9764
URL: https://aclanthology.org/2023.emnlp-main.605
DOI: 10.18653/v1/2023.emnlp-main.605
Cite (ACL): Di Wu and Christof Monz. 2023. Beyond Shared Vocabulary: Increasing Representational Word Similarities across Languages for Multilingual Machine Translation. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 9749–9764, Singapore. Association for Computational Linguistics.
Cite (Informal): Beyond Shared Vocabulary: Increasing Representational Word Similarities across Languages for Multilingual Machine Translation (Wu & Monz, EMNLP 2023)
PDF: https://aclanthology.org/2023.emnlp-main.605.pdf
Video: https://aclanthology.org/2023.emnlp-main.605.mp4
