AI

Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs

1 Mins read

Translating text that contains entity names is a challenging task, as cultural-related references can vary significantly across languages. These variations may also be caused by transcreation, an adaptation process that entails more than transliteration and word-for-word translation. In this paper, we address the problem of cross-cultural translation on two fronts: (i) we introduce XC-Translate, the first large-scale, manually-created benchmark for machine translation that focuses on text that contains potentially culturally-nuanced entity names, and (ii) we propose KG-MT, a novel end-to-end method to integrate information from a multilingual knowledge graph into a neural machine translation model by leveraging a dense retrieval mechanism. Our experiments and analyses show that current machine translation systems and large language models still struggle to translate texts containing entity names, whereas KG-MT outperforms state-of-the-art approaches by a large margin, obtaining a 129% and 62% relative improvement compared to NLLB-200 and GPT-4, respectively.


Source link

Related posts
AI

Google AI Releases Gemma 3: Lightweight Multimodal Open Models for Efficient and On‑Device AI

3 Mins read
In the field of artificial intelligence, two persistent challenges remain. Many advanced language models require significant computational resources, which limits their use…
AI

Google’s new open model based on Gemini 2.0

3 Mins read
For a deeper dive into the technical details behind these capabilities, as well as a comprehensive overview of our approach to responsible…
AI

A Step by Step Guide to Build an Interactive Health Data Monitoring Tool Using Hugging Face Transformers and Open Source Model Bio_ClinicalBERT

4 Mins read
In this tutorial, we will learn how to build an interactive health data monitoring tool using Hugging Face’s transformer models, Google Colab,…

 

 

Leave a Reply

Your email address will not be published. Required fields are marked *