NORMA eResearch @NCI Library

Investigating Active Learning in Interactive Neural Machine Translation

Gupta, Kamal Kumar, Boppana, Dhanvanth, Haque, Rejwanul, Ekbal, Asif and Bhattacharyya, Pushpak (2021) Investigating Active Learning in Interactive Neural Machine Translation. In: Proceedings of Machine Translation Summit XVIII: Research Track. Association for Machine Translation in the Americas, pp. 10-22.

Full text not available from this repository.
Official URL: https://aclanthology.org/2021.mtsummit-research.2/

Abstract

Interactive-predictive translation is a collaborative iterative process, where human translators produce translations with the help of machine translation (MT) systems interactively. Various sampling techniques in active learning (AL) exist to update the neural MT (NMT) model in the interactive-predictive scenario. In this paper, we explore term based (named entity count (NEC)) and quality based (quality estimation (QE), sentence similarity (Sim)) sampling techniques – which are used to find the ideal candidates from the incoming data – for human supervision and MT model’s weight updation. We carried out experiments with three language pairs, viz. German-English, Spanish-English and Hindi-English. Our proposed sampling technique yields 1.82, 0.77 and 0.81 BLEU points improvements for German-English, Spanish-English and Hindi-English, respectively, over random sampling based baseline. It also improves the present state-of-the-art by 0.35 and 0.12 BLEU points for German-English and Spanish-English, respectively. Human editing effort in terms of number-of-words-changed also improves by 5 and 4 points for German-English and Spanish-English, respectively, compared to the state-of-the-art.

Item Type: Book Section
Additional Information: Affiliation of Haque, Rejwanul should be School of Computing, National College of Ireland, Dublin, Ireland instead of ADAPT Centre, School of Computing, Dublin City University, Dublin, Ireland as mentioned in the paper.
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
P Language and Literature > P Philology. Linguistics > Language Services
Divisions: School of Computing > Staff Research and Publications
Depositing User: Clara Chan
Date Deposited: 07 Sep 2021 16:29
Last Modified: 26 Jan 2022 17:10
URI: https://norma.ncirl.ie/id/eprint/5015

Actions (login required)

View Item View Item