NORMA eResearch @NCI Library

Investigating Active Learning in Interactive Neural Machine Translation

Gupta, Kamal Kumar, Boppana, Dhanvanth, Haque, Rejwanul, Ekbal, Asif and Bhattacharyya, Pushpak (2021) Investigating Active Learning in Interactive Neural Machine Translation. In: 18th Biennial Machine Translation Summit, 16-20 August 2021, Virtual.

Full text not available from this repository.
Official URL: https://aclanthology.org/2021.mtsummit-research.2/

Abstract

Interactive-predictive translation is a collaborative, iterative process in which human translators produce translations interactively with the help of machine translation (MT) systems. Various sampling techniques in active learning (AL) exist to update the neural MT (NMT) model in the interactive-predictive scenario. In this paper, we explore term-based (named entity count (NEC)) and quality-based (quality estimation (QE), sentence similarity (Sim)) sampling techniques, which are used to find the ideal candidates from the incoming data for human supervision and for updating the MT model's weights. We carried out experiments with three language pairs, viz. German-English, Spanish-English and Hindi-English. Our proposed sampling technique yields improvements of 1.82, 0.77 and 0.81 BLEU points for German-English, Spanish-English and Hindi-English, respectively, over a random-sampling baseline. It also improves on the present state-of-the-art by 0.35 and 0.12 BLEU points for German-English and Spanish-English, respectively. Human editing effort, measured as the number of words changed, also improves by 5 and 4 points for German-English and Spanish-English, respectively, compared to the state-of-the-art.
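
The sampling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`select_for_supervision`, `random_baseline`, `capitalized_token_count`) are hypothetical, the capitalized-token heuristic is only a crude stand-in for a real named entity count, and the assumption that higher-scoring sentences are the ones sent for human supervision is ours, not something the abstract states.

```python
import random
from typing import Callable, List, Sequence


def select_for_supervision(
    batch: Sequence[str], score: Callable[[str], float], k: int
) -> List[str]:
    """Return the k highest-scoring source sentences from an incoming batch.

    Assumption: a higher score marks a better candidate for human
    supervision. A real criterion (NEC, QE, Sim) may rank differently.
    """
    ranked = sorted(batch, key=score, reverse=True)  # stable sort keeps ties in input order
    return ranked[:k]


def random_baseline(batch: Sequence[str], k: int, seed: int = 0) -> List[str]:
    """Random-sampling baseline: pick k sentences uniformly at random."""
    rng = random.Random(seed)
    return rng.sample(list(batch), k)


def capitalized_token_count(sentence: str) -> int:
    """Hypothetical stand-in for a named entity count.

    Counts capitalized tokens after the first one; a real system would
    use a named entity recognizer instead.
    """
    tokens = sentence.split()
    return sum(1 for token in tokens[1:] if token[:1].isupper())


if __name__ == "__main__":
    incoming = [
        "The cat sat on the mat.",
        "Angela Merkel visited Paris and Berlin.",
        "We met Asha in Mumbai.",
    ]
    # Sentences chosen here would go to the human translator, and the
    # corrected translations would then be used to update the NMT model.
    print(select_for_supervision(incoming, capitalized_token_count, k=2))
    print(random_baseline(incoming, k=2))
```

Swapping `capitalized_token_count` for a quality-estimation or sentence-similarity scorer changes the criterion without changing the selection logic, which is why the score is passed in as a function.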

Item Type: Conference or Workshop Item (Paper)
Additional Information: The affiliation of Haque, Rejwanul should be School of Computing, National College of Ireland, Dublin, Ireland, not ADAPT Centre, School of Computing, Dublin City University, Dublin, Ireland, as stated in the paper.
Subjects: Q Science > QA Mathematics > Electronic computers. Computer science
T Technology > T Technology (General) > Information Technology > Electronic computers. Computer science
P Language and Literature > P Philology. Linguistics > Language Services
Divisions: School of Computing > Staff Research and Publications
Depositing User: Clara Chan
Date Deposited: 07 Sep 2021 16:29
Last Modified: 08 Sep 2021 08:53
URI: http://norma.ncirl.ie/id/eprint/5015
