Climbing Mount BLEU: The Strange World of Reachable High-BLEU Translations
2016 (English)In: Baltic Journal of Modern Computing, ISSN 2255-8942, E-ISSN 2255-8950, Vol. 4, no 2, 269-281 p.Article in journal (Refereed) PublishedText
We present a method for finding oracle BLEU translations in phrase-based statistical machine translation using exact document-level scores. Experiments are presented where the BLEU score of a candidate translation is directly optimised in order to examine the properties of reachable translations with very high BLEU scores. This is achieved by running the document-level decoder Docent in BLEU-decoding mode, where proposed changes to the translation of a document are only accepted if they increase BLEU. The results confirm that the reference translation cannot in most cases be reached by the decoder, which is limited by the set of phrases in the phrase table, and demonstrate that high-BLEU translations are often of poor quality.
Place, publisher, year, edition, pages
2016. Vol. 4, no 2, 269-281 p.
Statistical machine translation, oracle decoding, BLEU, Docent
Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:uu:diva-299620ISI: 000377706600016OAI: oai:DiVA.org:uu-299620DiVA: diva2:949838