The BLEU sausage
◀ Prev | 2025-12-29, access: $ Basic | Next ▶
translation evaluation text tools Every paper has an "evaluation" table showing how the paper's new idea gives greater numbers than previous work in the same domain; but where do those numbers actually come from? Here we look at BLEU, a classic measurement for evaluating the quality of machine translation.
