-
METEOR
Monday, November 16th, 2009
The METEOR metric is designed to address some of the deficiencies inherent in the BLEU metric. The metric is based on the weighted harmonic mean of unigram precision and unigram recall. The metric was designed after research by Lavie (2004) into the significance of recall in evaluation metrics. Their research showed [...]
-
NIST
Tuesday, April 28th, 2009
The NIST metric is based on the BLEU metric, but with some alterations. Where BLEU simply calculates n-gram precision adding equal weight to each one, NIST also calculates how informative a particular n-gram is. That is to say when a correct n-gram is found, the rarer that n-gram is, the more weight [...]
-
Automatic evaluation
Thursday, March 12th, 2009
In the context of this article, a metric will be understood as a measurement. A metric for the evaluation of machine translation output is a measurement of the quality of the output. The quality of a translation is inherently subjective, there is no objective or quantifiable “good”. Therefore, the task for [...]















































