JST_LOGO.JPG NICT_LOGO.JPG KYOTO-U_LOGO.JPG

WAT

The Workshop on Asian Translation
Automatic Evaluation Procedures
for English Translation Results

[AUTOMATIC EVALUATION PROCEDURES TOP] | [SETUP] | [SEGMENTATION] | [EVALUATION]

Setup

Here, ${TASK} representes the source and target language pair, and "ja-en" can be applied.
We assume that there are ASPEC reference files (${TASK}.txt) in tests.org/, translation result files (${TASK}.txt) in results.org/ and all tools in ${path}.
TASK=ja-en
SCRIPT=${path}/script.segmentation.distribution
MOSES_SCRIPT=${path}/mosesdecoder-RELEASE-2.1.1/scripts
RIBES=${path}/RIBES-1.02.4
mkdir tests.tok
mkdir results.tok
Back to top

Segmentation

  • Moses tokenizer
  • for file in tests results; do
      cat ${file}.org/${TASK}.txt | \
       perl ${SCRIPT}/z2h-utf8.pl | \
       ${MOSE_SCRIPT}/tokenizer/tokenizer.perl -l en | \
       > ${file}.tok/${TASK}.moses.txt
    done

    Back to top

    Evaluation

  • BLEU
  • perl ${MOSES_SCRIPT}/generic/multi-bleu.perl tests.tok/${TASK}.moses.txt < results.tok/${TASK}.moses.txt

  • RIBES
  • python3 ${RIBES}/RIBES.py -c -r tests.tok/${TASK}.moses.txt results.tok/${TASK}.moses.txt

    Back to top

    JST (Japan Science and Technology Agency)
    NICT (National Institute of Information and Communications Technology)
    Kyoto University
    Last Modified: 2015-03-20